GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
kyegomez

kyegomez / SparseAttention

星标86
复刻3


问题 官网
 
Loading

该仓库已收录但尚未编辑。项目介绍及使用教程请前往 GitHub 阅读 README


0 条讨论

登录后发表评论

关于

Pytorch Implementation of the sparse attention from the paper: "Generating Long Sequences with Sparse Transformers"

discord.gg
人工智能attention-is-all-you-needattention-mechanismattention-mechanisms机器学习sparse-matrix
创建时间

2023-10-25

是否国产

否

  修改时间

2025-08-12T01:28:44Z

Readme
相关推荐

语言

  • Python82.5%
  • Makefile17.5%

kyegomez 的其他开源项目

kyegomez/swarms
swarms
@kyegomez

#大语言模型#The Enterprise-Grade Production-Ready Multi-Agent Orchestration Framework. Website: https://swarms.ai

人工智能attention-mechanismgpt4langchain机器学习
Python5.23 k
19 小时前
tree-of-thoughts
@kyegomez

#大语言模型#Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%

人工智能ChatGPTgpt4multimodalprompt-engineering
Python4.54 k
1 个月前
BitNet
@kyegomez

#计算机科学#Implementation of "BitNet: Scaling 1-bit Transformers for Large Language Models" in pytorch

人工智能深度神经网络深度学习gpt4机器学习
Python1.87 k
5 天前
Open-AF3
@kyegomez

Implementation of Alpha Fold 3 from the paper: "Accurate structure prediction of biomolecular interactions with AlphaFold3" in PyTorch

人工智能alphafoldbio机器学习geneml
Python793
4 天前

您可能感兴趣的

TripoSR
@VAST-AI-Research

TripoSR: Fast 3D Object Reconstruction from a Single Image

Python5.72 k
1 年前
OpenAI
sparse_attention
OpenAI@openai

Examples of using sparse attention, as in "Generating Long Sequences with Sparse Transformers"

Python1.59 k
5 年前
beamformers
@Enny1991

Easy to use Beamformers for multi-channel speech separation/enhancement

Python194
5 年前
roformer
@ZhuiyiTechnology

Rotary Transformer

Python1.01 k
3 年前
diffusion_distiller
@Hramchenko

🚀 PyTorch Implementation of "Progressive Distillation for Fast Sampling of Diffusion Models(v-diffusion)"

denoising-diffusionimage-generation
Python248
3 年前
NVIDIA Corporation
Megatron-LM
NVIDIA Corporation@NVIDIA

Ongoing research training transformer models at scale

large-language-modelsmodel-paratransformers
Python13.55 k
2 天前
OpenAI
grok
OpenAI@openai

Python4.18 k1
1 年前
neuropsychology/NeuroKit
NeuroKit
@neuropsychology

NeuroKit2: The Python Toolbox for Neurophysiological Signal Processing

PythonedaEEG
Python1.91 k
16 天前
TransformerTranslation
@moon-hotel

A Transformer Framework Based Translation Task

transformer
Python152
3 个月前
Hugging Face
optimum-nvidia
Hugging Face@huggingface

Python944
7 个月前
devika
@stitionai

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. D...

Python19.4 k
1 年前
grok-1
@xai-org

大模型Grok-1开源

Python50.5 k
1 年前
llm-math-education
@DigitalHarborFoundation

Retrieval augmented generation for middle-school math question answering and hint generation.

教学question-answeringretrieval-augmented-generationlanguage-models
Jupyter Notebook39
7 个月前
Open-Sora-Plan
@PKU-YuanGroup

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python12.02 k
2 个月前
OpenHands
@All-Hands-AI

#大语言模型#🙌 OpenHands: Code Less, Make More

agent人工智能大语言模型ChatGPTclaude-ai
Python63.42 k
20 分钟前
Vlogger
@Vchitect

[CVPR2024] Make Your Dream A Vlog

Python427
4 个月前
Phil Wang
local-attention
Phil Wang@lucidrains

#计算机科学#An implementation of local windowed attention for language modeling

人工智能attention-mechanisms深度学习
Python475
2 个月前