GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

long-context

Website
Wikipedia
https://static.github-zh.com/github_avatars/InternLM?size=40
InternLM / InternLM

#大语言模型#Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

聊天机器人gpt大语言模型long-contextrlhffine-tuning-llm中文flash-attentionpretrained-models
Python 6.94 k
4 个月前
https://static.github-zh.com/github_avatars/dvlab-research?size=40
dvlab-research / LongLoRA

#大语言模型#Code and documents of LongLoRA and LongAlpaca (ICLR 2024 Oral)

fine-tuning-llmlarge-language-modelslong-context大语言模型lora
Python 2.66 k
10 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / LongWriter

#大语言模型#[ICLR 2025] LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs

fine-tuning大语言模型long-context
Python 1.66 k
8 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / LongBench

#大语言模型#LongBench v2 and LongBench (ACL 2024)

benchmark大语言模型long-context
Python 893
5 个月前
https://static.github-zh.com/github_avatars/haoliuhl?size=40
haoliuhl / ringattention

Large Context Attention

large-language-modelslong-contextmemory-efficienttransformers
Python 714
5 个月前
https://static.github-zh.com/github_avatars/lucidrains?size=40
lucidrains / MEGABYTE-pytorch

#计算机科学#Implementation of MEGABYTE, Predicting Million-byte Sequences with Multiscale Transformers, in Pytorch

人工智能深度学习learned-tokenizationattention-mechanismslong-contexttransformers
Python 643
6 个月前
https://static.github-zh.com/github_avatars/lucidrains?size=40
lucidrains / ring-attention-pytorch

Implementation of 💍 Ring Attention, from Liu et al. at Berkeley AI, in Pytorch

attention-mechanismlong-context
Python 517
1 个月前
https://static.github-zh.com/github_avatars/NVIDIA?size=40
NVIDIA / kvpress

#大语言模型#LLM KV cache compression made easy

大语言模型inferencekv-cachelong-contextPythonPyTorchtransformerslarge-language-models
Python 499
6 天前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / LongCite

#大语言模型#LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA

benchmarkfine-tuning大语言模型long-context
Python 495
6 个月前
https://static.github-zh.com/github_avatars/lucidrains?size=40
lucidrains / recurrent-memory-transformer-pytorch

#计算机科学#Implementation of Recurrent Memory Transformer, Neurips 2022 paper, in Pytorch

人工智能attention-mechanisms深度学习transformerslong-contextmemoryrecurrence
Python 409
5 个月前
https://static.github-zh.com/github_avatars/thunlp?size=40
thunlp / InfLLM

#大语言模型#The code of our paper "InfLLM: Unveiling the Intrinsic Capacity of LLMs for Understanding Extremely Long Sequences with Training-Free Memory"

large-language-models大语言模型long-context
Python 359
1 年前
https://static.github-zh.com/github_avatars/OpenBMB?size=40
OpenBMB / InfiniteBench

Codes for the paper "∞Bench: Extending Long Context Evaluation Beyond 100K Tokens": https://arxiv.org/abs/2402.13718

benchmarklarge-language-modelslong-context
Python 329
9 个月前
https://static.github-zh.com/github_avatars/dingo-actual?size=40
dingo-actual / infini-transformer

#计算机科学#PyTorch implementation of Infini-Transformer from "Leave No Context Behind: Efficient Infinite Context Transformers with Infini-attention" (https://arxiv.org/abs/2404.07143)

attention-mechanism深度学习long-contextPyTorchtransformers
Python 289
1 年前
https://static.github-zh.com/github_avatars/VITA-MLLM?size=40
VITA-MLLM / Long-VITA

✨✨Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy

long-contextmllmvision-language-model
Python 285
1 个月前
https://static.github-zh.com/github_avatars/Infini-AI-Lab?size=40
Infini-AI-Lab / TriForce

#大语言模型#[COLM 2024] TriForce: Lossless Acceleration of Long Sequence Generation with Hierarchical Speculative Decoding

acceleration大语言模型long-contextspeculative-decodingllm-inferenceefficiencyinference
Python 254
10 个月前
https://static.github-zh.com/github_avatars/THUDM?size=40
THUDM / LongAlign

#大语言模型#[EMNLP 2024] LongAlign: A Recipe for Long Context Alignment of LLMs

alignment大语言模型long-context
Python 250
6 个月前
https://static.github-zh.com/github_avatars/metame-ai?size=40
metame-ai / awesome-llm-plaza

#大语言模型#awesome llm plaza: daily tracking all sorts of awesome topics of llm, e.g. llm for coding, robotics, reasoning, multimod etc.

Awesome Lists大语言模型long-contextllm-application
200
6 天前
https://static.github-zh.com/github_avatars/nightdessert?size=40
nightdessert / Retrieval_Head

open-source code for paper: Retrieval Head Mechanistically Explains Long-Context Factuality

large-language-modelslong-context
Python 196
10 个月前
https://static.github-zh.com/github_avatars/ByteDance-Seed?size=40
ByteDance-Seed / ShadowKV

[ICML 2025 Spotlight] ShadowKV: KV Cache in Shadows for High-Throughput Long-Context LLM Inference

long-contextlow-rankllm-inferenceresearchhigh-throughput
Python 195
1 个月前
https://static.github-zh.com/github_avatars/bigai-nlco?size=40
bigai-nlco / LooGLE

#大语言模型#ACL 2024 | LooGLE: Long Context Evaluation for Long-Context Language Models

large-language-modelslong-context大语言模型
Python 184
8 个月前
loading...