#Computer Science# Implementations of 59 deep learning papers with detailed annotations, including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, ...), GANs (cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
#NLP# Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch
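Assuming this entry is lucidrains' vit-pytorch, classification takes only a few lines; a minimal sketch with illustrative hyperparameters:

```python
import torch
from vit_pytorch import ViT

# Constructor arguments follow the vit-pytorch README; the values here are illustrative.
v = ViT(
    image_size=256,   # input resolution; images are cut into patches
    patch_size=32,    # (256 / 32)^2 = 64 patches per image
    num_classes=1000,
    dim=1024,         # token embedding dimension
    depth=6,          # number of transformer encoder blocks
    heads=16,         # attention heads per block
    mlp_dim=2048,     # hidden size of the feed-forward sublayer
)

img = torch.randn(1, 3, 256, 256)
logits = v(img)  # shape (1, 1000): one logit per class
```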
#NLP# Haystack is an open-source NLP framework built on pretrained Transformer models. It helps developers quickly build production-grade NLP applications for semantic search, question answering, summarization, and document ranking.
#LLM# 🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
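A minimal sketch of the usual PEFT workflow, attaching a LoRA adapter to a Hugging Face causal LM (the base model and hyperparameters are illustrative choices):

```python
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model

base = AutoModelForCausalLM.from_pretrained("facebook/opt-350m")  # illustrative base model

config = LoraConfig(
    r=8,                                  # rank of the low-rank update matrices
    lora_alpha=16,                        # scaling factor applied to the update
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt (OPT naming)
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, config)
model.print_trainable_parameters()  # only the small adapter matrices require gradients
```

Training then proceeds with any standard loop or `transformers.Trainer`; the frozen base weights are shared, so saved adapters stay small on disk.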
#LLM# Machine Learning Engineering Open Book
#LLM# RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it combines the best of RNN and transformer.
Ongoing research on training transformer models at scale
#Search# PaddleNLP 2.0 is the core text-domain library of the PaddlePaddle ecosystem. It features easy-to-use text-domain APIs, application examples for many scenarios, and high-performance distributed training, aiming to boost developer productivity on text tasks and to provide best practices for NLP built on the PaddlePaddle 2.0 core framework.
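Assuming this entry is PaddleNLP, its `Taskflow` API exposes ready-made pipelines; a minimal sketch (the task name and input are illustrative):

```python
from paddlenlp import Taskflow

# Load a ready-made sentiment pipeline (downloads a pretrained model on first use).
senta = Taskflow("sentiment_analysis")
print(senta("这个产品用起来真的很流畅"))  # returns label and confidence score
```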
#Search# All-in-one embedding database for semantic search, LLM orchestration, and language model workflows.
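If this entry is txtai, a minimal semantic-search sketch (the embedding model and sample data are illustrative):

```python
from txtai.embeddings import Embeddings

# Embeddings index backed by a sentence-transformers model (illustrative path).
embeddings = Embeddings({"path": "sentence-transformers/all-MiniLM-L6-v2"})

data = [
    "US tops 5 million confirmed virus cases",
    "Beijing mobilises invasion craft along coast",
    "Maine man wins $1M from $25 lottery ticket",
]

# Index (id, text, tags) tuples, then run a natural-language query.
embeddings.index([(i, text, None) for i, text in enumerate(data)])
print(embeddings.search("feel good story", 1))  # [(best matching id, similarity score)]
```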
Semantic segmentation models with 500+ pretrained convolutional and transformer-based backbones.
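This appears to be segmentation_models.pytorch; a minimal sketch assuming its `smp.Unet` constructor (architecture, encoder, and shapes are illustrative):

```python
import torch
import segmentation_models_pytorch as smp

# U-Net decoder on top of a pretrained ResNet-34 encoder (choices are illustrative).
model = smp.Unet(
    encoder_name="resnet34",
    encoder_weights="imagenet",  # ImageNet-pretrained backbone
    in_channels=3,
    classes=1,                   # single-channel (binary) mask
)

mask_logits = model(torch.randn(1, 3, 256, 256))  # shape (1, 1, 256, 256)
```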
#Computer Science# A PyTorch-based Speech Toolkit
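If the toolkit is SpeechBrain, pretrained pipelines load in a few lines; a sketch assuming its `EncoderDecoderASR` interface (the model source is illustrative, and older releases import it from `speechbrain.pretrained`):

```python
from speechbrain.inference import EncoderDecoderASR

asr = EncoderDecoderASR.from_hparams(
    source="speechbrain/asr-crdnn-rnnlm-librispeech",  # illustrative pretrained model
    savedir="pretrained_models/asr-crdnn-rnnlm-librispeech",
)
print(asr.transcribe_file("example.wav"))  # transcribe a local audio file
```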
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
#LLM# Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, DeepSeek, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., a local PC with iGPU and NPU, or a discrete GPU).
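Assuming this entry is ipex-llm, its README-style pattern is a drop-in replacement for the Transformers loader with low-bit weights; a sketch (the model id is illustrative, and exact flags may vary by release):

```python
from ipex_llm.transformers import AutoModelForCausalLM  # drop-in for transformers' loader
from transformers import AutoTokenizer

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-chat-hf",  # illustrative model id
    load_in_4bit=True,                # quantize weights to 4-bit for cheap inference
).to("xpu")                           # place the model on an Intel GPU

tokenizer = AutoTokenizer.from_pretrained("meta-llama/Llama-2-7b-chat-hf")
inputs = tokenizer("What is RLHF?", return_tensors="pt").to("xpu")
print(tokenizer.decode(model.generate(**inputs, max_new_tokens=32)[0]))
```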
#Computer Science# Implementation of RLHF (Reinforcement Learning from Human Feedback) on top of the PaLM architecture. Basically ChatGPT, but with PaLM
#NLP# BertViz: Visualize Attention in NLP Models (BERT, GPT2, BART, etc.)
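A minimal sketch of BertViz in a Jupyter notebook (the checkpoint and sentence are illustrative):

```python
from transformers import AutoTokenizer, AutoModel
from bertviz import head_view

name = "bert-base-uncased"  # illustrative checkpoint
tokenizer = AutoTokenizer.from_pretrained(name)
model = AutoModel.from_pretrained(name, output_attentions=True)

inputs = tokenizer("The cat sat on the mat", return_tensors="pt")
attention = model(**inputs).attentions  # tuple of per-layer attention weights
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])

head_view(attention, tokens)  # renders an interactive attention view in the notebook
```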
An easy-to-use, scalable, and high-performance RLHF framework based on Ray (PPO, GRPO, REINFORCE++, vLLM, dynamic sampling, async agentic RL)
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
#NLP# Leveraging BERT and c-TF-IDF to create easily interpretable topics.
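Assuming this entry is BERTopic, a minimal sketch of fitting topics on a public corpus (the dataset and subset size are illustrative):

```python
from sklearn.datasets import fetch_20newsgroups
from bertopic import BERTopic

docs = fetch_20newsgroups(subset="all", remove=("headers", "footers", "quotes"))["data"][:1000]

# Embeds documents, clusters them, then labels clusters with c-TF-IDF keywords.
topic_model = BERTopic()
topics, probs = topic_model.fit_transform(docs)
print(topic_model.get_topic_info().head())  # topic sizes and top keywords
```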