#Large Language Models#RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN a...
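A minimal sketch of why such a model can be both an RNN and parallel-trainable: a linear recurrence over the sequence can be evaluated step by step (constant state at inference) or as one masked matrix product over all tokens at once (training). This is an illustrative linear-attention-style update, not RWKV's actual formula; all names and shapes are ours.

```python
import numpy as np

# Illustrative linear recurrence: state S_t = S_{t-1} + k_t v_t^T,
# output o_t = S_t^T q_t. Not RWKV's exact update rule.

def recurrent_mode(q, k, v):
    """Sequential RNN-style evaluation: one fixed-size state per step."""
    T, d = q.shape
    S = np.zeros((d, d))
    out = np.zeros((T, d))
    for t in range(T):
        S = S + np.outer(k[t], v[t])   # fold token t into the running state
        out[t] = S.T @ q[t]            # read the state with the current query
    return out

def parallel_mode(q, k, v):
    """Training-style evaluation: one matmul with a causal mask, no loop."""
    T = q.shape[0]
    scores = q @ k.T                   # (T, T) pairwise query-key products
    causal = np.tril(np.ones((T, T))) # token t may only attend to tokens <= t
    return (scores * causal) @ v

rng = np.random.default_rng(0)
q, k, v = (rng.standard_normal((5, 4)) for _ in range(3))
assert np.allclose(recurrent_mode(q, k, v), parallel_mode(q, k, v))
```

Both modes compute the same outputs; the recurrent form needs only O(1) memory per token at inference, while the parallel form keeps GPUs busy during training.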
#Computer Science#Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
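A short usage sketch of loralib's documented entry points (`lora.Linear`, `mark_only_lora_as_trainable`, `lora_state_dict`); the two-layer model itself is a made-up example, not from the repo.

```python
import torch
import torch.nn as nn
import loralib as lora

# Hypothetical model; lora.Linear drops in for nn.Linear and learns a
# rank-r update B@A on top of the frozen pretrained weight W.
model = nn.Sequential(
    lora.Linear(512, 512, r=16),  # r is the rank of the low-rank update
    nn.ReLU(),
    lora.Linear(512, 10, r=16),
)

# Freeze everything except the low-rank A/B matrices before training.
lora.mark_only_lora_as_trainable(model)

# ... training loop would go here ...

# Checkpoint only the LoRA parameters (a small fraction of the full model).
torch.save(lora.lora_state_dict(model), "lora_ckpt.pt")
```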
This repository contains demos I made with the Transformers library by HuggingFace.
AI Code Completions
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An...
An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
#Natural Language Processing#Chinese version of GPT2 training code, using the BERT tokenizer.
#Natural Language Processing#Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pre-trained models, large models, multimodal models, and large language models.
#Natural Language Processing#An unnecessarily tiny implementation of GPT-2 in NumPy.
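To give the flavor of such a pure-NumPy implementation, here is a self-contained causal self-attention head in the GPT-2 style; the function names and shapes are ours, not necessarily the repo's.

```python
import numpy as np

def softmax(x):
    # Numerically stable softmax over the last axis.
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)

def causal_self_attention(x, w_qkv, w_out):
    """One GPT-2-style attention head in plain NumPy.

    x: (T, d) token embeddings; w_qkv: (d, 3d); w_out: (d, d).
    """
    T, d = x.shape
    q, k, v = np.split(x @ w_qkv, 3, axis=-1)     # project to Q, K, V
    scores = q @ k.T / np.sqrt(d)                 # scaled dot products
    mask = np.triu(np.ones((T, T)), k=1) * -1e10  # hide future tokens
    return softmax(scores + mask) @ v @ w_out     # weighted sum of values

rng = np.random.default_rng(0)
d = 8
x = rng.standard_normal((5, d))
out = causal_self_attention(x, rng.standard_normal((d, 3 * d)),
                            rng.standard_normal((d, d)))
print(out.shape)  # (5, 8)
```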
#Natural Language Processing#Open Source Pre-training Model Framework in PyTorch & Pre-trained Model Zoo
#Natural Language Processing#GPT2 for Chinese chitchat: a GPT-2 model for casual Chinese conversation (implements the MMI idea from DialoGPT).
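A sketch of the MMI (maximum mutual information) reranking idea this repo borrows from DialoGPT: sample several candidate replies from the forward model, then pick the one a backward model scores as best "explaining" the dialogue history. `forward_sample` and `backward_logprob` are assumed interfaces for illustration, not the repo's actual functions.

```python
def mmi_rerank(history, forward_sample, backward_logprob, n_candidates=5):
    """Rerank forward-model samples by a backward model's score.

    forward_sample(history) -> one candidate reply string
    backward_logprob(reply, history) -> log P_backward(history | reply)
    """
    candidates = [forward_sample(history) for _ in range(n_candidates)]
    # Maximizing log P(history | reply) penalizes bland replies that
    # would fit any context equally well.
    return max(candidates, key=lambda r: backward_logprob(r, history))
```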
#Natural Language Processing#Rust-native, ready-to-use NLP pipelines and transformer-based models (BERT, DistilBERT, GPT2,...)
#Large Language Models#Build, customize, and control your own LLMs. From data pre-processing to fine-tuning, xTuring provides an easy way to personalize open-source LLMs. Join our Discord community: https://discord.gg/TgHXuSJ...
#Computer Science#Large-scale pretraining for dialogue
#Natural Language Processing#Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/
#Natural Language Processing#Kashgari is a production-level NLP transfer-learning framework built on top of tf.keras for text labeling and text classification; it includes Word2Vec, BERT, and GPT2 language embeddings.
#Large Language Models#Simple UI for LLM fine-tuning
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-trained dialogue models
#Computer Science#Guide to using pre-trained large language models of source code