GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

pretrain

Website
Wikipedia
https://static.github-zh.com/github_avatars/brightmart?size=40
brightmart / nlp_chinese_corpus

#自然语言处理#大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP

chinese-datasetchinese-corpuspretrainword2vec自然语言处理bertlanguage-modelWikinewsquestion-answering中文corpuschinese-nlpdatasettext-classification
9.73 k
1 年前
keyu-tian/SparK
https://static.github-zh.com/github_avatars/keyu-tian?size=40
keyu-tian / SparK

#计算机科学#[ICLR'23 Spotlight🔥] The first successful BERT/MAE-style pretraining on any convolutional network; Pytorch impl. of "Designing BERT for Convolutional Networks: Sparse and Hierarchical Masked Modeling...

bertconvnetconvolutional-neural-networksmasked-image-modelingpre-trained-modelself-supervised-learningsparse-convolutionTLS (Transport Layer Security)cnniclr深度学习object-detectionPyTorchinstance-segmentationmask-rcnnpretrainpretraining
Python 1.35 k
1 年前
https://static.github-zh.com/github_avatars/CLUEbenchmark?size=40
CLUEbenchmark / CLUECorpus2020

#自然语言处理#Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料

中文chinese-corpus数据集pretraincorpus自然语言处理bertrobertaalbert
966
3 年前
https://static.github-zh.com/github_avatars/yangjianxin1?size=40
yangjianxin1 / Firefly-LLaMA2-Chinese

#大语言模型#Firefly中文LLaMA-2大模型,支持增量预训练Baichuan2、Llama2、Llama、Falcon、Qwen、Baichuan、InternLM、Bloom等大模型

fireflyllamallama-2llama2大语言模型baichuanbaichuan-13bbloomchatglmfalconinternlmlorapretrainqloraqwen
Python 411
2 年前
https://static.github-zh.com/github_avatars/microsoft?size=40
microsoft / UniVL

An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"

multimodalitypretrainingcaptionpretrainVideoLocalization (l10n)segmentationcoinalignment
Python 356
1 年前
https://static.github-zh.com/github_avatars/open-sciencelab?size=40
open-sciencelab / GraphGen

#大语言模型#GraphGen: Enhancing Supervised Fine-Tuning for LLMs with Knowledge-Driven Synthetic Data Generation

ai4sciencedata-generationllm-trainingpretrainqaqwensftknowledge-graph大语言模型
Python 195
11 天前
https://static.github-zh.com/github_avatars/xcfcode?size=40
xcfcode / What-I-Have-Read

#博客#Paper Lists, Notes and Slides, Focus on NLP. For summarization, please refer to https://github.com/xcfcode/Summarization-Papers

自然语言处理summarizationaclaaainaaclslidespresentationgnnknowledge-distillationpretrainGenerative Adversarial Networknon-autoregressivegenerationgraph-neural-networksnotespresentationsdata-augmentationmeta-learningconversation
165
3 年前
https://static.github-zh.com/github_avatars/THUNLP-AIPoet?size=40
THUNLP-AIPoet / BERT-CCPoem

BERT-CCPoem is an BERT-based pre-trained model particularly for Chinese classical poetry

poetrybertpretrain
Python 154
3 年前
https://static.github-zh.com/github_avatars/thunlp?size=40
thunlp / RE-Context-or-Names

Bert-based models(BERT, MTB, CP) for relation extraction.

relation-extractionPyTorchbertcontrastive-learningpretrain
Python 103
3 年前
https://static.github-zh.com/github_avatars/huzongxiang?size=40
huzongxiang / MatDGL

#计算机科学#MatDGL is a neural network package that allows researchers to train custom models for crystal modeling tasks. It aims to accelerate the research and application of material science.

机器学习深度学习neural-networksgraphtransformermassagepassingTensorflowmaterialspretrain
Python 52
1 年前
https://static.github-zh.com/github_avatars/CoinCheung?size=40
CoinCheung / MFM

code for paper "Masked Frequency Modeling for Self-Supervised Visual Pre-Training" (https://arxiv.org/pdf/2206.07706.pdf)

pretrainself-supervised-learningfftTLS (Transport Layer Security)frequency
Python 24
2 年前
https://static.github-zh.com/github_avatars/SalesforceAIResearch?size=40
SalesforceAIResearch / pretrain-time-series-cloudops

#时序数据库#Official code repository for the paper "Pushing the Limits of Pre-training for Time Series Forecasting in the CloudOps Domain"

forecastingpretraintime-series
Python 23
5 个月前
https://static.github-zh.com/github_avatars/nancheng58?size=40
nancheng58 / SSL4SR

[CCIR 2023] Self-supervised learning for Sequential Recommender Systems

recommender-systempretrainsequential-recommendationbaselinerecommendationself-supervised-learning
Python 22
2 年前
https://static.github-zh.com/github_avatars/bayartsogt-ya?size=40
bayartsogt-ya / albert-mongolian

ALBERT trained on Mongolian text corpus

albertpretrained-modelpretrainlanguage-modeltransformers
Jupyter Notebook 18
4 年前
https://static.github-zh.com/github_avatars/KennethanCeyer?size=40
KennethanCeyer / diy-generative-ai-lm

#自然语言处理#Make your Generative AI LM model from the scratch (Including pretraining / SFT with LoRA)

colabgenaigenerativeai大语言模型lmlora自然语言处理pretrainsfttorchtransformer
Python 16
4 个月前
https://static.github-zh.com/github_avatars/yongzhuo?size=40
yongzhuo / MacroGPT-Pretrain

#大语言模型#macrogpt大模型全量预训练(1b3,32层), 多卡deepspeed/单卡adafactor

deepspeedgpt大语言模型macromicropretrain
Python 14
2 年前
https://static.github-zh.com/github_avatars/janelu9?size=40
janelu9 / EasyLLM

Running Large Language Model easily.

deepspeedfine-tuningpretrainllamanpuqwendeepseekrlhf
Python 9
6 天前
https://static.github-zh.com/github_avatars/mrzjy?size=40
mrzjy / hoyo_public_wiki_parser

#自然语言处理#Parsing Hoyoverse game text corpus from public wikipedia

corpusGenshin Impacthonkai-star-railhoyoverse大语言模型mihoyo自然语言处理pretrainconversationdialoguegameWiki
Python 9
10 个月前
https://static.github-zh.com/github_avatars/pskliff?size=40
pskliff / vtb-data-fusion

#自然语言处理#This repository provides code solution for Data Fusion Contest task 1

自然语言处理bertfine-tuningpretrainclassificationretailhuggingface
Jupyter Notebook 8
4 年前
https://static.github-zh.com/github_avatars/arrrrrmin?size=40
arrrrrmin / albert-guide

#自然语言处理#Understanding "A Lite BERT". An Transformer approach for learning self-supervised Language Models.

language-modelingpretrainingguidepretrain自然语言处理
Python 7
2 年前
loading...