GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

qwen2-5

Website
Wikipedia
https://static.github-zh.com/github_avatars/yusufcanb?size=40
yusufcanb / tlm

#大语言模型#Local CLI Copilot, powered by Ollama. 💻🦙

大语言模型BashPowerShellllama3Zshdeepseek-r1qwen2-5phi4
Go 1.45 k
6 个月前
https://static.github-zh.com/github_avatars/2U1?size=40
2U1 / Qwen2-VL-Finetune

An open-source implementaion for fine-tuning Qwen2-VL and Qwen2.5-VL series by Alibaba Cloud.

聊天机器人multimodalqwen2-vlvision-languagevision-language-modelqwen2-5
Python 1.15 k
2 天前
https://static.github-zh.com/github_avatars/zjunlp?size=40
zjunlp / OmniThink

#自然语言处理#[EMNLP 2025] OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking

人工智能generationgptinformation-seekingknowledge-augmented-generationlarge-language-models自然语言处理qwenretrieval-augmented-generationreport-generationdeepseek-r1deepseek-v3gpt4oqwen2-5
Python 457
22 天前
https://static.github-zh.com/github_avatars/sshh12?size=40
sshh12 / llm_backdoor

Experimental tools to backdoor large language models by re-writing their system prompts at a raw parameter level. This allows you to potentially execute offline remote code execution without running a...

backdoor-attacksllm-securityqwen2-5
Python 184
5 个月前
https://static.github-zh.com/github_avatars/beehive-lab?size=40
beehive-lab / GPULlama3.java

#大语言模型#GPU-accelerated Llama3.java inference in pure Java using TornadoVM.

compilersgpuJavallama3Nvidiagguf大语言模型mistralmistral-7bdeepseek-r1qwen2-5qwen3
Java 167
1 天前
https://static.github-zh.com/github_avatars/harleyszhang?size=40
harleyszhang / lite_llama

#大语言模型#A light llama-like llm inference framework based on the triton kernel.

llamallama3大语言模型llm-inferencePythonattentionqwen2-5
Python 150
1 个月前
https://static.github-zh.com/github_avatars/aws-samples?size=40
aws-samples / easy-model-deployer

Deploy open-source LLMs on AWS in minutes — with OpenAI-compatible APIs and a powerful CLI/SDK toolkit.

deepseekdeepseek-r1ecshuggingfaceinternlm2langchain大语言模型ollamaopenai-compatible-apiqwen2-5sagemakervllmqwq-32bgemma3qwen3deepseek-v3gpt-oss
Python 72
19 天前
https://static.github-zh.com/github_avatars/HenryNdubuaku?size=40
HenryNdubuaku / super-lazy-autograd

#大语言模型#Hand-derived memory-efficient super lazy PyTorch VJPs for training LLMs on laptop, all using one op (bundled scaled matmuls).

人工智能fine-tuningfinetuninghuggingface大语言模型PyTorchqwen2-5transformer
Python 54
5 个月前
https://static.github-zh.com/github_avatars/rbiswasfc?size=40
rbiswasfc / eedi-mining-misconceptions

#自然语言处理#1st Place Solution for Eedi - Mining Misconceptions in Mathematics Kaggle Competition

大语言模型qwen2-5rerankerretriever自然语言处理
Python 51
9 个月前
https://static.github-zh.com/github_avatars/arafkarsh?size=40
arafkarsh / ms-springboot-ai

Java 23, SpringBoot 3.4.1 Examples using Deep Learning 4 Java & LangChain4J for Generative AI using ChatGPT LLM, RAG and other open source LLMs. Sentiment Analysis, Application Context based ChatBots....

chatgpt3chatgpt4generative-aillama2geminiragretrieval-augmented-generationfalcongemmamistral-7blangchainlangchain4jollamaclaude-3gemini-aiconvolutional-neural-networkmultilayer-perceptron-networkrecurrent-neural-networkqwen2-5
Java 33
8 个月前
https://static.github-zh.com/github_avatars/sgl-project?size=40
sgl-project / awesome-sglang

#大语言模型#Make SGLang go brrr

deepseekdeepseek-r1deepseek-v3gpt-ossllama3llama3-1llama4qwen2qwen2-5qwen3omesglangKubernetes大语言模型
29
3 天前
https://static.github-zh.com/github_avatars/WebDevCaptain?size=40
WebDevCaptain / agno-ai-agents

Exploring Agno framework for building AI agents.

ai-agentsdeepseek-r1ollamaopenaiqwen2-5
Python 25
6 个月前
https://static.github-zh.com/github_avatars/Younis-Ahmed?size=40
Younis-Ahmed / qwen-ai-provider

Community-built Qwen AI Provider for Vercel AI SDK - Integrate Alibaba Cloud's Qwen models with Vercel's AI application framework

人工智能vercel-aivercel-ai-sdkqwenqwen2-5qwen2-vlgenerative-aiVercelalibaba-cloudlanguage-model
TypeScript 23
7 天前
https://static.github-zh.com/github_avatars/albertstarfield?size=40
albertstarfield / project-zephyrine

#大语言模型#Project Zephyrine: Your personal experimental glass cockpit for the world of ideas. Let's take flight with a modern, locally-run automaton, using accelerated thought to navigate the both digital aethe...

ChatGPTggmlggufagentic-aiopenai-apirealtimevulkan-apideepseek-r1qwen2-5chatgpt-appflux
HTML 22
6 天前
https://static.github-zh.com/github_avatars/husaynirfan1?size=40
husaynirfan1 / simple-rag

#大语言模型#Simple RAG system.

milvusvectorvector-databasedeepseek-r1llama3-1大语言模型Pythonqwen2-5nltk
Python 19
5 个月前
https://static.github-zh.com/github_avatars/hiroshi-nagaya?size=40
hiroshi-nagaya / Virtual_Try_Off

#计算机科学#Get Clothes from image

深度学习huggingface-transformersimage-generationimage-to-imagemaskPythonqwen2-5segmentationstable-diffusione-commerce
Python 19
2 个月前
https://static.github-zh.com/github_avatars/DaoyuanLi2816?size=40
DaoyuanLi2816 / Kaggle-Eedi-Mining-Misconceptions-in-Mathematics-Silver-Medal

#自然语言处理#Silver Medal Solution for the Kaggle Competition: Eedi - Mining Misconceptions in Mathematics

kaggle-competition大语言模型自然语言处理qwen2-5
Python 19
8 个月前
https://static.github-zh.com/github_avatars/khaidq97?size=40
khaidq97 / SimpleChatbot

#大语言模型#Models: Deepseek R1 models, Llama3.2, Qwen2.5. Integrations: Ollama, Gradio. Supports Local LLM. Test and deploy the latest LLM models in the fastest and most efficient way

聊天机器人deepseekdeepseek-r1大语言模型locallocal-llmollamaqwen2-5
Python 16
7 个月前
https://static.github-zh.com/github_avatars/zli12321?size=40
zli12321 / free-form-grpo

grpo to train long form QA and instructions with long-form reward model

evaluation-frameworkgrpoqwen2-5reinforcement-learning-algorithms
Python 15
2 个月前
https://static.github-zh.com/github_avatars/ictnlp?size=40
ictnlp / FastLongSpeech

FastLongSpeech is a novel framework designed to extend the capabilities of Large Speech-Language Models for efficient long-speech processing without necessitating dedicated long-speech training data.

large-language-modelsllm-training大语言模型multi-modalspeechspeech-processingspeech-recognitionspeech-to-textqwenqwen2-5
Python 10
2 个月前
loading...