#

multimodel

https://static.github-zh.com/github_avatars/lonePatient?size=40
Python 5.41 k
21 天前
https://static.github-zh.com/github_avatars/SkyworkAI?size=40

DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordi...

JavaScript 2.75 k
11 天前
https://static.github-zh.com/github_avatars/Atomic-man007?size=40

#自然语言处理#Awesome_Multimodel is a curated GitHub repository that provides a comprehensive collection of resources for Multimodal Large Language Models (MLLM). It covers datasets, tuning techniques, in-context l...

342
7 个月前
https://static.github-zh.com/github_avatars/thomas-yanxin?size=40

🧘🏻‍♂️KarmaVLM (相生):A family of high efficiency and powerful visual language model.

Python 88
1 年前
https://static.github-zh.com/github_avatars/chengxuanying?size=40

This is our solution for KDD Cup 2020. We implemented a very neat and simple neural ranking model based on siamese BERT which ranked first among the solo teams and ranked 12th among all teams on the f...

Jupyter Notebook 71
5 年前
https://static.github-zh.com/github_avatars/PINTO0309?size=40
Python 58
3 年前
https://static.github-zh.com/github_avatars/Ankur2606?size=40
Jupyter Notebook 24
7 个月前
https://static.github-zh.com/github_avatars/Overcautious?size=40
Python 17
3 年前
https://static.github-zh.com/github_avatars/arangodb?size=40

ArangoGraph is the easiest way to run ArangoDB. Available on AWS and Google Cloud.

14
2 年前
https://static.github-zh.com/github_avatars/robinlau1981?size=40

Robust particle filter based on dynamic averaging of multiple noise models

MATLAB 9
6 年前
https://static.github-zh.com/github_avatars/Kind-Unes?size=40

This project is a multi-modal model that works with multiple models combined and accepts audio, images, and text as inputs, generating corresponding audio, images, and text outputs.

Python 9
2 年前
https://static.github-zh.com/github_avatars/Ajax0564?size=40

VyomAI: state-of-the-art NLP LLM Vision MultiModel transformers implementation into Pytorch

Python 5
20 天前
https://static.github-zh.com/github_avatars/AdritPal08?size=40

The Pictionary app uses LLaMA 3.1 to generate random drawing prompts and LLaMA 3.2 Vision to predict and judge user drawings based on these prompts. It provides an interactive and fun way to test your...

Python 4
1 年前
https://static.github-zh.com/github_avatars/Jianglin954?size=40

Papers on the topic of multimodal learning with graphs

3
1 年前
loading...
Website
Wikipedia