GitHub 中文社区
回车: Github搜索    Shift+回车: Google搜索
论坛
排行榜
趋势
登录

©2025 GitHub中文社区论坛GitHub官网网站地图GitHub官方翻译

  • X iconGitHub on X
  • Facebook iconGitHub on Facebook
  • Linkedin iconGitHub on LinkedIn
  • YouTube iconGitHub on YouTube
  • Twitch iconGitHub on Twitch
  • TikTok iconGitHub on TikTok
  • GitHub markGitHub’s organization on GitHub
集合主题趋势排行榜
#

spatial-intelligence

Website
Wikipedia
https://static.github-zh.com/github_avatars/manycore-research?size=40
manycore-research / SpatialLM

SpatialLM: Training Large Language Models for Structured Indoor Modeling

scene-understandingspatial-intelligencemllmpoint-clouds
Python 3.93 k
7 天前
https://static.github-zh.com/github_avatars/InternRobotics?size=40
InternRobotics / Aether

[ICCV 2025] Aether: Geometric-Aware Unified World Modeling

embodied-aifoundation-modelsmulti-modalnavigationspatial-intelligencevideo-generationvideo-predictionworld-model
Python 464
2 个月前
https://static.github-zh.com/github_avatars/diankun-wu?size=40
diankun-wu / Spatial-MLLM

#大语言模型#Official implementation of Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence

大语言模型multimodalmultimodal-large-language-modelsspatial-intelligenceaigc
Python 338
3 个月前
https://static.github-zh.com/github_avatars/UMass-Embodied-AGI?size=40
UMass-Embodied-AGI / 3D-Mem

[CVPR 2025] Source codes for the paper "3D-Mem: 3D Scene Memory for Embodied Exploration and Reasoning"

人工智能机器视觉embodied-aispatial-intelligence
Python 172
3 个月前
https://static.github-zh.com/github_avatars/HaoyiZhu?size=40
HaoyiZhu / SPA

[ICLR 2025] SPA: 3D Spatial-Awareness Enables Effective Embodied Representation

embodied-airepresentation-learningrobot-learningspatial-intelligence
Python 165
3 个月前
https://static.github-zh.com/github_avatars/keshik6?size=40
keshik6 / HourVideo

[NeurIPS 2024] Official code for HourVideo: 1-Hour Video Language Understanding

gemini-progpt-4multimodal-large-language-modelsnavigationperceptionsummarizationreasoningevalsspatial-intelligence
Jupyter Notebook 153
2 个月前
https://static.github-zh.com/github_avatars/UMass-Embodied-AGI?size=40
UMass-Embodied-AGI / Mirage

Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens (arXiv 2025)

reasoningspatial-intelligencevlm
Python 151
1 个月前
https://static.github-zh.com/github_avatars/zju3dv?size=40
zju3dv / StarGen

[CVPR 2025] Code for "StarGen: A Spatiotemporal Autoregression Framework with Video Diffusion Model for Scalable and Controllable Scene Generation".

3d-aigcgen-aispatial-intelligence
117
8 个月前
https://static.github-zh.com/github_avatars/UMass-Embodied-AGI?size=40
UMass-Embodied-AGI / MindJourney

Source codes for the paper "MindJourney: Test-Time Scaling with World Models for Spatial Reasoning"

3Dembodied-aispatial-intelligencevision-language-modelworld-models
Python 82
1 个月前
https://static.github-zh.com/github_avatars/worldbench?size=40
worldbench / survey

#Awesome#3D and 4D World Modeling: A Survey

embodied-aivideo-generationworld-models3d-generationspatial-intelligenceAwesome Lists
42
4 天前
https://static.github-zh.com/github_avatars/SOTAMak1r?size=40
SOTAMak1r / GST

[ICLR 2025] Where Am I and What Will I See : An Auto-Regressive Model for Spatial Localization and View Prediction

generative-modelnovel-view-synthesispose-estimationspatial-intelligence
Python 39
1 个月前
https://static.github-zh.com/github_avatars/yaak-ai?size=40
yaak-ai / rbyte

#数据仓库#Multimodal datasets for spatial intelligence

人工智能PyTorchRoboticsspatial-intelligence数据集机器学习rerunpolars
Python 35
6 天前
https://static.github-zh.com/github_avatars/pi3det?size=40
pi3det / toolkit

[ICCV 2025] Perspective-Invariant 3D Object Detection

3d-object-detectionautonomous-drivingdroneRoboticsspatial-intelligence3d-scene-understandingembodied-ailidar-point-cloudmultimodal
30
1 个月前
https://static.github-zh.com/github_avatars/lidarcrafter?size=40
lidarcrafter / toolkit

This is the official implementation of "LiDARCrafter: Dynamic 4D World Modeling from LiDAR Sequences"

aigcautonomous-drivinggenerative-ailidarspatial-intelligenceworld-models3d-generationscene-understanding3d-object-detection
Python 26
1 个月前
https://static.github-zh.com/github_avatars/miladfa7?size=40
miladfa7 / SpatialLM-Gradio

#大语言模型#"Gradio" Interface for SpatialLM Model | A 3D Large Language Model for Structured Scene Understanding, Processing Point Cloud Data from Monocular Videos, RGBD Images, and LiDAR.

3d-object-detectiongradio大语言模型point-cloudsscene-understandingspatial-intelligence
Python 11
5 个月前
https://static.github-zh.com/github_avatars/jagennath-hari?size=40
jagennath-hari / SpatialFusion-LM

SpatialFusion-LM is a real-time spatial reasoning framework that combines neural depth, 3D reconstruction, and language-driven scene understanding.

机器视觉depth-estimationfoundation-modelsmllmpoint-cloudsRoboticsscene-understandingspatial-intelligencestereo-visionzero-shot-learningvision-transformervision-language-modeltransformer
Python 8
4 个月前
https://static.github-zh.com/github_avatars/HarryYancy?size=40
HarryYancy / SolidGeo

#计算机科学#SolidGeo: Measuring Multimodal Spatial Math Reasoning in Solid Geometry

large-language-modelslarge-multimodal-models机器学习数学sciencespatial-intelligencevisual-question-answering
Python 5
1 个月前
https://static.github-zh.com/github_avatars/worldbench?size=40
worldbench / toolkit

Benchmarking 3D and 4D World Models in the Real World

3d-generationaigcembodied-aispatial-intelligencevideo-generationworld-models
2
3 个月前
https://static.github-zh.com/github_avatars/nidhiyashwanth?size=40
nidhiyashwanth / SpatialLM

Trying out SpatialLM (SpatialLM: Large Language Model for Spatial Understanding). Impressed with results 💖

mllmpoint-cloudsscene-understandingspatial-intelligence
Jupyter Notebook 1
5 个月前
https://static.github-zh.com/github_avatars/Yash2378?size=40
Yash2378 / dynamic-4d-vision-reconstruction

Dynamic 4D Vision Reconstruction - Reconstructing dynamic 4D scenes (3D geometry + time) from 2D videos using NeRF and 3D Gaussian Splatting. Progresses from static 3D reconstruction to real-time dyna...

3d-gaussian-splatting机器视觉CUDAembodied-ainerfneural-radiance-fieldsneural-renderingPyTorchRoboticsspatial-intelligence
Jupyter Notebook 0
4 个月前
loading...