mathematical-reasoning · GitHub Topics

#大语言模型#ToRA is a series of Tool-integrated Reasoning LLM Agents designed to solve challenging mathematical reasoning problems by interacting with tools [ICLR'24].

autonomous-agents language-model 大语言模型 mathematical-reasoning tool-learning

Python 1.08 k

1 年前

lupantech / dl4math

#计算机科学#Resources of deep learning for mathematical reasoning (DL4MATH).

深度学习机器学习 mathematical-reasoning natural-language-procressing papers

359

2 年前

HKUNLP / diffusion-of-thoughts

#自然语言处理#[NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

diffusion-models 机器学习 mathematical-reasoning 自然语言处理 non-autoregressive PyTorch text-generation

Python 172

5 个月前

CSfufu / Revisual-R1

🚀ReVisual-R1 is a 7B open-source multimodal language model that follows a three-stage curriculum—cold-start pre-training, multimodal reinforcement learning, and text-only reinforcement learning—to ac...

mathematical-reasoning reinforcement-learning

Python 171

23 天前

akjindal53244 / Arithmo

#大语言模型#Small and Efficient Mathematical Reasoning LLMs

large-language-models 大语言模型 mathematical-reasoning mistral-7b

Python 71

2 年前

OSU-NLP-Group / llm-planning-eval

[ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"

large-language-models mathematical-reasoning planning text-to-sql tree-search

Python 54

1 年前

mukhal / GRACE

#大语言模型#[EMNLP '23] Discriminator-Guided Chain-of-Thought Reasoning

chain-of-thought decoding language-model reasoning text-generation 大语言模型 mathematical-reasoning

Python 48

10 个月前

Alsace08 / OOD-Math-Reasoning

[NeurIPS 2024] Code and Data Repo for Paper "Embedding Trajectory for Out-of-Distribution Detection in Mathematical Reasoning"

mathematical-reasoning out-of-distribution-detection

Python 27

1 年前

QwenLM / PolyMath

Evaluation Code Repo for Paper "PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts"

large-language-models mathematical-reasoning multilingual qwen3

Python 26

2 个月前

conceptmath / conceptmath

#大语言模型#[ACL 2024 Findings] The official repo for "ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language Models".

benchmark 大语言模型 mathematical-reasoning

Python 24

1 年前

alexanderknop / I2DM

The lecture notes for my discrete mathematics classes.

lecture-notes mathematical-reasoning graph-theory game-theory

TeX 18

2 年前

adeelahmad / mlx-grpo

#大语言模型#🧠 Train your own DeepSeek-R1 style reasoning model on Mac! First MLX implementation of GRPO - the breakthrough technique behind R1's o1-matching performance. Build mathematical reasoning AI without e...

人工智能 grpo 大语言模型 MLX thinking apple-silicon chain-of-thought llama mathematical-reasoning rlhf deepseek-r1

Python 16

1 个月前

sparkle-reasoning / sparkle

#计算机科学#Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

large-language-models mathematical-reasoning reinforcement-learning scaling grpo 机器学习 qwen rlhf

Python 13

23 天前

RamonKaspar / MathPrompter

MathPrompter Implementation: This repository hosts an implementation based on the 'MathPrompter: Mathematical Reasoning Using Large Language Models' paper by Microsoft Research. The code replicates th...

large-language-models mathematical-reasoning

Python 13

4 个月前

JunyiYe / CreativeMath

[AAAI 2025] Assessing the Creativity of LLMs in Proposing Novel Solutions to Mathematical Problems

creativity large-language-models mathematical-reasoning benchmarking

Jupyter Notebook 11

3 个月前

Nativeatom / FRoG

#自然语言处理#Fuzzy reasoning of Generalized Quantifiers (EMNLP 2024)

自然语言处理 mathematical-reasoning reasoning

Python 8

7 个月前

SuperBruceJia / GSM8K-Consistency

GSM8K-Consistency is a benchmark database for analyzing the consistency of Arithmetic Reasoning on GSM8K.

mathematical-reasoning foundation-models large-language-models reasoning prompt prompt-engineering prompt-toolkit

2 年前

ahmedmhussein111 / mlx-grpo

#大语言模型#MLX-GRPO allows you to train your own DeepSeek-R1 models directly on your Mac. This implementation simplifies the process of building advanced reasoning AI, making it accessible for developers. 🐙🌟

人工智能 apple-silicon chain-of-thought deepseek-r1 grpo llama 大语言模型 mathematical-reasoning MLX rlhf thinking

Python 1

1 个月前

RamonKaspar / Math-Capabilities-LLM

#大语言模型#We implement and benchmark various prompting techniques for LLMs (i.e. PAL, CoT, PoT, etc.) on a specialized math reasoning dataset (on elementary school grade).

chain-of-thought 大语言模型 mathematical-reasoning sympy

Python 1

1 年前

RamonKaspar / MathDataset-ElementarySchool

#大语言模型#This dataset aggregates carefully selected elementary-level math problems from various existing resources, providing an optimal mix for testing and enhancing math-solving chatbots for young learners.

dataset 大语言模型 mathematical-reasoning

Python 1

1 个月前