matlok's Collections

Papers - Math - Reasoning
	updated
			
 
				
				
	
	
	
			
Advancing LLM Reasoning Generalists with Preference Trees
Paper • 2404.02078 • Published • 46
			
 
	
	 
	
	
	
			
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
Paper • 2404.02893 • Published • 22
			
 
	
	 
	
	
	
			
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language Models
Paper • 2309.12284 • Published • 18
			
 
	
	 
	
	
	
			
Premise Order Matters in Reasoning with Large Language Models
Paper • 2402.08939 • Published • 28
			
 
	
	 
	
	
	
			
Improve Mathematical Reasoning in Language Models by Automated Process Supervision
Paper • 2406.06592 • Published • 29
			
 
	
	 
	
	
	
			
We-Math: Does Your Large Multimodal Model Achieve Human-like Mathematical Reasoning?
Paper • 2407.01284 • Published • 81
			
 
	
	 
	
	
	
			
ToRA: A Tool-Integrated Reasoning Agent for Mathematical Problem Solving
Paper • 2309.17452 • Published • 3
			
 
	
	 
	
	
	
			
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On
Paper • 2407.08348 • Published • 52
			
 
	
	 
	
	
	
			
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models
Paper • 2411.00836 • Published • 15
			
 
	
	 
	
	
	
			
GSM-Symbolic: Understanding the Limitations of Mathematical Reasoning in Large Language Models
Paper • 2410.05229 • Published • 22
			
 
	
	 
	
	
	
			
Physics of Language Models: Part 2.1, Grade-School Math and the Hidden Reasoning Process
Paper • 2407.20311 • Published • 5