Kevin16
			's Collections
			 
		
			
		LLM Paperlist
		
	updated
			
 
				
				
	
	
	
			
			Mixture-of-Agents Enhances Large Language Model Capabilities
		
			Paper
			
•
			2406.04692
			
•
			Published
				
			•
				
				59
			
 
	
	 
	
	
	
			
			CRAG -- Comprehensive RAG Benchmark
		
			Paper
			
•
			2406.04744
			
•
			Published
				
			•
				
				48
			
 
	
	 
	
	
	
			
			Boosting Large-scale Parallel Training Efficiency with C4: A
  Communication-Driven Approach
		
			Paper
			
•
			2406.04594
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Buffer of Thoughts: Thought-Augmented Reasoning with Large Language
  Models
		
			Paper
			
•
			2406.04271
			
•
			Published
				
			•
				
				30
			
 
	
	 
	
	
	
			
			4-bit Shampoo for Memory-Efficient Network Training
		
			Paper
			
•
			2405.18144
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			Self-Exploring Language Models: Active Preference Elicitation for Online
  Alignment
		
			Paper
			
•
			2405.19332
			
•
			Published
				
			•
				
				22
			
 
	
	 
	
	
	
		
			Paper
			
•
			2405.18407
			
•
			Published
				
			•
				
				48
			
 
	
	 
	
	
	
			
			2BP: 2-Stage Backpropagation
		
			Paper
			
•
			2405.18047
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			Yuan 2.0-M32: Mixture of Experts with Attention Router
		
			Paper
			
•
			2405.17976
			
•
			Published
				
			•
				
				21
			
 
	
	 
	
	
	
			
			LLaMA-NAS: Efficient Neural Architecture Search for Large Language
  Models
		
			Paper
			
•
			2405.18377
			
•
			Published
				
			•
				
				20
			
 
	
	 
	
	
	
			
			LongRAG: Enhancing Retrieval-Augmented Generation with Long-context LLMs
		
			Paper
			
•
			2406.15319
			
•
			Published
				
			•
				
				64
			
 
	
	 
	
	
	
			
			ColPali: Efficient Document Retrieval with Vision Language Models
		
			Paper
			
•
			2407.01449
			
•
			Published
				
			•
				
				50
			
 
	
	 
	
	
	
			
			SeaKR: Self-aware Knowledge Retrieval for Adaptive Retrieval Augmented
  Generation
		
			Paper
			
•
			2406.19215
			
•
			Published
				
			•
				
				31
			
 
	
	 
	
	
	
			
			Visual Haystacks: Answering Harder Questions About Sets of Images
		
			Paper
			
•
			2407.13766
			
•
			Published
				
			•
				
				2