Your Bench
community
						
						
						
						AI & ML interests
None defined yet.
Recent Activity
	View all activity
	
			Organization Card
		
		YourBench is an open-source framework for generating zero-shot benchmarks from your own documents. It helps you test language models on custom domains using automated pipelines for ingestion, summarization, and question generation.
- 📚 Build benchmarks from PDFs, HTML, or text files
 - 🧠 Generate both single-hop and multi-hop questions
 - 🔍 Evaluate top models and deploy leaderboards instantly
 - 🛠️ Fully configurable via a single YAML file
 
Built with 🤗 by the OpenEvals team — GitHub
- 
	
	
	
yourbench/yourbench_reproduction_o4mini_biology
Viewer • Updated • 1.83k • 29 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_business
Viewer • Updated • 829 • 28 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_chemistry
Viewer • Updated • 805 • 35 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_computerscience
Viewer • Updated • 1.81k • 4 
- 
	
	
	
yourbench/yourbench_reproduction_o4mini_biology
Viewer • Updated • 1.83k • 29 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_business
Viewer • Updated • 829 • 28 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_chemistry
Viewer • Updated • 805 • 35 - 
	
	
	
yourbench/yourbench_reproduction_o4mini_computerscience
Viewer • Updated • 1.81k • 4 
			spaces
			7
		
			
	
	
	
	
	
		Running
		
			on 
			
			CPU Upgrade
	
					
					41
YourBench
🚀
Generate custom evaluations from your data easily!
		Sleeping
		
	Essential Web Medical
🏆
Select and annotate high-quality web documents
		Running
		
	View Essentialweb Cleaned
🏃
		Sleeping
		
	Reachy Trivia
🚀
Trivia Questions For The Reachy Mini and Reachy Team!
		Runtime error
		
	Essential Web Annotation
📊
Annotating Essential Web!
		Sleeping
		
	Visualize Expert Level Filter
🔥
Browse and inspect classified documents from a dataset
			models
			0
		
			
	None public yet
			datasets
			84
		
			
	
	
	
	
	yourbench/childrens_books_questions
			Viewer
			• 
	
				Updated
					
				• 
			
			62
	
				• 
					
					13
				
				
				
yourbench/mckinsey_great_trade_global_report
			Viewer
			• 
	
				Updated
					
				• 
			
			511
	
				• 
					
					35
				
				
				
yourbench/aws_bedrock_documentation_demo
			Viewer
			• 
	
				Updated
					
				• 
			
			1.18k
	
				• 
					
					18
				
				
				
yourbench/yourbench-custom-prompts-example-gpt-4.1
			Viewer
			• 
	
				Updated
					
				• 
			
			55
	
				• 
					
					42
				
				
				
yourbench/yourbench-custom-prompts-example-oss-120b
			Viewer
			• 
	
				Updated
					
				• 
			
			3
	
				• 
					
					22
				
				
				
yourbench/yourbench-custom-prompts-example
			Viewer
			• 
	
				Updated
					
				• 
			
			52
	
				• 
					
					39
				
				
				
yourbench/yourbench-simple-example
			Viewer
			• 
	
				Updated
					
				• 
			
			46
	
				• 
					
					12
				
				
				
yourbench/mckinsey_state_of_ai_doc_understanding
			Viewer
			• 
	
				Updated
					
				• 
			
			29
	
				• 
					
					73
				
				
				
yourbench/highpass-medfilter-v2
			Viewer
			• 
	
				Updated
					
				• 
			
			465
	
				• 
					
					13
				
				
				
yourbench/highpassfilter-medical-documents-o4-mini
			Viewer
			• 
	
				Updated
					
				• 
			
			465
	
				• 
					
					13