Datasets used to train SmolDocling
			
	
	HuggingFaceM4
Team 
						
company
						
						
						
						AI & ML interests
None defined yet.
Recent Activity
	View all activity
	
			Organization Card
		
		
    
HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.
Within this organization on the Hugging Face hub, you can access the Idefics models (version 1 IDEFICS, version 2 Idefics2, version 3 Idefics3), datasets used for the training like OBELICS, WebSight, The Cauldron or Docmatix, and interactive tools to visualize the results.
Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.
			
	
	- 
	
	
	168
IDEFICS2 Playground
🐨Chat with an AI assistant using text and images
 - 
	
	
	
				HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 6.15k • 617 - 
	
	
	
				HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text • 8B • Updated • 183 • 95 - 
	
	
	
				HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 1.86k • 28 
Datasets used to train SmolDocling
			
	
	Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation.
			
	
	- 
	
	
	168
IDEFICS2 Playground
🐨Chat with an AI assistant using text and images
 - 
	
	
	
				HuggingFaceM4/idefics2-8b
Image-Text-to-Text • 8B • Updated • 6.15k • 617 - 
	
	
	
				HuggingFaceM4/idefics2-8b-chatty
Image-Text-to-Text • 8B • Updated • 183 • 95 - 
	
	
	
				HuggingFaceM4/idefics2-8b-base
Image-Text-to-Text • 8B • Updated • 1.86k • 28 
			spaces
			16
		
			
	
	
	
	
	
						pinned
				
		Build error
		
	
					
					377
IDEFICS Playground
🐨
		Running
		
	
					
					197
FineVision: Open Data is All You Need
📝
A new open-source dataset for training VLMs
		Paused
		
	
					
					102
Idefics3
📊
Generate text based on an image and prompt
		Running
		
			on 
			
			Zero
	
					
					17
Florence 2
📉
Answer questions about images using text prompts
		Running
		
			on 
			
			Zero
	
					
					910
Screenshot to HTML
⚡
Convert screenshots to HTML code
			models
			34
		
			
	
	
	
	
	HuggingFaceM4/Idefics3-8B-Llama3
			Image-Text-to-Text
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					102k
				
	
				• 
					
					295
				
HuggingFaceM4/Florence-2-DocVQA
			Image-Text-to-Text
			• 
		
				0.8B
			• 
	
				Updated
					
				
				• 
					
					860
				
	
				• 
					
					62
				
HuggingFaceM4/idefics2-8b
			Image-Text-to-Text
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					6.15k
				
	
				• 
					
					617
				
HuggingFaceM4/idefics2-8b-base
			Image-Text-to-Text
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					1.86k
				
	
				• 
					
					28
				
HuggingFaceM4/idefics2-8b-chatty
			Image-Text-to-Text
			• 
		
				8B
			• 
	
				Updated
					
				
				• 
					
					183
				
	
				• 
					
					95
				
HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit
			Zero-Shot Image Classification
			• 
		
				0.9B
			• 
	
				Updated
					
				
				
				
	
				• 
					
					1
				
HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit
			Zero-Shot Image Classification
			• 
		
				0.9B
			• 
	
				Updated
					
				
				• 
					
					2
				
	
				• 
					
					2
				
HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit
			Zero-Shot Image Classification
			• 
		
				0.9B
			• 
	
				Updated
					
				
				• 
					
					2
				
	
				• 
					
					1
				
HuggingFaceM4/idefics2-8b-chatty-AWQ
			Image-Text-to-Text
			• 
		
				2B
			• 
	
				Updated
					
				
				• 
					
					1
				
	
				• 
					
					5
				
HuggingFaceM4/idefics2-8b-AWQ
			Image-Text-to-Text
			• 
		
				2B
			• 
	
				Updated
					
				
				• 
					
					25
				
	
				• 
					
					26
				
			datasets
			82
		
			
	
	
	
	
	HuggingFaceM4/FineVisionMax
			Viewer
			• 
	
				Updated
					
				• 
			
			24.2M
	
				• 
					
					15.9k
				
				• 
					
					14
				
HuggingFaceM4/FineVision
			Viewer
			• 
	
				Updated
					
				• 
			
			24.2M
	
				• 
					
					251k
				
				• 
					
					426
				
HuggingFaceM4/lmms-eval-embeddings
	
				Updated
					
				
	
				• 
					
					163
				
				• 
					
					1
				
HuggingFaceM4/DoclingMatix
			Viewer
			• 
	
				Updated
					
				• 
			
			1.27M
	
				• 
					
					7.54k
				
				• 
					
					46
				
HuggingFaceM4/Caltech-101
	
				Updated
					
				
	
				• 
					
					385
				
				• 
					
					3
				
HuggingFaceM4/Docmatix
			Viewer
			• 
	
				Updated
					
				• 
			
			2.55M
	
				• 
					
					34.9k
				
				• 
					
					291
				
HuggingFaceM4/the_cauldron
			Viewer
			• 
	
				Updated
					
				• 
			
			1.88M
	
				• 
					
					91.9k
				
				• 
					
					504
				
HuggingFaceM4/FairFace
			Viewer
			• 
	
				Updated
					
				• 
			
			195k
	
				• 
					
					2.2k
				
				• 
					
					23
				
HuggingFaceM4/MMBench
			Viewer
			• 
	
				Updated
					
				• 
			
			11k
	
				• 
					
					601
				
				• 
					
					4
				
HuggingFaceM4/WebSight
			Viewer
			• 
	
				Updated
					
				• 
			
			2.75M
	
				• 
					
					14.6k
				
				• 
					
					374