matlok
			's Collections
			 
		
			
		Papers - Meta
		
	updated
			
 
				
				
	
	
	
			
			LIMA: Less Is More for Alignment
		
			Paper
			
•
			2305.11206
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			Garment3DGen: 3D Garment Stylization and Texture Generation
		
			Paper
			
•
			2403.18816
			
•
			Published
				
			•
				
				25
			
 
	
	 
	
	
	
			
			EgoLifter: Open-world 3D Segmentation for Egocentric Perception
		
			Paper
			
•
			2403.18118
			
•
			Published
				
			•
				
				12
			
 
	
	 
	
	
	
			
			The Unreasonable Ineffectiveness of the Deeper Layers
		
			Paper
			
•
			2403.17887
			
•
			Published
				
			•
				
				82
			
 
	
	 
	
	
	
			
			Automated Unit Test Improvement using Large Language Models at Meta
		
			Paper
			
•
			2402.09171
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			High Fidelity Neural Audio Compression
		
			Paper
			
•
			2210.13438
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			RoBERTa: A Robustly Optimized BERT Pretraining Approach
		
			Paper
			
•
			1907.11692
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			PointInfinity: Resolution-Invariant Point Diffusion Models
		
			Paper
			
•
			2404.03566
			
•
			Published
				
			•
				
				16
			
 
	
	 
	
	
	
			
			Robust Gaussian Splatting
		
			Paper
			
•
			2404.04211
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			DeiT III: Revenge of the ViT
		
			Paper
			
•
			2204.07118
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Megalodon: Efficient LLM Pretraining and Inference with Unlimited
  Context Length
		
			Paper
			
•
			2404.08801
			
•
			Published
				
			•
				
				66
			
 
	
	 
	
	
	
			
			TriForce: Lossless Acceleration of Long Sequence Generation with
  Hierarchical Speculative Decoding
		
			Paper
			
•
			2404.11912
			
•
			Published
				
			•
				
				17
			
 
	
	 
	
	
	
			
			Transformer Language Models without Positional Encodings Still Learn
  Positional Information
		
			Paper
			
•
			2203.16634
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			The Impact of Positional Encoding on Length Generalization in
  Transformers
		
			Paper
			
•
			2305.19466
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			MultiBooth: Towards Generating All Your Concepts in an Image from Text
		
			Paper
			
•
			2404.14239
			
•
			Published
				
			•
				
				9
			
 
	
	 
	
	
	
			
			MoDE: CLIP Data Experts via Clustering
		
			Paper
			
•
			2404.16030
			
•
			Published
				
			•
				
				15
			
 
	
	 
	
	
	
			
			Are Sixteen Heads Really Better than One?
		
			Paper
			
•
			1905.10650
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			Meta 3D AssetGen: Text-to-Mesh Generation with High-Quality Geometry,
  Texture, and PBR Materials
		
			Paper
			
•
			2407.02445
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Branch-Solve-Merge Improves Large Language Model Evaluation and
  Generation
		
			Paper
			
•
			2310.15123
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			Distilling System 2 into System 1
		
			Paper
			
•
			2407.06023
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			SAM 2: Segment Anything in Images and Videos
		
			Paper
			
•
			2408.00714
			
•
			Published
				
			•
				
				117
			
 
	
	 
	
	
	
			
			Poincaré Embeddings for Learning Hierarchical Representations
		
			Paper
			
•
			1705.08039
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Movie Gen: A Cast of Media Foundation Models
		
			Paper
			
•
			2410.13720
			
•
			Published
				
			•
				
				98
			
 
	
	 
	
	
	
			
			Augmenting Self-attention with Persistent Memory
		
			Paper
			
•
			1907.01470
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Byte Latent Transformer: Patches Scale Better Than Tokens
		
			Paper
			
•
			2412.09871
			
•
			Published
				
			•
				
				108
			
 
	
	 
	
	
	
			
			FastText.zip: Compressing text classification models
		
			Paper
			
•
			1612.03651
			
•
			Published
				
			•
				
				1