matlok
			's Collections
			 
		
			
		Papers - Microsoft
		
	updated
			
 
				
				
	
	
	
			
			Can large language models explore in-context?
		
			Paper
			
•
			2403.15371
			
•
			Published
				
			•
				
				33
			
 
	
	 
	
	
	
			
			GaussianCube: Structuring Gaussian Splatting using Optimal Transport for
  3D Generative Modeling
		
			Paper
			
•
			2403.19655
			
•
			Published
				
			•
				
				19
			
 
	
	 
	
	
	
			
			WavLLM: Towards Robust and Adaptive Speech Large Language Model
		
			Paper
			
•
			2404.00656
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			Enabling Memory Safety of C Programs using LLMs
		
			Paper
			
•
			2404.01096
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			LLM-ABR: Designing Adaptive Bitrate Algorithms via Large Language Models
		
			Paper
			
•
			2404.01617
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			LayoutLMv3: Pre-training for Document AI with Unified Text and Image
  Masking
		
			Paper
			
•
			2204.08387
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document
  Understanding
		
			Paper
			
•
			2012.14740
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			LayoutLM: Pre-training of Text and Layout for Document Image
  Understanding
		
			Paper
			
•
			1912.13318
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			PIQA: Reasoning about Physical Commonsense in Natural Language
		
			Paper
			
•
			1911.11641
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			Are NLP Models really able to Solve Simple Math Word Problems?
		
			Paper
			
•
			2103.07191
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Learning From Mistakes Makes LLM Better Reasoner
		
			Paper
			
•
			2310.20689
			
•
			Published
				
			•
				
				29
			
 
	
	 
	
	
	
			
			Orca: Progressive Learning from Complex Explanation Traces of GPT-4
		
			Paper
			
•
			2306.02707
			
•
			Published
				
			•
				
				47
			
 
	
	 
	
	
	
			
			TrOCR: Transformer-based Optical Character Recognition with Pre-trained
  Models
		
			Paper
			
•
			2109.10282
			
•
			Published
				
			•
				
				11
			
 
	
	 
	
	
	
			
			RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting
  for Text-to-Speech Synthesis
		
			Paper
			
•
			2404.03204
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			LVLM-Intrepret: An Interpretability Tool for Large Vision-Language
  Models
		
			Paper
			
•
			2404.03118
			
•
			Published
				
			•
				
				26
			
 
	
	 
	
	
	
			
			Direct Nash Optimization: Teaching Language Models to Self-Improve with
  General Preferences
		
			Paper
			
•
			2404.03715
			
•
			Published
				
			•
				
				62
			
 
	
	 
	
	
	
			
			Elephants Never Forget: Memorization and Learning of Tabular Data in
  Large Language Models
		
			Paper
			
•
			2404.06209
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Visualization-of-Thought Elicits Spatial Reasoning in Large Language
  Models
		
			Paper
			
•
			2404.03622
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Rho-1: Not All Tokens Are What You Need
		
			Paper
			
•
			2404.07965
			
•
			Published
				
			•
				
				93
			
 
	
	 
	
	
	
			
			ResearchAgent: Iterative Research Idea Generation over Scientific
  Literature with Large Language Models
		
			Paper
			
•
			2404.07738
			
•
			Published
				
			•
				
				2
			
 
	
	 
	
	
	
			
			GLIGEN: Open-Set Grounded Text-to-Image Generation
		
			Paper
			
•
			2301.07093
			
•
			Published
				
			•
				
				4
			
 
	
	 
	
	
	
			
			Grounded Language-Image Pre-training
		
			Paper
			
•
			2112.03857
			
•
			Published
				
			•
				
				3
			
 
	
	 
	
	
	
			
			Phi-3 Technical Report: A Highly Capable Language Model Locally on Your
  Phone
		
			Paper
			
•
			2404.14219
			
•
			Published
				
			•
				
				258
			
 
	
	 
	
	
	
			
			Multi-Head Mixture-of-Experts
		
			Paper
			
•
			2404.15045
			
•
			Published
				
			•
				
				60
			
 
	
	 
	
	
	
			
			Deep Residual Learning for Image Recognition
		
			Paper
			
•
			1512.03385
			
•
			Published
				
			•
				
				8
			
 
	
	 
	
	
	
			
			You Only Cache Once: Decoder-Decoder Architectures for Language Models
		
			Paper
			
•
			2405.05254
			
•
			Published
				
			•
				
				10
			
 
	
	 
	
	
	
			
			MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation
  in Videos
		
			Paper
			
•
			2406.08407
			
•
			Published
				
			•
				
				28
			
 
	
	 
	
	
	
			
			Florence-2: Advancing a Unified Representation for a Variety of Vision
  Tasks
		
			Paper
			
•
			2311.06242
			
•
			Published
				
			•
				
				95
			
 
	
	 
	
	
	
			
			DoLa: Decoding by Contrasting Layers Improves Factuality in Large
  Language Models
		
			Paper
			
•
			2309.03883
			
•
			Published
				
			•
				
				35
			
 
	
	 
	
	
	
			
			SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
		
			Paper
			
•
			2407.09025
			
•
			Published
				
			•
				
				139
			
 
	
	 
	
	
	
		
			Paper
			
•
			2410.05258
			
•
			Published
				
			•
				
				179
			
 
	
	 
	
	
	
			
			1-bit AI Infra: Part 1.1, Fast and Lossless BitNet b1.58 Inference on
  CPUs
		
			Paper
			
•
			2410.16144
			
•
			Published
				
			•
				
				5
			
 
	
	 
	
	
	
			
			Learning a SAT Solver from Single-Bit Supervision
		
			Paper
			
•
			1802.03685
			
•
			Published
				
			•
				
				1
			
 
	
	 
	
	
	
			
			Compiling C to Safe Rust, Formalized
		
			Paper
			
•
			2412.15042
			
•
			Published
				
			•
				
				1