-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 31.1k • 641 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 607 • 279
Collections
Discover the best community collections!
Collections including paper arxiv:2401.00908
-
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper • 2312.04474 • Published • 33 -
Training Chain-of-Thought via Latent-Variable Inference
Paper • 2312.02179 • Published • 11 -
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Paper • 2312.01552 • Published • 32 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 54
-
FMViT: A multiple-frequency mixing Vision Transformer
Paper • 2311.05707 • Published • 9 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 121 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 37 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Paper • 2312.08578 • Published • 20 -
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
Paper • 2312.08583 • Published • 11 -
Vision-Language Models as a Source of Rewards
Paper • 2312.09187 • Published • 14 -
StemGen: A music generation model that listens
Paper • 2312.08723 • Published • 49
-
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Paper • 2311.07590 • Published • 17 -
Exponentially Faster Language Modelling
Paper • 2311.10770 • Published • 119 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 54 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190
-
meta-llama/Llama-2-7b-hf
Text Generation • 7B • Updated • 542k • 2.21k -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190 -
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Paper • 2401.04398 • Published • 25 -
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
Paper • 2402.01118 • Published • 32
-
UI Layout Generation with LLMs Guided by UI Grammar
Paper • 2310.15455 • Published • 3 -
You Only Look at Screens: Multimodal Chain-of-Action Agents
Paper • 2309.11436 • Published • 1 -
Never-ending Learning of User Interfaces
Paper • 2308.08726 • Published • 2 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 66
-
ControlLLM: Augment Language Models with Tools by Searching on Graphs
Paper • 2310.17796 • Published • 18 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 78 -
upstage/SOLAR-10.7B-Instruct-v1.0
Text Generation • 11B • Updated • 31.1k • 641 -
openchat/openchat-3.5-1210
Text Generation • 7B • Updated • 607 • 279
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 37 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
A Picture is Worth More Than 77 Text Tokens: Evaluating CLIP-Style Models on Dense Captions
Paper • 2312.08578 • Published • 20 -
ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks
Paper • 2312.08583 • Published • 11 -
Vision-Language Models as a Source of Rewards
Paper • 2312.09187 • Published • 14 -
StemGen: A music generation model that listens
Paper • 2312.08723 • Published • 49
-
Chain of Code: Reasoning with a Language Model-Augmented Code Emulator
Paper • 2312.04474 • Published • 33 -
Training Chain-of-Thought via Latent-Variable Inference
Paper • 2312.02179 • Published • 11 -
The Unlocking Spell on Base LLMs: Rethinking Alignment via In-Context Learning
Paper • 2312.01552 • Published • 32 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 54
-
Technical Report: Large Language Models can Strategically Deceive their Users when Put Under Pressure
Paper • 2311.07590 • Published • 17 -
Exponentially Faster Language Modelling
Paper • 2311.10770 • Published • 119 -
AppAgent: Multimodal Agents as Smartphone Users
Paper • 2312.13771 • Published • 54 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190
-
FMViT: A multiple-frequency mixing Vision Transformer
Paper • 2311.05707 • Published • 9 -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190 -
LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
Paper • 2405.00732 • Published • 121 -
An Introduction to Vision-Language Modeling
Paper • 2405.17247 • Published • 90
-
meta-llama/Llama-2-7b-hf
Text Generation • 7B • Updated • 542k • 2.21k -
DocLLM: A layout-aware generative language model for multimodal document understanding
Paper • 2401.00908 • Published • 190 -
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Paper • 2401.04398 • Published • 25 -
PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
Paper • 2402.01118 • Published • 32
-
UI Layout Generation with LLMs Guided by UI Grammar
Paper • 2310.15455 • Published • 3 -
You Only Look at Screens: Multimodal Chain-of-Action Agents
Paper • 2309.11436 • Published • 1 -
Never-ending Learning of User Interfaces
Paper • 2308.08726 • Published • 2 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 66