Collections
Discover the best community collections!
Collections including paper arxiv:2307.16789
-
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper • 2309.14717 • Published • 45 -
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 29 -
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams
Paper • 2310.08678 • Published • 14 -
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Paper • 2310.09478 • Published • 21
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 77 -
One Wide Feedforward is All You Need
Paper • 2309.01826 • Published • 33 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 42 -
Shepherd: A Critic for Language Model Generation
Paper • 2308.04592 • Published • 32
-
Attention Is All You Need
Paper • 1706.03762 • Published • 99 -
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Paper • 2005.11401 • Published • 14 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 54 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 15
-
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 247 -
Large-Scale Automatic Audiobook Creation
Paper • 2309.03926 • Published • 55 -
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 33 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88
-
Attention Is All You Need
Paper • 1706.03762 • Published • 99 -
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks
Paper • 2005.11401 • Published • 14 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 54 -
FlashAttention: Fast and Memory-Efficient Exact Attention with IO-Awareness
Paper • 2205.14135 • Published • 15
-
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
Paper • 2309.14717 • Published • 45 -
PaLI-3 Vision Language Models: Smaller, Faster, Stronger
Paper • 2310.09199 • Published • 29 -
Can GPT models be Financial Analysts? An Evaluation of ChatGPT and GPT-4 on mock CFA Exams
Paper • 2310.08678 • Published • 14 -
MiniGPT-v2: large language model as a unified interface for vision-language multi-task learning
Paper • 2310.09478 • Published • 21
-
Llama 2: Open Foundation and Fine-Tuned Chat Models
Paper • 2307.09288 • Published • 247 -
Large-Scale Automatic Audiobook Creation
Paper • 2309.03926 • Published • 55 -
From Sparse to Dense: GPT-4 Summarization with Chain of Density Prompting
Paper • 2309.04269 • Published • 33 -
Textbooks Are All You Need II: phi-1.5 technical report
Paper • 2309.05463 • Published • 88
-
Large Language Models as Optimizers
Paper • 2309.03409 • Published • 77 -
One Wide Feedforward is All You Need
Paper • 2309.01826 • Published • 33 -
Self-Alignment with Instruction Backtranslation
Paper • 2308.06259 • Published • 42 -
Shepherd: A Critic for Language Model Generation
Paper • 2308.04592 • Published • 32