Collections
Collections including paper arxiv:2312.03700
-
Weak-to-Strong Generalization: Eliciting Strong Capabilities With Weak Supervision
Paper • 2312.09390 • Published • 33 -
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Generative Multimodal Models are In-Context Learners
Paper • 2312.13286 • Published • 37 -
The LLM Surgeon
Paper • 2312.17244 • Published • 9
-
Random Field Augmentations for Self-Supervised Representation Learning
Paper • 2311.03629 • Published • 10 -
TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models
Paper • 2311.04589 • Published • 23 -
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs
Paper • 2311.04901 • Published • 11 -
Q-Instruct: Improving Low-level Visual Abilities for Multi-modality Foundation Models
Paper • 2311.06783 • Published • 28
-
Attention Is All You Need
Paper • 1706.03762 • Published • 96 -
FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning
Paper • 2307.08691 • Published • 9 -
Mixtral of Experts
Paper • 2401.04088 • Published • 160 -
Mistral 7B
Paper • 2310.06825 • Published • 55
-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 151 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper • 2401.08967 • Published • 31 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 22 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 69
-
OneLLM: One Framework to Align All Modalities with Language
Paper • 2312.03700 • Published • 24 -
Direct-a-Video: Customized Video Generation with User-Directed Camera Movement and Object Motion
Paper • 2402.03162 • Published • 19 -
Rolling Diffusion Models
Paper • 2402.09470 • Published • 14 -
AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling
Paper • 2402.12226 • Published • 45
-
Trusted Source Alignment in Large Language Models
Paper • 2311.06697 • Published • 12 -
Diffusion Model Alignment Using Direct Preference Optimization
Paper • 2311.12908 • Published • 50 -
SuperHF: Supervised Iterative Learning from Human Feedback
Paper • 2310.16763 • Published • 1 -
Enhancing Diffusion Models with Text-Encoder Reinforcement Learning
Paper • 2311.15657 • Published • 2
-
Levels of AGI for Operationalizing Progress on the Path to AGI
Paper • 2311.02462 • Published • 38 -
Ultra-Long Sequence Distributed Transformer
Paper • 2311.02382 • Published • 6 -
Unifying the Perspectives of NLP and Software Engineering: A Survey on Language Models for Code
Paper • 2311.07989 • Published • 26 -
GRIM: GRaph-based Interactive narrative visualization for gaMes
Paper • 2311.09213 • Published • 13
-
Woodpecker: Hallucination Correction for Multimodal Large Language Models
Paper • 2310.16045 • Published • 17 -
HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
Paper • 2310.14566 • Published • 27 -
SILC: Improving Vision Language Pretraining with Self-Distillation
Paper • 2310.13355 • Published • 9 -
Conditional Diffusion Distillation
Paper • 2310.01407 • Published • 20