Models
Datasets
Spaces
Docs
Enterprise
Pricing
Log In
Sign Up

Collections

Discover the best community collections!

Collections including paper arxiv:2408.11878

about 13 hours ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 189
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1, 2024 • 17
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 51
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing

Paper • 2311.00571 • Published Nov 1, 2023 • 43

GenAI-based Time Series

Leveraging generative models like transformers and GANs for advanced time series prediction and analysis.

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Paper • 2409.16040 • Published Sep 24, 2024 • 16
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Paper • 2410.10469 • Published Oct 14, 2024 • 1
Unified Training of Universal Time Series Forecasting Transformers

Paper • 2402.02592 • Published Feb 4, 2024 • 8
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

LLM + Datasets : Finance

MMMU/MMMU

Viewer • Updated Sep 19, 2024 • 11.6k • 52.2k • 293
takala/financial_phrasebank

Updated Jan 18, 2024 • 4.72k • 240
zeroshot/twitter-financial-news-sentiment

Viewer • Updated Feb 23, 2024 • 11.9k • 4.54k • 154
yixuantt/FinEntity

Viewer • Updated Jan 24, 2024 • 979 • 83 • 4

Runtime error

2.77k

XTTS

🐸

2.77k

Generate speech from text using a reference voice
deadman44/Flux_Photoreal_LoRA

Text-to-Image • 12B • Updated Nov 25, 2024 • 242 • 18
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 38k • 1.52k
Running

218

Kokoro Text-to-Speech

🗣

218

High-quality speech synthesis powered by Kokoro TTS

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 23

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 57
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 52
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

TheFinAI/FinLLaVA

Image-Text-to-Text • 8B • Updated Aug 28, 2024 • 93 • 21
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

about 13 hours ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 189
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

Paper • 2401.00849 • Published Jan 1, 2024 • 17
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

Paper • 2311.05437 • Published Nov 9, 2023 • 51
LLaVA-Interactive: An All-in-One Demo for Image Chat, Segmentation, Generation and Editing

Paper • 2311.00571 • Published Nov 1, 2023 • 43

Runtime error

2.77k

XTTS

🐸

2.77k

Generate speech from text using a reference voice
deadman44/Flux_Photoreal_LoRA

Text-to-Image • 12B • Updated Nov 25, 2024 • 242 • 18
stepfun-ai/GOT-OCR2_0

Image-Text-to-Text • 0.7B • Updated Feb 4 • 38k • 1.52k
Running

218

Kokoro Text-to-Speech

🗣

218

High-quality speech synthesis powered by Kokoro TTS

GenAI-based Time Series

Leveraging generative models like transformers and GANs for advanced time series prediction and analysis.

Time-MoE: Billion-Scale Time Series Foundation Models with Mixture of Experts

Paper • 2409.16040 • Published Sep 24, 2024 • 16
Moirai-MoE: Empowering Time Series Foundation Models with Sparse Mixture of Experts

Paper • 2410.10469 • Published Oct 14, 2024 • 1
Unified Training of Universal Time Series Forecasting Transformers

Paper • 2402.02592 • Published Feb 4, 2024 • 8
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published Sep 3, 2024 • 34
Phidias: A Generative Model for Creating 3D Content from Text, Image, and 3D Conditions with Reference-Augmented Diffusion

Paper • 2409.11406 • Published Sep 17, 2024 • 27
Diffusion Models Are Real-Time Game Engines

Paper • 2408.14837 • Published Aug 27, 2024 • 126
Segment Anything with Multiple Modalities

Paper • 2408.09085 • Published Aug 17, 2024 • 22

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20, 2024 • 23

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

LLM Pruning and Distillation in Practice: The Minitron Approach

Paper • 2408.11796 • Published Aug 21, 2024 • 57
TableBench: A Comprehensive and Complex Benchmark for Table Question Answering

Paper • 2408.09174 • Published Aug 17, 2024 • 52
To Code, or Not To Code? Exploring Impact of Code in Pre-training

Paper • 2408.10914 • Published Aug 20, 2024 • 44
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

LLM + Datasets : Finance

MMMU/MMMU

Viewer • Updated Sep 19, 2024 • 11.6k • 52.2k • 293
takala/financial_phrasebank

Updated Jan 18, 2024 • 4.72k • 240
zeroshot/twitter-financial-news-sentiment

Viewer • Updated Feb 23, 2024 • 11.9k • 4.54k • 154
yixuantt/FinEntity

Viewer • Updated Jan 24, 2024 • 979 • 83 • 4

TheFinAI/FinLLaVA

Image-Text-to-Text • 8B • Updated Aug 28, 2024 • 93 • 21
Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 63

Previous
1
2
Next

Company

TOS Privacy About Jobs

Website

Models Datasets Spaces Pricing Docs