Adrian Murat Ozdemir
muratowski
AI & ML interests
None yet
Organizations
None yet
Dolphin R1
Diarization
GGUF
Embedding Models
Coding LLM
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation ⢠236B ⢠Updated ⢠2.89k ⢠668 -
PJMixers-Archive/LLaMa-3-CursedStock-v1.8-8B
Text Generation ⢠8B ⢠Updated ⢠6 -
legraphista/codegeex4-all-9b-IMat-GGUF
Text Generation ⢠9B ⢠Updated ⢠963 ⢠8 -
zai-org/codegeex4-all-9b
Text Generation ⢠9B ⢠Updated ⢠2.08k ⢠259
Interesting Datasets for LLM
-
HannahRoseKirk/prism-alignment
Viewer ⢠Updated ⢠77.9k ⢠1.3k ⢠92 -
Salesforce/xlam-function-calling-60k
Viewer ⢠Updated ⢠60k ⢠5.22k ⢠541 -
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer ⢠Updated ⢠249k ⢠326 ⢠63 -
gretelai/synthetic_pii_finance_multilingual
Viewer ⢠Updated ⢠55.9k ⢠690 ⢠69
Dataset types for Synthetic Data Creation Methods for LLM
Voice Generation Models
Multimodal LLM
Stable Diffusion Image Model
Largest LLM Models
Text to image models
Small LLM
OCR and Image Definition
-
microsoft/Florence-2-large-ft
Image-Text-to-Text ⢠0.8B ⢠Updated ⢠52.6k ⢠373 -
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text ⢠0.7B ⢠Updated ⢠48.8k ⢠1.52k -
jinaai/jina-colbert-v2
0.6B ⢠Updated ⢠58.7k ⢠135 -
vidore/colpali-v1.2
Visual Document Retrieval ⢠Updated ⢠35.7k ⢠112
Image manipulation for e-conmmerce with Generative AI
Background remover
Classification
Face Adapter
Sheets HF integration
Non-alignment LLM
Image Segmentation Models
Spaces
Strong small LLM Models
-
internlm/internlm2_5-7b-chat-1m
Text Generation ⢠8B ⢠Updated ⢠259 ⢠72 -
AI-MO/NuminaMath-7B-TIR
Text Generation ⢠7B ⢠Updated ⢠46 ⢠348 -
SciPhi/Triplex
Text Generation ⢠4B ⢠Updated ⢠734 ⢠306 -
internlm/internlm2_5-20b-chat
Text Generation ⢠20B ⢠Updated ⢠248 ⢠91
Multi-token prediction
Image Manipulation Model
Rag Models
Transcription
Datasets
Markdown convertor
Deep Research Models
Image manipulation for e-conmmerce with Generative AI
Dolphin R1
Background remover
Diarization
Classification
GGUF
Face Adapter
Embedding Models
Sheets HF integration
Coding LLM
-
deepseek-ai/DeepSeek-Coder-V2-Instruct
Text Generation ⢠236B ⢠Updated ⢠2.89k ⢠668 -
PJMixers-Archive/LLaMa-3-CursedStock-v1.8-8B
Text Generation ⢠8B ⢠Updated ⢠6 -
legraphista/codegeex4-all-9b-IMat-GGUF
Text Generation ⢠9B ⢠Updated ⢠963 ⢠8 -
zai-org/codegeex4-all-9b
Text Generation ⢠9B ⢠Updated ⢠2.08k ⢠259
Non-alignment LLM
Interesting Datasets for LLM
-
HannahRoseKirk/prism-alignment
Viewer ⢠Updated ⢠77.9k ⢠1.3k ⢠92 -
Salesforce/xlam-function-calling-60k
Viewer ⢠Updated ⢠60k ⢠5.22k ⢠541 -
instruction-pretrain/ft-instruction-synthesizer-collection
Viewer ⢠Updated ⢠249k ⢠326 ⢠63 -
gretelai/synthetic_pii_finance_multilingual
Viewer ⢠Updated ⢠55.9k ⢠690 ⢠69
Image Segmentation Models
Dataset types for Synthetic Data Creation Methods for LLM
Spaces
Voice Generation Models
Strong small LLM Models
-
internlm/internlm2_5-7b-chat-1m
Text Generation ⢠8B ⢠Updated ⢠259 ⢠72 -
AI-MO/NuminaMath-7B-TIR
Text Generation ⢠7B ⢠Updated ⢠46 ⢠348 -
SciPhi/Triplex
Text Generation ⢠4B ⢠Updated ⢠734 ⢠306 -
internlm/internlm2_5-20b-chat
Text Generation ⢠20B ⢠Updated ⢠248 ⢠91
Multimodal LLM
Multi-token prediction
Stable Diffusion Image Model
Image Manipulation Model
Largest LLM Models
Rag Models
Text to image models
Transcription
Small LLM
Datasets
OCR and Image Definition
-
microsoft/Florence-2-large-ft
Image-Text-to-Text ⢠0.8B ⢠Updated ⢠52.6k ⢠373 -
stepfun-ai/GOT-OCR2_0
Image-Text-to-Text ⢠0.7B ⢠Updated ⢠48.8k ⢠1.52k -
jinaai/jina-colbert-v2
0.6B ⢠Updated ⢠58.7k ⢠135 -
vidore/colpali-v1.2
Visual Document Retrieval ⢠Updated ⢠35.7k ⢠112
Markdown convertor