Mert Ege

mertege

AI & ML interests

None yet

Recent Activity

updated a model 18 days ago

mertege/moda

8B • Updated 18 days ago • 125

published 2 models 21 days ago

mertege/qwen2.5-7b-lora-tr_v3_epoch0_5-merged

8B • Updated Aug 18 • 5

mertege/moda

8B • Updated 18 days ago • 125

updated a model 2 months ago

databoss/bge_reranker_v2_m3_db_v1

0.6B • Updated Sep 10 • 2

updated a model 3 months ago

mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Aug 18 • 8

published a model 3 months ago

mertege/checkpoint-2050-merged_linear_Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Aug 18 • 8

updated a model 3 months ago

mertege/qwen2.5-7b-lora-tr_v3_epoch0_5-merged

8B • Updated Aug 18 • 5

upvoted 3 papers 3 months ago

Synthetic Data (Almost) from Scratch: Generalized Instruction Tuning for Language Models

Paper • 2402.13064 • Published Feb 20, 2024 • 50

LexC-Gen: Generating Data for Extremely Low-Resource Languages with Large Language Models and Bilingual Lexicons

Paper • 2402.14086 • Published Feb 21, 2024 • 12

ReasonRank: Empowering Passage Ranking with Strong Reasoning Ability

Paper • 2508.07050 • Published Aug 9 • 116

liked a Space 9 months ago

The Ultra-Scale Playbook 🌌

The ultimate guide to training LLM on large GPU Clusters • 3.47k

liked a model 9 months ago

humain-ai/ALLaM-7B-Instruct-preview

Text Generation • 7B • Updated Jul 14 • 22.1k • 150

upvoted a paper 10 months ago

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 424

liked 2 models 10 months ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-32B

Text Generation • 33B • Updated Feb 24 • 1.75M • 1.47k

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27 • 473k • 12.8k

upvoted a paper 11 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 376

liked 2 datasets about 1 year ago

abdoelsayed/Open-ArabicaQA

Preview • Updated Mar 27, 2024 • 343 • 9

BAAI/Infinity-Instruct

Viewer • Updated Jun 17 • 21.9M • 2.99k • 680

liked a model about 1 year ago

maywell/Qwen2-7B-Multilingual-RP

Text Generation • 8B • Updated Jun 25, 2024 • 22 • 57

liked a dataset about 1 year ago

macadeliccc/opus_samantha

Viewer • Updated Jun 21, 2024 • 3.19k • 36 • 21
