Shariar Kabir

shariar076

AI & ML interests

NLP, Mech Interp, Data Science

Recent Activity

updated a collection 26 days ago

Stats and LLM

upvoted a paper 26 days ago

Agent Learning via Early Experience

upvoted a paper 26 days ago

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

updated a collection 26 days ago

Stats and LLM

Collection

3 items • Updated 26 days ago

upvoted 2 papers 26 days ago

Agent Learning via Early Experience

Paper • 2510.08558 • Published Oct 9 • 260

Why Low-Precision Transformer Training Fails: An Analysis on Flash Attention

Paper • 2510.04212 • Published Oct 5 • 22

upvoted a paper about 1 month ago

Large Reasoning Models Learn Better Alignment from Flawed Thinking

Paper • 2510.00938 • Published Oct 1 • 57

upvoted an article about 2 months ago

Article

Red-Teaming Large Language Models

Feb 24, 2023

• 33

updated a collection about 2 months ago

Stats and LLM

Collection

3 items • Updated 26 days ago

authored 2 papers about 2 months ago

Beyond the Surface: Probing the Ideological Depth of Large Language Models

Paper • 2508.21448 • Published Aug 29

Testing Conviction: An Argumentative Framework for Measuring LLM Political Stability

Paper • 2504.17052 • Published Apr 23

upvoted a paper about 2 months ago

Statistical Methods in Generative AI

Paper • 2509.07054 • Published Sep 8 • 11

upvoted 2 papers 2 months ago

Visual Representation Alignment for Multimodal Large Language Models

Paper • 2509.07979 • Published Sep 9 • 83

Scaling up Multi-Turn Off-Policy RL and Multi-Agent Tree Search for LLM Step-Provers

Paper • 2509.06493 • Published Sep 8 • 11

upvoted an article 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

• 253

updated a model 5 months ago

shariar076/Llama-3.1-8B-DPO-0R100L

Text Generation • 8B • Updated May 29 • 2

published a model 5 months ago

shariar076/Llama-3.1-8B-DPO-0R100L

Text Generation • 8B • Updated May 29 • 2

updated a model 5 months ago

shariar076/Llama-3.1-8B-DPO-25R75L

Text Generation • 8B • Updated May 29

published a model 5 months ago

shariar076/Llama-3.1-8B-DPO-25R75L

Text Generation • 8B • Updated May 29

updated a model 5 months ago

shariar076/Llama-3.1-8B-DPO-50R50L

Text Generation • 8B • Updated May 29

published a model 5 months ago

shariar076/Llama-3.1-8B-DPO-50R50L

Text Generation • 8B • Updated May 29

updated a model 5 months ago

shariar076/Llama-3.1-8B-DPO-75R25L

Text Generation • 8B • Updated May 29

published a model 5 months ago

shariar076/Llama-3.1-8B-DPO-75R25L

Text Generation • 8B • Updated May 29
