Jonas Geiping

JonasGeiping

https://jonasgeiping.github.io/

AI & ML interests

Machine Learning Safety, Security and Privacy; Optimization in Deep Learning; Mathematical Optimization; Federated Learning

Recent Activity

new activity 15 days ago

tomg-group-umd/huginn-dataset: Improve Huginn Dataset card: Add paper/code links, sample usage, and update formatting

upvoted a paper 18 days ago

Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

commented on a paper 18 days ago

Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

Organizations

New activity in tomg-group-umd/huginn-dataset 15 days ago

Improve Huginn Dataset card: Add paper/code links, sample usage, and update formatting

#2 opened 17 days ago by

nielsr

commented on a paper 18 days ago

Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models

Paper • 2510.14961 • Published 18 days ago • 6

commented on a paper 27 days ago

Training Dynamics Impact Post-Training Quantization Robustness

Paper • 2510.06213 • Published 27 days ago • 3

commented on a paper about 1 month ago

Strategic Dishonesty Can Undermine AI Safety Evaluations of Frontier LLMs

Paper • 2509.18058 • Published Sep 22 • 12

New activity in tomg-group-umd/huginn-0125 4 months ago

Supervised fine-tuning and dpo implementation

#11 opened 7 months ago by

Vitabile

Update paper reference and code link, and citation

#13 opened 4 months ago by

nielsr

New activity in tomg-group-umd/huginn_swa_100_10_avg_0.9_merge 4 months ago

Improve model card: Add paper and code links, update citation

#1 opened 4 months ago by

nielsr

New activity in tomg-group-umd/huginn-0125 4 months ago

Update paper reference and citation in model card

#12 opened 4 months ago by

nielsr

commented on a paper 4 months ago

GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching

Paper • 2506.20480 • Published Jun 25 • 7

New activity in tomg-group-umd/huginn-0125 7 months ago

Issues faced in reproducing the paper's experiments

#8 opened 9 months ago by

Chensmile

New activity in tomg-group-umd/huginn-dataset 7 months ago

[bot] Conversion to Parquet

#1 opened 8 months ago by

parquet-converter

New activity in tomg-group-umd/pez-dispenser 8 months ago

Runtime Error

#7 opened 8 months ago by

Marseve

New activity in tomg-group-umd/huginn-0125 8 months ago

Fine-tuning Model

#9 opened 9 months ago by

MrWheels523

Could you describe in simple words how it really works?

🤗 1

#3 opened 9 months ago by

MarcinCF

Can we quantize the model to GGUF or GPTQ?

#10 opened 8 months ago by

MLDataScientist

Confusion in the model architecture

#4 opened 9 months ago by

Ink

New activity in tomg-group-umd/huginn-0125 9 months ago

Confusion about the description of evaluation settings

#7 opened 9 months ago by

CrazyD

Add link to code and project page

#6 opened 9 months ago by

nielsr

Parallelization support

#5 opened 9 months ago by

yigitbekir

commented on a paper 9 months ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150
