Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Anna Wegmann's picture
1 14

Anna Wegmann

AnnaWegmann
ruthenian8's profile picture sofom's profile picture SethTharo's profile picture
·
https://annawegmann.github.io/
  • anna_wegmann
  • AnnaWegmann
  • annawegmann.bsky.social

AI & ML interests

Including language variation in ML/NLP | Evaluation | Open-Source | Tokenizers | Diverse Pre-Training Data

Recent Activity

liked a model 6 days ago
Blablablab/multilingual-style-representation-Llama-3.2
liked a Space 6 days ago
HuggingFaceTB/smol-training-playbook
liked a model 8 days ago
simplescaling/s1-32B
View all activity

Organizations

NLP Group at Utrecht University's profile picture

authored a paper 4 months ago

Tokenization is Sensitive to Language Variation

Paper • 2502.15343 • Published Feb 21
authored 2 papers about 1 year ago

What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs

Paper • 2404.06670 • Published Apr 10, 2024 • 1

Same Author or Just Same Topic? Towards Content-Independent Style Representations

Paper • 2204.04907 • Published Apr 11, 2022 • 1
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs