John
IlikeLLM
0 followers · 1 following
AI & ML interests
llm
Recent Activity
new
activity
about 1 month ago
unsloth/Qwen3-VL-8B-Thinking-FP8:
Why is the pad token of all QWEN VL models in Unsloth "<|vision_pad|>", while QWEN officially uses "pad_token": "<|endoftext|>"
updated
a collection
about 1 year ago
moe
Organizations
None yet
IlikeLLM's activity
New activity in
unsloth/Qwen3-VL-8B-Thinking-FP8
about 1 month ago
Why is the pad token of all QWEN VL models in Unsloth "<|vision_pad|>", while QWEN officially uses "pad_token": "<|endoftext|>"
1
#1 opened about 1 month ago by
IlikeLLM
updated
a collection
about 1 year ago
moe
Collection · 1 item · Updated Aug 25, 2024