AI & ML interests

Making funny and goofy LMs and AIs

FlameF0X
posted an update 5 months ago
I am very sad to say that the budget for creating the SnowflakeCore-G1 1B and 7B MoE models has run out, and I can't pre-train them anymore.
FlameF0X
posted an update 5 months ago
Training for SnowflakeCore-G1-1B and 7B will be restarted, because I have now implemented DeepSpeed and managed to use two GPUs.
FlameF0X
posted an update 5 months ago
The development of SnowflakeCore-G1-7B-MoE is getting delayed. In the meantime, I am working on SnowflakeCore-G1-1B-MoE, which will be a pre-trained chatbot.
  • 1 reply
FlameF0X
posted an update 5 months ago
SnowflakeCore-G1-7B-MoE is in development. I can't say yet when it will be published, because it's big and requires a lot of computational power.
  • 1 reply
FlameF0X
posted an update 5 months ago
Hello! Important announcement: I will rename SnowflakeCore-G1-Medium to SnowflakeCore-G1-Tiny2, because it will have the same number of parameters as the Tiny version but is trained on more data.
  • 1 reply
FlameF0X
posted an update 6 months ago
Currently working on SnowflakeCore-G1-Medium. [Updated loss curve]
  • 3 replies
FlameF0X
posted an update 6 months ago