nyuuzyou
posted an update 10 days ago
πŸ¦… SmolLM2-Eagle Collection - nyuuzyou/smollm2-eagle-680263bf97f0c7e6bbe4936b

Collection of fine-tuned bilingual language models featuring:
- Models in three parameter sizes (135M, 360M, and 1.7B), based on HuggingFaceTB's SmolLM2 models
- Both standard and GGUF formats for flexible deployment in llama.cpp and Ollama
- Fine-tuned on nyuuzyou/EagleSFT dataset (536,231 Russian-English QA pairs derived from 739k+ real user queries)
- Experimental Russian language capabilities while maintaining English performance
- Limited Russian capabilities due to SFT-only approach without Russian pre-training
- Environmental impact: ~19.75 kg CO2eq

This collection provides compact models for research on bilingual language capabilities, resource-constrained environments, and educational applications. Not recommended for production use due to experimental nature and inherent limitations. Available under Apache 2.0 license.
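Since the collection ships GGUF builds for llama.cpp and Ollama, deployment can look like the sketch below. The exact repo names are assumptions based on the collection's naming (check the collection page for the real GGUF repo IDs); both tools support pulling GGUF models directly from Hugging Face.

```shell
# Hypothetical repo name -- verify against the actual collection before use.
# Ollama can run a GGUF repo straight from Hugging Face:
ollama run hf.co/nyuuzyou/SmolLM2-360M-Eagle-GGUF

# llama.cpp equivalent, fetching the GGUF file via its Hugging Face support:
llama-cli --hf-repo nyuuzyou/SmolLM2-360M-Eagle-GGUF \
  -p "Привет! Как дела?"
```

Either path avoids a manual download step, which suits the resource-constrained use cases the collection targets.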
  • Yeah, the pretraining is important.
  • And SmolLM2's English tokenizer and small vocab size make it hard to adapt to other languages (especially Chinese).
  • On the other hand, I trained a French chatbot on a multilingual base and it's better than expected. Also Apache 2.0 like yours.