1 1 30

Michael Benayoun

michaelbenayoun

AI & ML interests

None yet

Recent Activity

updated a model 5 days ago

michaelbenayoun/llama-2-tiny-4kv-heads-16layers-random

liked a Space 2 months ago

nanotron/ultrascale-playbook

liked a model 3 months ago

deepseek-ai/DeepSeek-R1

View all activity

Organizations

michaelbenayoun's activity

updated a model 5 days ago

michaelbenayoun/llama-2-tiny-4kv-heads-16layers-random

Text Generation • Updated 5 days ago • 4.28k

liked a Space 2 months ago

2.53k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 3 months ago

deepseek-ai/DeepSeek-R1

Text Generation • Updated Mar 27 • 1.72M • • 12k

updated 2 models 7 months ago

michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random

Text Generation • Updated Oct 14, 2024 • 4.66k

michaelbenayoun/t5-tiny-random

Text2Text Generation • Updated Oct 10, 2024 • 1.24k

liked a model 7 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 5.67M • • 3.9k

liked a Space 9 months ago

653

Whisper Large V3

🤫

Transcribe audio and YouTube videos to text

liked 3 models 9 months ago

liked a model 10 months ago

mistralai/Mistral-7B-Instruct-v0.3

Text Generation • Updated Aug 21, 2024 • 710k • • 1.65k

liked a model 11 months ago

meta-llama/Meta-Llama-3-8B-Instruct

Text Generation • Updated Sep 27, 2024 • 1.18M • • 3.93k

updated 2 models 11 months ago

optimum/tiny_random_bert_neuron

Feature Extraction • Updated Jun 5, 2024 • 10 • 1

optimum/tiny_random_bert_neuronx

Feature Extraction • Updated Jun 5, 2024 • 12

liked a dataset 11 months ago

HuggingFaceFW/fineweb

Viewer • Updated Jan 31 • 25B • 839k • 2.12k

updated a collection 11 months ago

Distributed Training

Collection

Papers and resources related to distributed training. • 5 items • Updated Jun 3, 2024