Ali El Filali's picture

Ali El Filali PRO

alielfilali01

·

AI & ML interests

AI Psychometrician ? | NLP (mainly for Arabic) | Interests include Reinforcement Learning and Cognitive sciences among others

Recent Activity

upvoted an article about 15 hours ago

Tiny Agents: a MCP-powered agent in 50 lines of code

reacted to ImranzamanML's post with 👍 about 15 hours ago

🚀 New paper out: "Improving Arabic Multi-Label Emotion Classification using Stacked Embeddings and Hybrid Loss Function" https://huggingface.co./papers/2410.03979 In this work, we tackle some major challenges in Arabic multi-label emotion classification especially the issues of class imbalance and label correlation that often hurt model performance, particularly for minority emotions. Our approach: Stacked contextual embeddings from fine-tuned ArabicBERT, MarBERT, and AraBERT models. A meta-learning strategy that builds richer representations. A hybrid loss function combining class weighting, label correlation matrices, and contrastive learning to better handle class imbalances. 🧠 Model pipeline: stacked embeddings → meta-learner → Bi-LSTM → fully connected network → multi-label classification. 🔍 Extensive experiments show significant improvements across Precision, Recall, F1-Score, Jaccard Accuracy, and Hamming Loss. 🌟 The hybrid loss function in particular helped close the gap between majority and minority classes! We also performed ablation studies to break down each component’s contribution and the results consistently validated our design choices. This framework isn't just for Arabic it offers a generalizable path for improving multi-label emotion classification in other low-resource languages and domains. Big thanks to my co-authors: Muhammad Azeem Aslam, Wang Jun, Nisar Ahmed, Li Yanan, Hu Hongfei, Wang Shiyu, and Xin Liu! Would love to hear your thoughts on this work! 👇

View all activity

Organizations

alielfilali01's activity

upvoted an article about 15 hours ago

Article

Tiny Agents: a MCP-powered agent in 50 lines of code

4 days ago

• 172

upvoted a paper 3 days ago

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

Paper • 2504.16074 • Published 6 days ago • 33

upvoted an article 12 days ago

Article

Introducing the Open Chain of Thought Leaderboard

Apr 23, 2024

• 33

upvoted a collection 12 days ago

DataDecide

A suite of models, data, and evals over 25 corpora, 14 sizes, and 3 seeds to measure how accurately small experiments predict rankings at large scale. • 358 items • Updated 12 days ago • 13

upvoted an article 19 days ago

Article

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

21 days ago

• 16

upvoted a collection 23 days ago

Llama 4

Llama 4 release • 10 items • Updated 23 days ago • 455

upvoted an article 23 days ago

Article

Welcome Llama 4 Maverick & Scout on Hugging Face!

24 days ago

• 141

upvoted a collection 28 days ago

TxGemma Release

Collection of open models to accelerate the development of therapeutics. • 5 items • Updated 25 days ago • 50

upvoted an article about 1 month ago

Article

Introducing Gradio's new Dataframe!

Mar 24

• 23

upvoted an article about 2 months ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 403

upvoted 2 collections about 2 months ago

Gemma 3 Release

24 items • Updated 10 days ago • 346

KITAB-Bench

A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding • 24 items • Updated Feb 24 • 11