Hi, I'm

Wail EL BANI

NLP Engineer & ML Practitioner

I build intelligent systems that understand and generate human language. Passionate about transformers, information extraction, and making research reproducible.

W

About Me

A bit about who I am and what drives me.

I'm a data scientist and NLP engineer with a strong focus on building language-understanding systems. My work spans the full lifecycle — from data collection and model fine-tuning to deployment and monitoring.

I'm particularly interested in information extraction, multilingual NLP, and efficient fine-tuning of large language models. I believe good ML engineering means writing clean, reproducible code that bridges research and production.

When I'm not training models, I enjoy writing about ML topics, contributing to open-source projects, and exploring new datasets.

Based in Morocco
M.Sc. in Data Science & AI

Research interests

  • Natural Language Processing
  • Information Extraction & NER
  • Large Language Models
  • Deep Learning

Skills

Technologies and tools I work with regularly.

Languages

PythonSQLBashTypeScript

ML / DL Frameworks

PyTorchscikit-learnKeras / TensorFlowXGBoost

NLP

HuggingFace TransformersspaCyNLTKLangChainGensim

Data & Visualization

PandasNumPyMatplotlib / SeabornPlotly

Infrastructure & Tools

GitDockerMLflowFastAPIJupyter
Legend:expertadvancedintermediatebeginner

Projects

A selection of things I've built.

Multilingual NER with BERT

2024

Fine-tuned mBERT on CoNLL-2003 for Named Entity Recognition across English, French, and Arabic, achieving 91.2 F1. Includes a FastAPI inference endpoint.

NLPPyTorchTransformersHuggingFaceFastAPI

Aspect-Based Sentiment Analysis

2024

End-to-end pipeline for aspect-level sentiment classification on product reviews using a fine-tuned RoBERTa model with custom span extraction.

NLPPyTorchRoBERTaPython

Automated Text Classification Pipeline

2023

Scalable ML pipeline for multi-label document classification. Compares TF-IDF + SVM baseline against fine-tuned DistilBERT, with MLflow experiment tracking.

NLPscikit-learnTransformersMLflow

Other Projects

Time-Series Forecasting with LSTMs

2023

LSTM-based model for energy consumption forecasting. Achieves 8% MAPE on a public dataset, with a Streamlit dashboard for interactive exploration.

Deep LearningPyTorchStreamlitTime Series

RAG-based Question Answering System

2024

Retrieval-Augmented Generation system over a private document corpus using LangChain, FAISS, and a quantized Mistral-7B model for local inference.

LLMRAGLangChainFAISSPython

Experience

My professional and academic journey.

NLP / Machine Learning Intern

Company Name·City, Country

Jun 2024Dec 2024

  • Developed a multilingual NER pipeline using fine-tuned BERT models, improving entity recall by 15%.
  • Built a text classification system for automatic ticket routing, processing 10k+ daily requests.
  • Reduced model inference latency by 30% through ONNX export and quantization.
PyTorchHuggingFace TransformersFastAPIPython

M.Sc. in Data Science & Artificial Intelligence

Your University·City, Country

Sep 2023Present

  • Specialization in Natural Language Processing and Deep Learning.
  • Relevant coursework: Statistical Learning, Neural Networks, Information Retrieval, Advanced NLP.

B.Sc. in Computer Science

Your University·City, Country

Sep 2020Jun 2023

  • Graduated with honours. Final-year project on text summarization with sequence-to-sequence models.

Get in Touch

I'm open to research collaborations, NLP roles, and interesting projects.

Whether you have a question, a project idea, or just want to talk about NLP — my inbox is always open. I'll do my best to get back to you promptly.