Hello there!

I’m Malek BEN YOUSSEF

- Data Science Junior
- MLOps Enthusiast

ABOUT ME

I'm a Data Science student at INSAT, specializing in AI and passionate about its rapid progress.

📜 Google Cloud Professional ML Engineer Certified.
📊 Transforming data into insights.
☁️ Experienced in Google Cloud Platform (GCP).

Walter Patterson

WHAT I OFFER

As a data scientist, I work on building data pipelines to collect, clean, and move data.
I create smooth data flows that support machine learning and analytics using cloud solutions like GCP.

ML Model Development
Natural Language Processing
Generative AI and Prompt Engineering
Data Visualization and Reporting
Cloud-Based AI Solutions

SKILLS

With a knack for quick learning, I focus on mastering many skills and technologies needed for Data Science.

Apache Airflow

EXPERIENCE

I have over 3 years of experience working on many projects and technologies.
Here is a timeline of my key experiences.

Data Engineer Intern
@Crewmeister
Feb 2025 - Present
Munich, Germany
  • Built automated ETL pipelines to ingest and process data from multiple SaaS APIs into BigQuery.
  • Replaced Airflow pipelines and standard views with BigQuery materialized views, reducing query costs by over 25%.
  • Implemented data quality checks and anomaly detection using BigQuery and Dataplex, resulting in 98% data accuracy.
  • Optimized complex SQL queries, reducing query size and BigQuery costs.
Machine Learning Intern
@Premier Cloud Inc - Remote
Jul 2024 - Present
Victoria, British Columbia, Canada
  • Built an NL2SQL system that converts prompts from sales managers into SQL queries and generates natural language responses about customers' billing information.
  • Worked with Gemini LLM and achieved approximately 95% output accuracy.
  • Optimized SQL queries to improve information retrieval speed, and enhanced LLM output accuracy through well-structured prompt engineering.
  • Deployed the application using Google Cloud Run and managed a CI/CD pipeline with Google Cloud Build.
Machine Learning Engineer (Part-Time)
@Silver Brain AI AG
Nov 2023 - Jan 2024
Geneva, Switzerland - Remote
  • Developed a chatbot using the pre-trained Llama 2 model with 7 billion parameters to deliver detailed information about the company's services to clients.
  • Employed Retrieval-Augmented Generation (RAG) techniques to optimize LLM text generation, resulting in an improvement of approximately 30% in output quality.
Full Stack Developer Intern
@Safran
Jul 2023 - Sep 2023
Tunis, Tunisia
  • Created a dashboard for 100+ users, offering quick access to the company's equipment status.
  • Integrated data visualization and real-time updates, achieving a 25% improvement in operational efficiency.
Data Science Intern
@University of Moncton
Jul 2022 - Aug 2022
New Brunswick, Canada - Remote
  • Implemented a solution for colon disease detection that reduced false positives by 72%.
  • Enhanced diagnostic precision in colon disease detection through advanced data augmentation and EfficientNet model optimization techniques.
PROJECTS
June 2024
Autonomous Robot Navigation using Deep Reinforcement Learning

Personal Project

  • Developed an autonomous robot navigation system using the TD3 algorithm, resulting in an 85% improvement in navigation accuracy.
  • Implemented deep reinforcement learning techniques to enable obstacle avoidance, achieving a 92% success rate in various test environments.
December 2023
Soccer Chatbot

Personal Project

  • Built an information retrieval system about soccer based on 1,000+ pages of PDF files.
  • Applied the Llama 2 model and RAG techniques, achieving 95% accuracy in information retrieval.
  • Used the Streamlit library to develop and showcase an interactive web app.