Summary
Overview
Work History
Education
Skills
Languages
Timeline
Generic

Jean-Michel CARUGE

Arlington,MA

Summary

Results-oriented Data Scientist with over 10 years of experience across ML, AI, and Big Data. Proven track record in building AI models to protect online applications, optimize database marketing strategies, and leading R&D teams in cutting-edge nanotechnology. Recently laid off as part of a company wide reorganization. I am looking forward to new opportunities to use my ML, AI and excellent programming skills to deliver great products for customers, as an individual contributor or manager.

Overview

18
18
years of professional experience

Work History

Senior Data Scientist II

Akamai Technologies
11.2019 - Current
  • Built gradient boosted trees classifiers as web application firewalls to protect Akamai's customers from malicious web traffic. Achieved false positive rate below 0.5%. Collaborated with engineering team to (1) deploy models in production and (2) track KPIs in real time. Model stack is n-gram extraction + vector embedding + principal component analysis + GBT classifier.
  • Built a ML based, spark streaming application to detect distributed denial of service (DDoS) cyber-attacks. It sends detection signals to the Security Operations Control Center, which mitigates the DDoS attack. Detected 99.5 % of SIN flood and 85 % of DNS flood. Spark MLLib implementation (PySpark/SQL) with Azure Event Hubs streaming and Azure Storage.
  • Pioneered web session tracking (instead of signature tracking) to identify malicious agentic web traffic. Word2Vec embedding with large context window uncovers correlation between sequences of web page visits. Detected google-shopping, chatgpt-user, comet browser with more than 93.5 % accuracy.
  • Crafted prompt engineering with LLMs in MCP framework to discover/fixed vulnerabilities in cyber-defenses (red teaming).
  • Reduced third party cloud spend by $65,000 per month by (1) migrating daily ML jobs on-premises and (2) balancing small size ML models with KPIs constraints.
  • Mentored junior data scientists on advanced statistical techniques and best practices.

Data Scientist L6

Amazon Alexa
05.2017 - 04.2019
  • Trained NLP, NLU and Deep Learning models for Amazon Alexa. Added/Fixed more than 130 features for Alexa, FireTV and Echo Show working in a CI/CD DevOps environment.
  • More than 20 on-calls. Provided technical expertise during production-related severity incidents: root cause analysis, quick model fix, and deployment.
  • Proficiency with exact matched and pattern matched rules, finite state transducers, NLP, NLU, conditional random fields and automatic speech recognition models.
  • Mentored junior data scientists, fostering skills in advanced analytics and model development.
  • Automated repetitive tasks using scripting languages such as Python, Shell Scripting, saving time during the analytical process significantly.

Senior Statistical Developer / Consultant

Wellington Management
10.2015 - 04.2017
  • Built real-time trading algorithms for sovereign bond traders. Between 5-7% positive impact on P&L, which is significant since top traders can have multi-million-dollar daily profit.
  • Migrated legacy ExcelVBA trading algorithms onto modern C# trading platforms.

Data Scientist

Jobcase, Inc.
06.2014 - 05.2015
  • Use random forest and Xgboost ML models to optimize database marketing. Measured a 20 % improvement in customer acquisition, click rate and customer conversion rate.

Manager / Principal Scientist

QD Vision, Inc.
04.2012 - 05.2014

Managed a cross-functional team to develop quantum dots LEDs for displays.

Managed budget of $500,000 and 6 full time employees.

Planned and scheduled deadlines for 2 projects per month.

Senior Quantitative Analyst

Barrie & Hibbert
01.2008 - 01.2012
  • Modeled financial market risks and calibrated stochastic models for real-world and market-consistent scenarios. Used SDEs, correlation matrix to correlate stochastic shocks, and Monte Carlo to project cashflows for value at risk, conditional tale expectation projections.
  • Provide technical expertise for sales teams during the acquisition of multi-million-dollar contracts.

Education

Postdoctoral Research - Nanoscience & Nanotechnology

Massachusetts Institute of Technology (MIT)
Cambridge, MA
01.2007

PhD - Physics

Bordeaux I University (France)
Bordeaux, France
01.2001

Master - Physics

Bordeaux I University (France)
Bordeaux, France
01.1998

Skills

  • Machine Learning
  • Deep Learning
  • NLP, NLU
  • Transformers, LLMs
  • Python
  • PySpark
  • Shell Scripting
  • Azure Databricks
  • Microsoft Azure, AWS
  • CI/CD
  • Dimensionality reduction
  • Social network analysis
  • Agile framework

Languages

English
Native or Bilingual
French
Native or Bilingual

Timeline

Senior Data Scientist II

Akamai Technologies
11.2019 - Current

Data Scientist L6

Amazon Alexa
05.2017 - 04.2019

Senior Statistical Developer / Consultant

Wellington Management
10.2015 - 04.2017

Data Scientist

Jobcase, Inc.
06.2014 - 05.2015

Manager / Principal Scientist

QD Vision, Inc.
04.2012 - 05.2014

Senior Quantitative Analyst

Barrie & Hibbert
01.2008 - 01.2012

Postdoctoral Research - Nanoscience & Nanotechnology

Massachusetts Institute of Technology (MIT)

PhD - Physics

Bordeaux I University (France)

Master - Physics

Bordeaux I University (France)
Jean-Michel CARUGE