Model training - From architecture to performance

Training is at the heart of every AI project. I design models adapted to your data and objectives, combining scientific rigor and mastery of modern tools.

Choice of architecture

Selection according to the nature of the problem: CNN, RNN, LSTM, Transformer, diffusion models.

Data pre-processing

Standardization, augmentation, split train/test to guarantee data quality and representativeness.

Model optimization

Use of AdamW, learning rate management, gradient clipping to stabilize and accelerate learning.

Distributed training

Scalability with multi-GPU, TPU, DeepSpeed, FSDP for large or complex models.

Experience follow-up

Tracking of runs, hyperparameters and metrics via Weights & Biases, MLflow for optimum reproducibility.

Model training - From architecture to performance

My expertise

AI Development

LLM & NLP

Fine-tuning & Training

Neural networks & machine learning

Retrieval-Augmented Generation (RAG)

Edge AI

Deployment & MLOps