Data Scientist always in Beta mode.

Just a guy who loves drinking coffee and doing machine learning stuff. My motto is: "All models are wrong, but some are useful." I hope to develop useful ones.

Areas of Interest

Machine Learning

Experience with regression, classification, clustering, and dimensionality reduction. Skills in hyperparameter tuning and predictive modeling.

Deep Learning

Experience with artificial neural networks such as CNNs and vision transformers. Skills in data augmentation, regularization, and optimization.

Natural Language Processing

Experience with multiple NLP models and techniques: from frequency-based approaches to Doc2Vec, embeddings, and Large Language Models like GPT and T5.

Data Visualization

Experience in creating clear and engaging visualizations, including reports, dashboards, and interactive tools to communicate data insights.

Data Management

Experience in designing and maintaining databases, writing SQL queries, and implementing data models. Skills in data cleansing and preprocessing.

Web Development

Experience in building modern, responsive, and visually appealing web applications. Skills in front-end and back-end web development.



Recent Projects

Transformers-Based Fire Detection in Forest Environments

Computer Vision project focused on detecting smoke and fire in forest environments. The Google Vision Transformer was fine-tuned on a custom dataset.

GPT-Based Vocal Virtual Assistant

GIVA is a vocal assistant that combines speech recognition and text-to-speech with the capabilities of GPT. Prompts are engineered so that GPT provides outputs that are short and adapted to be converted to audio.

T5-Based Headline Generator

Text-2-Title is a T5-based model fine-tuned for headline generation. The model was trained on over 110K article abstracts and related titles.

Toxic Comments Classification

NLP project focused on the semantic representation of text. Several techniques are proposed and compared, and a final web app classifies comments on demand.

Theme Park Ride Accidents Analysis

Data Integration project focused on theme park accident analysis leveraging Semantic Web and Linked Data technologies. A web app with interactive visualizations allows the exploration of the results.

Real-Time Credit Card Transactions Analysis

Big Data analysis project focused on transactional data stream processing. A real-time dashboard presents results and insights.

Certifications

Get in Touch

Feel free to contact me using this form. You can also write to me on Medium, Twitter, and LinkedIn.