Hi there, I'm Mahendra 👋

Data Scientist and AI Expert
& ML/DL/NLP and Generative AI Engineer

The journey from raw data to impactful solutions is what truly excites me.
I am a dedicated, results-oriented Data Scientist with a passion for
solving complex business problems.





TECHNICAL SKILLS



  • Machine Learning
  • Deep Learning
  • NLP
  • Generative AI
  • Python
  • Flask
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Scikit-learn
  • TensorFlow
  • Keras
  • LangChain
  • LangGraph
  • Hugging Face
  • OpenAI
  • Azure Data
  • Azure AI
  • PostgreSQL
  • MongoDB
  • Pinecone
  • Chroma
  • Git
  • GitHub
  • DVC
  • MLflow
  • Airflow
  • Docker
  • Evidently AI
  • GitHub Actions
  • AWS
  • Azure
  • Fabric
  • Excel
  • Tableau
  • Linux
  • Streamlit




REAL WORLD CAPABILITIES





DATA SCIENCE TECHNIQUES


  • Collecting Raw Data
  • Defining the Problem Statement
  • Exploratory Data Analysis
  • Feature Engineering
  • Feature Selection
  • Model Building
  • Hyperparameter Tuning
  • Model Evaluation



NLP & Deep Learning Techniques


  • Text Classification
  • Sentiment Analysis
  • NLP Text Preprocessing
  • Text Encoding Techniques
  • Word Embeddings
  • ANN, LSTM, RNN
  • Transformer



Machine Learning Techniques


  • Classification
  • Regression
  • Clustering
  • Outlier Detection
  • Tree & Non-Tree ML Models
  • Hyperparameter Tuning
  • Ensemble Learning - Bagging, Boosting, Stacking
  • Evaluation Metrics & Performance Optimization



PRODUCT DEVELOPMENT


  • Proficient in using Git and GitHub for source code management and version control. I use Python OOP to design and build robust, scalable data science solutions, automating ML, DL, Generative AI (LLM), and NLP pipelines across the entire lifecycle, from data ingestion to model monitoring. I handle integration, scaling, and deployment with CI/CD, Docker, and cloud services, so the solutions stay reliable at every stage and remain adaptable to future requirements.



Generative AI Techniques


  • RAG & Agentic RAG
  • LLM Summarizer
  • Chatbots-Q/A
  • Knowledge Graphs
  • Fine-Tuning LLMs
  • AI Agents & SQL Agents
  • Prompt Engineering
  • Open-Source & OpenAI Models



PRODUCT MONITORING


  • I use DVC to track data and pipelines. User-generated data is stored in a secure database, MongoDB or Pinecone (for LLM applications), which provides a solid foundation for model training. With Airflow, I automate continuous retraining so that models adapt to evolving trends and patterns in the data. I also use Evidently AI for model monitoring to keep performance from degrading over time.
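
As an illustration of the kind of check a monitoring step performs, here is a minimal sketch of a data drift score, the Population Stability Index (PSI), in plain Python. It does not use Evidently AI's API and is not the monitoring code behind this portfolio; the function name `psi`, the bin count, and the sample data are assumptions made for the example.

```python
import math


def psi(reference, current, n_bins=5, eps=1e-6):
    """Population Stability Index between two numeric samples (illustrative)."""
    lo, hi = min(reference), max(reference)
    # Equal-width bins over the reference range; fall back to 1.0 if the range is zero.
    width = (hi - lo) / n_bins or 1.0

    def proportions(values):
        counts = [0] * n_bins
        for v in values:
            # Clamp out-of-range values into the first or last bin.
            idx = min(max(int((v - lo) / width), 0), n_bins - 1)
            counts[idx] += 1
        # Floor each proportion at eps so the logarithm is always defined.
        return [max(c / len(values), eps) for c in counts]

    ref_p, cur_p = proportions(reference), proportions(current)
    return sum((c - r) * math.log(c / r) for r, c in zip(ref_p, cur_p))


# Hypothetical feature values: training-time data vs. shifted production data.
reference = [i / 10 for i in range(100)]
shifted = [v + 5 for v in reference]

score_same = psi(reference, reference)
score_shifted = psi(reference, shifted)
```

A commonly quoted rule of thumb treats a PSI above roughly 0.25 as a significant shift, which could serve as a trigger for the retraining job; the exact threshold is a judgment call for each feature.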



PROJECTS


There's no greater satisfaction than building end-to-end pipelines, deploying models strategically, and developing ML, DL, NLP, and Generative AI products that solve complex business problems.






EXPERIENCE



Geak Minds Nov 2024 - present


Data Scientist


Gained hands-on experience in Generative AI techniques and cloud technologies while working as a Data Scientist at Geak Minds. Applied Retrieval-Augmented Generation (RAG), AI Agents, SQL Agents, and LLM Summarizers to build solutions such as Q&A chatbots, and fine-tuned Large Language Models (LLMs) for domain-specific tasks.

Leveraged tools like Microsoft Fabric and Azure Data Fundamentals to architect scalable and efficient data solutions. Built expertise in working with Transformer-based architectures, contributing to the development of next-generation AI applications.


Skills:   Generative AI (RAG, AI Agents & SQL Agents, LLM Summarizer, Q&A Chatbot, LLM Fine-Tuning), Microsoft Fabric, Azure Data Fundamentals, Transformers
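
For readers unfamiliar with Retrieval-Augmented Generation, the retrieval step mentioned above can be sketched roughly as follows. This is a toy illustration in plain Python, not the system built at Geak Minds: a bag-of-words vector stands in for a real embedding model, an in-memory list stands in for a vector store such as Pinecone or Chroma, and the names `embed`, `retrieve`, and `build_prompt` are hypothetical.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy 'embedding': a bag-of-words term-count vector."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def retrieve(query, documents, top_k=1):
    """Return the top_k documents most similar to the query."""
    q = embed(query)
    ranked = sorted(documents, key=lambda d: cosine(q, embed(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(query, documents):
    """Place the retrieved context ahead of the question for the LLM."""
    context = "\n".join(retrieve(query, documents))
    return f"Answer using only this context:\n{context}\n\nQuestion: {query}"


documents = [
    "DVC tracks datasets and pipeline stages alongside Git.",
    "Airflow schedules recurring retraining jobs.",
    "Docker packages an application with its dependencies.",
]
query = "Which tool schedules retraining jobs?"
prompt = build_prompt(query, documents)
```

In a real pipeline the resulting prompt would be sent to an LLM; that call is omitted here because it depends on the provider.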

Codesoft Oct 2023


Full-Stack Data Science Internship


Developed an end-to-end customer attrition prediction application at Codesoft, including setting up a virtual environment and a collaborative GitHub repository.

Built a robust training pipeline with ordinal encoding, ADASYN, and SMOTE techniques, achieving 99.17% accuracy using the Extra Tree Classifier. Developed a user-friendly interface with Flask and HTML, and deployed the application on an Azure cloud server using Docker and DVC for version control and resource optimization.

Skills:   Problem framing, Preprocessing techniques, Model Building, Model Evaluation, Deployment, DVC, Docker, Building pipelines from data ingestion to model evaluation
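
The SMOTE step in the training pipeline above balances classes by creating synthetic minority samples. The sketch below shows the core idea in plain Python, interpolating between a minority point and one of its nearest minority neighbours. It is a simplified illustration rather than the project's code (a real pipeline would more likely rely on a library such as imbalanced-learn); the function name `smote_oversample` and the sample data are assumptions.

```python
import math
import random


def smote_oversample(minority, n_new, k=3, seed=0):
    """Create n_new synthetic points by interpolating between minority neighbours."""
    if len(minority) < 2:
        raise ValueError("need at least two minority samples to interpolate")
    rng = random.Random(seed)
    synthetic = []
    for _ in range(n_new):
        base = rng.choice(minority)
        # k nearest minority neighbours of the base point (excluding itself).
        neighbours = sorted(
            (p for p in minority if p is not base),
            key=lambda p: math.dist(base, p),
        )[:k]
        neighbour = rng.choice(neighbours)
        gap = rng.random()  # position along the segment base -> neighbour
        synthetic.append(
            tuple(b + gap * (n - b) for b, n in zip(base, neighbour))
        )
    return synthetic


# Hypothetical imbalanced data: 4 minority samples against 10 majority samples.
minority = [(1.0, 1.0), (1.2, 0.9), (0.8, 1.1), (1.1, 1.3)]
majority_count = 10
new_points = smote_oversample(minority, majority_count - len(minority))
balanced_minority = minority + new_points
```

Oversampling of this kind is normally applied to the training split only, so that synthetic points do not leak into the evaluation data and inflate the reported accuracy.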

Oyasis Infobyte Aug 2023


Data Science Internship


I led a team of 7 in developing an automated MongoDB database connector in Python, built in a Python 3.8 virtual environment and published on PyPI. I managed dependencies, conducted rigorous testing, automated setup with init_setup.sh, and implemented CI/CD for smooth deployment, improving MongoDB connectivity and developer productivity.


Skills:  Unit Testing, Integration Testing, MongoDB, Python Package Development, PyPI, CI/CD




EDUCATION



Lovely Professional University



Bachelor of Technology


April 2020 - May 2024


Computer Science Engineering, Specialization in Data Science