Hi, my name is

Akalbir

Teaching machines t

A passionate AI/ML engineer with a knack for turning data into decisions. I specialize in crafting intelligent systems that can predict the future, recognize patterns, and occasionally beat me at chess. My algorithms are so smart, they’ve started leaving me sticky notes with optimization tips.

About Me

As an Artificial Intelligence and Machine Learning engineer, I don’t just build intelligent systems—I bring them to life. With a deep passion for pushing the boundaries of what’s possible, I’ve made it my mission to create AI that not only thinks, but understands, adapts, and evolves. My journey began with a strong foundation in computer science, but it was the limitless potential of AI that truly captivated me. From developing machine learning models that can predict the future with uncanny accuracy to implementing deep learning algorithms that can recognize patterns even the human eye might miss, I’ve had the privilege of working on projects that are as exciting as they are impactful.

Currently, I’m diving deep into the world of large language models and recommendation systems, crafting AI that can not only understand and respond to human language, but also anticipate and cater to individual needs and preferences. It’s a thrilling time to be in this field, and I’m always eager to collaborate with like-minded innovators to transform groundbreaking ideas into game-changing realities.

So if you’re ready to embark on a journey to the forefront of AI and ML, I’m ready to be your Mr. Miyagi. Let’s create intelligent systems that don’t just change the way we live and work, but redefine what we believe is possible.

The Tools of My Trade:
  • Python
  • C++
  • SQL & NoSQL DBs
  • TensorFlow & PyTorch
  • Scikit-learn
  • Hugging Face
  • Ray
  • MLflow
  • Apache Spark
  • Surprise & Merlin
  • Kubeflow
  • Hadoop

Experience

Software Engineer Lead Intern – ML Focus - Workup
May 2024 - present
  • AI-Driven Multimodal Recommendation System: Conceptualized, pitched to CEO, and secured approval for a multimodal AI-driven recommendation system, subsequently leading a team of 6 to integrate computer vision, NLP, and audio processing techniques for comprehensive candidate profile analysis, resulting in a 30% reduction in candidate screening time.
  • Custom Deep Learning Architecture: Engineered a custom deep learning architecture using PyTorch, leveraging state-of-the-art techniques such as Vision Transformers (ViT) for video analysis, BERT for text processing, and WaveNet for audio feature extraction. Resulted in a 45% increase in candidate-job matching accuracy and a 40% improvement in shortlisting efficiency.
  • Implemented Efficient Large-Scale Data Processing: Utilized NVIDIA Merlin for efficient large-scale data processing and model training, reducing computation time by 60% and enabling real-time recommendations for over 1500 candidates, leading to a 50% increase in candidate processing capacity.
  • Developed Scalable Backend and Frontend Integration: Built a scalable backend using Flask and MongoDB, capable of handling 1000+ concurrent users, and integrated with the frontend via RESTful APIs and Axios, achieving a 30% reduction in server response time and a 25% increase in user satisfaction.
  • Designed Analytics Dashboard for Recruiters: Designed and implemented an analytics dashboard for recruiters, providing real-time statistics on candidate pools, match rates, and hiring trends, leading to a 25% increase in recruiter productivity and a 15% improvement in data-driven decision making.
Research Collaborator/Intern - SCAAI
Apr 2022 - Jan 2023
  • Pre-trained Deep Neural Network: Utilized self-supervised learning techniques like Non-Parametric Instance Discrimination (NPID) and contrastive learning in PyTorch to pre-train a deep neural network on a large, unlabeled retinal fundus dataset, improving feature extraction accuracy by 30% without manual annotations.
  • Transfer Learning for Retinal Disease Classification: Fine-tuned the pre-trained model on a dataset of over 6000 retinal fundus images, effectively leveraging general visual understanding to classify various retinal diseases.
  • Distributed Computing with Apache Spark: Employed Apache Spark for distributed computing and data processing, efficiently scaling training and inference processes to achieve 86% accuracy in diagnosing diabetic retinopathy and other retinal conditions.
  • Web Platform Development: Designed and deployed a web platform using Flask and Heroku, achieving 70% real-time diagnostic accuracy. Integrated D3.js for dynamic data visualization to present diagnostic results clearly.
  • Natural Language Processing (NLP) for Enhanced Predictions: Implemented Python-based NLP techniques to analyze patient medical records, incorporating relevant information into the model. This approach improved predictive accuracy by 15%, enabling more informed and precise diagnoses.
Research Intern - Tata Consultacy Services
Apr 2021 - Jan 2022
  • Customer Behavior Analysis and Clustering: Analyzed 150K customer trip data points using K-Means clustering in scikit-learn, applying silhouette analysis and the elbow method to identify distinct transportation behavior patterns and enable targeted service improvements.
  • Emissions Forecasting and Model Development: Utilized insights from clustering analysis to engineer LSTM, ARIMA, and regression models on Amazon SageMaker, leveraging sensor data from 50K vehicles. Implemented sliding window techniques and hyperparameter tuning via grid search, reducing emissions forecast error by 15%.
  • Data Visualization and Insights: Integrated model outputs and 500K emissions data points into an interactive Tableau dashboard, featuring advanced visualizations like heatmaps and geospatial plots. Facilitated data-driven decision-making for executive stakeholders by extracting key insights through correlation analysis and feature importance ranking.
  • Model Optimization and Deployment: Enhanced model explainability by 35% using SHAP values and LIME, and improved reliability by 18% through ensemble methods and cross-validation techniques. Ensured consistent deployment using Git for version control and Docker for containerization.
  • Research and Knowledge Sharing: Published a conference paper on novel time series approaches for environmental forecasting, contributing to the field of sustainable transportation and demonstrating the impact of the internship project.

Education

2023 - 2025
Master of Computer Science (Artificial Intelligence Specialisation)
University of Southern California, Los Angeles
I am pursuing a Master’s in Computer Science with a specialization in Artificial Intelligence at the University of Southern California. Through USC’s world-class program, I am gaining expertise in cutting-edge AI technologies such as Generative AI, Large Language Models (LLMs), Reinforcement Learning, and Recommendation Systems. With hands-on projects and guidance from renowned faculty, I am developing the skills to create intelligent systems that can revolutionize industries. USC’s vibrant learning environment inspires me to explore the boundless potential of AI and create impactful solutions in the field of computer science.
2019 - 2023
Bachelor of Computer Engineering
International Institute of Information Technology, Pune
  • Graduated in the top 1% of the class with a 3.74/4 GPA
  • Gained expertise in Python, C/C++, Machine Learning, Deep Learning, NLP, Computer Vision, Algorithms, and Data Structures
  • Published 4 research papers in renowned conferences and journals like IEEE and Springer Nature
  • Developed a strong grasp of cutting-edge AI techniques and their real-world applications through hands-on projects and collaborations with esteemed professors

Extracurricular Activities

  1. IEEE:
  • As the youngest IEEE Student Branch Chair, led a 100+ member team and oversaw 15 activities engaging 300+ students
  • Served as IEEE Student Section Representative, leading 25+ branches and collaborating with 8 sections to organize 5 events for 1000+ participants
  • Promoted IEEExtreme 15 as an IEEE Ambassador, increasing participation by 150%
  1. CESA:
  • Spearheaded 12 coding competitions, 5 hackathons, and 20 tech talks for a 600+ member community
  • Organized “Code Uncode” and “HackOverflow” events, attracting 500+ participants from 15 colleges (40% increase)
  • Conducted 10 workshops on trending CS topics, enabling skill development for 250+ students

Projects

RXPurposeAI - A Gen AI Approach to Drug Repurposing
Generative Adversarial Networks (GANs) Pytorch & Tensorflow Generative Tensorial Reinforcement Learning (GENTRL) RDKit Apache Kafka PyAutoDock
RXPurposeAI - A Gen AI Approach to Drug Repurposing
Led an AI-driven drug discovery and repurposing project, training a DeepPurpose model on 7,229 molecules and 1,849 proteins. Utilized MPNN and CNN encodings, achieving an AUROC of 0.71. Integrated Generative AI techniques, including planned implementation of GENTRL for innovative drug design. Developed a framework combining large-scale generative AI with DeepPurpose, improving data engine accuracy to 78.85% and streamlining potential drug candidate identification.
Multiple Reinforcement Learning Approaches on Rogue-Gym
Reinforcement Learning Pytorch & JAX Proximal Policy Optimization (PPO) Apache Spark
Multiple Reinforcement Learning Approaches on Rogue-Gym
Developed deep RL agents using PPO and PyTorch for procedurally generated Rogue-Gym environments, improving average episode rewards by 15%. Implemented hierarchical RL, transformer-based approaches, and multi-task learning, reducing required training episodes by 30% through enhanced generalization. Utilized latent variable models with PyTorch and JAX, employing sparse training and distributed computing to boost performance by 40% on complex environments.
Predictive Lead Scoring for Cost-Effective Customer Acquisition
Pandas Scikit-learn Pycaret MLFlow Apache Airflow PyTest
Predictive Lead Scoring for Cost-Effective Customer Acquisition
Developed a lead scoring system to predict the likelihood of leads purchasing a product, helping an ed-tech startup optimize marketing efforts and reduce customer acquisition costs. Conducted extensive EDA using pandas profiling, handled missing values, and reduced high cardinality. Utilized Pycaret for model experimentation, identifying LightGBM as the best-performing model with 89% accuracy. Implemented data, training, and inference pipelines using Airflow for streamlined processing. Logged models to MLflow registry for tracking and comparison. Developed unit tests using pytest to validate preprocessing functionalities.
SmartGesture TV:- AI-Powered Remote-Free Control System
Computer Vision Tensorflow CNN & RNN Pandas Video Processing OpenCV Transfer Learning
SmartGesture TV:- AI-Powered Remote-Free Control System
Developed an advanced gesture recognition system for smart TVs using deep learning techniques. Implemented and compared two approaches: Conv3D and Conv2D+RNN with transfer learning (MobileNet). Processed a dataset of 30-frame gesture videos, applying data preprocessing techniques including image resizing and normalization. Achieved 97% validation accuracy with the Conv2D+RNN model, enabling accurate recognition of five distinct gestures for TV control. The system provides an intuitive, remote-free interface for volume adjustment, video navigation, and playback control.
NewsGPT: Intelligent News Summarization with Extractive and Abstractive Methods
Gradio Natural Language Processing (NLP) Information Retrival GPT-3 Language Model Text Summarization Web Crawling
NewsGPT: Intelligent News Summarization with Extractive and Abstractive Methods
Developed an efficient news summarization application using two distinct methods: extractive summarization and abstractive summarization with GPT-3. The extractive approach selects the most important sentences from the original article, while the abstractive method generates human-like summaries using GPT-3's language understanding capabilities. Implemented a comparison feature to highlight the differences in content, style, and length between the two approaches. Created an intuitive web interface using Gradio for easy user interaction and summary evaluation.

Achievements

Founder & CEO - ISEAS
Successfully founded and led ISEAS Startup as CEO, developing AI/ML-based vehicular and personal safety solutions. Managed a cross-functional team of 15, overseeing product development, engineering, and marketing initiatives.
Best IEEE Student Volunteer
Selected as the top volunteer among 1000+ IEEE members in the region, recognizing exceptional contributions that increased student engagement by 40% and led three community outreach programs impacting over 500 local students.
Best IEEE Student Chair Award
Chosen from 50 colleges chapters for outstanding leadership, successfully organizing 15 high-impact events that increased chapter membership by 75% and secured $5,000 in sponsorships for AI and ML initiatives.
Best IEEE Student Researcher Award
Awarded to the top researcher out of 500 undergraduate students, acknowledging groundbreaking work in AI and ML that resulted in two publications that were presented at a prestigious international conference with 1000+ attendees.

Get in Touch

My inbox is always open. Whether you have a question or just want to say hi, I’ll try my best to get back to you!