Aumkesh Chaudhary

Researcher in Machine Learning and Applied AI

∞ About Me

Academic Journey

BS in Computer Science and Data Analytics at the Indian Institute of Technology, Patna. Investigating complex systems through a lens of both structure and curiosity.

Research Passion

Exploring the depths of Machine Learning, Mathematical Modeling, and Scientific Simulations. Bridging the gap between theoretical knowledge and real-world applications.

Philosophy

Driven by curiosity and guided by logic, I believe in learning through building, experimenting, and asking the right questions. Every problem is a puzzle waiting to be solved.

Education

Indian Institute of Technology, Patna

Bachelor of Science (Honours) in Computer Science and Data Analytics

CPI: 9.12

Work Experience

Research Intern
Ahmedabad University (On-site)
Project: Understanding Decision Making and Coordination in Animal Groups
Supervisor: Dr. Jitesh Jhawar — Funded by Max Planck Society, Germany
May 2025 – June 2025
  • Studied collective behavior in animal groups via simulations and mathematical modeling
  • Configured a multi-user Linux workstation with fstab-based persistent disk mounting, user environment management, and centralized Anaconda deployment for shared computational workflows
  • Processed and annotated video data collected daily over 4 months, labeling frames with multiple bird species for behavior and identity tagging
  • Used YOLO architecture to detect and analyze species-specific behaviors in datasets of birds
  • Utilized Idtrackerai to track ants and spiders, rendering trajectory data to visualize movement paths and group dynamics
Project Intern
IIT Mandi iHub and HCi Foundation (Remote)
Apr 2025 – June 2025
  • Contributed to backend development of a web platform for coursework and user management
  • Built core features using Django, integrated REST APIs, and managed data pipelines with SQLite
  • Integrated Google OAuth2.0 for secure login, role-based access control, and improved platform security
  • Ensured system reliability through modular code design and efficient database handling
  • Collaborated with cross-functional teams to design scalable backend logic supporting multi-user roles and secure data access

Projects

Developed CareerNavigator, a Machine Learning model that evaluates candidates' employability by analyzing key attributes and predicting suitability for a job role.
  • Cleaned, pre-processed, and performed feature engineering on the dataset containing 70k+ datapoints.
  • Designed, trained, and evaluated multiple algorithms, utilizing performance metrics such as accuracy, confusion matrix, and F1-score to optimize model effectiveness.
  • Selected kernel support vector machine as the best-fit model with the highest accuracy of 80% and F1 score of 0.82.
  • Presented findings to a panel of faculty and industry experts, receiving recognition for its innovation and effectiveness.
Developed an object detection model using YOLOv8n to identify and locate solar panels in low resolution satellite imagery.
  • Processed and annotated a comprehensive dataset of 29,625 solar panel instances with high precision.
  • Achieved 94.27% precision and 91.77% recall, significantly improving solar infrastructure mapping capabilities.
  • Implemented sophisticated object detection techniques with mean Average Precision (mAP50) of 96.8%.
  • Designed and deployed an optimized real-time inference pipeline, hosting it on Hugging Face for seamless accessibility and large-scale solar panel detection.
Fine-tuned Microsoft's SpeechT5 model to improve pronunciation of Technical English Terms focusing on modifying the phonetic representation to ensure precise pronunciation of abbreviations and acronyms.
  • Achieved a 25% enhancement in speech quality over the baseline TTS model, as reflected in a significant Mean Opinion Score (MOS) improvement, with notable enhancements in handling technical terms.
  • Optimized the baseline model to generate a Native Italian Voice by enhancing pronunciation, prosody, and stress patterns in line with the phonological rules of the Italian language, significantly improving speech quality and naturalness compared to other existing models.
  • Harnessed tools like Transformers, PyTorch, and Hugging Face Datasets to implement advanced machine learning and NLP techniques, ensuring optimal model performance and reliability.
  • Implemented 8-bit dynamic quantization to linear layers using PyTorch’s native API, reducing memory usage by 30% while maintaining inference accuracy.
Developed a dynamic web application combining Optical Character Recognition (OCR) and Text-to-Speech (TTS) technologies to improve accessibility.
  • Integrated Tesseract for multi language OCR and Web Speech API for TTS, enabling accurate text extraction from images and high-quality text-to-speech conversion.
  • Designed a responsive interface with intuitive navigation, ensuring a seamless user experience.
  • Enabled PDF export, word/character count, and keyword search for efficient document handling.
  • Developed functionality for managing user profiles, such as signup, login, activity tracking, and editable data.

Extracurricular Activities

🎹 Piano

7+ years of experience in classical and contemporary styles. Actively compose original pieces that explore and innovate across diverse genres and elements of music.

🎸 Guitar

2+ years of experience in Indian and western contemporary styles. Exploring fingerstyle techniques and songwriting.

Technical Competence

Languages
Python
Java
R
C
MATLAB
Web Development
Django
Flask
Node.js
React
HTML/CSS/JS
Bootstrap
Databases & APIs
MySQL
SQLite
MongoDB
REST APIs
Operating Systems
Linux (Ubuntu)
Windows
macOS

🌐 Connect with Me

The most beautiful thing we can experience is the mysterious. It is the source of all true art and science.

— Albert Einstein