Mukul Yadav

B.Tech (CSE) Student | Aspiring Data Engineer

LinkedIn | GitHub

About

Highly motivated B.Tech (CSE) student with a 7.7 CGPA, specializing in Data Engineering and Big Data Analytics. Proven ability to design and implement scalable data pipelines, optimize data warehousing solutions, and develop data-driven applications using Python, SQL, AWS, Azure, and Databricks. Eager to leverage technical expertise and project leadership experience to drive impactful data initiatives in a dynamic professional environment.

Work Experience

Ai Engineer Intern

Infosys

Sep 2025 - Present

Remote

Developing an AI-driven expense forecasting application leveraging LLMs and ML models to provide personalized financial insights and budget planning. Implemented time-series forecasting and predictive analytics to identify spending patterns and project future expenses. Integrated natural language interfaces powered by LLMs for intuitive, conversational financial queries.

  • Applied regression models, ARIMA/LSTM, and anomaly detection techniques to enhance forecast accuracy and detect irregular spending.
  • Delivered an interactive dashboard for real-time visualization of financial health, enabling smarter decision-making

Data Engineer Intern

Mactores

Jul 2025 - Aug 2025

United States (Remote)

Developed expertise in designing and implementing scalable data warehousing and ETL pipelines using Snowflake, Databricks, and AWS services to support real-time data initiatives.

  • Gained hands-on experience in designing and implementing data warehousing and ETL pipelines utilizing Snowflake, Databricks, and AWS services (S3, Lambda, Glue, Redshift) for efficient data transformation.
  • Collaborated with cross-functional teams to analyze and define requirements for real-time data ingestion and processing, ensuring alignment with business needs.
  • Contributed to the development of scalable data solutions, focusing on optimizing data flow and enhancing system performance.

Data Centre Intern

Rashtriya Raksha University

Sep 2023 - May 2024

Gandhinagar, Gujarat, India

Managed data centre operations and enhanced system security, ensuring optimal performance and proactive issue resolution for critical university infrastructure.

  • Assisted in the maintenance and optimization of data centre systems, contributing to enhanced operational efficiency and system stability.
  • Monitored systems continuously, identifying and addressing potential security vulnerabilities and operational issues in accordance with established IT protocols.

Education

Computer Science and Engineering

Rashtriya Raksha University

CGPA: 7.7

Sep 2022 - Jun 2026

Gandhinagar, Gujarat, IN

Courses

  • Data Structure and Algorithms
  • Database Management Systems
  • Big Data Analytics
  • Human Computer Interaction
  • Cloud Computing
  • Data Analytics and Visualization
  • Software Security

Volunteer

Volunteer

National Security Council (NSCS)

Jan 2023 - Dec 2024

Gandhinagar, Gujarat, IN

Volunteered for the National Cyber Security Exercise, contributing to national cybersecurity initiatives and awareness for NCX-2023 & NCX-2024.

  • Contributed to the National Cyber Security Exercise (NCX-2023 & NCX-2024) organized by the National Security Council (NSCS), supporting critical national security objectives.
  • Assisted in various capacities to ensure the smooth execution of cybersecurity drills and awareness programs.
  • Gained exposure to national-level cybersecurity strategies and operational protocols.

Projects Lead

GDSC'RRU

Jan 2023 - Apr 2024

Gandhinagar, Gujarat, IN

Led technical initiatives and community engagement as Projects Lead for GDSC'RRU, fostering skill development and collaboration among students.

  • Orchestrated and led technical workshops, enhancing programming and leadership skills for the student community.
  • Facilitated networking events and collaboration opportunities for over 100 students, fostering a vibrant technical ecosystem.
  • Managed project timelines and resources, ensuring successful execution of student-led technical projects.

Projects

CrimeScope: Crime Records Analytics

Developed a web-based crime record analysis hub utilizing Python, HTML, and CSS to predict crime patterns and visualize incident locations, enhancing public safety insights.

Airline Booking Data Architecture

Designed and implemented a scalable data architecture for airline booking data using Databricks and Delta Lake, optimizing data processing and enhancing decision-making capabilities.

Smart Classroom Management Software

Jan 2023 - Dec 2023

Developed a data-driven Smart Classroom Management Software using HTML, CSS, JavaScript, Flask, and Python, providing real-time attendance tracking, consolidated alerting, and resource organization for over 200 students and teachers.

Languages

English

Skills

Programming Languages

  • Python
  • Java
  • C
  • C++

Databases & Query Languages

  • SQL
  • MySQL
  • MongoDB

Big Data & Streaming Technologies

  • PySpark
  • Apache Kafka
  • Apache Flink
  • Hadoop
  • Databricks
  • Snowflake
  • Delta Lake

Cloud Data Services

  • Azure Data Factory
  • Azure Synapse Analytics
  • Microsoft Fabric
  • AWS S3
  • AWS Lambda
  • AWS Glue
  • AWS Redshift

Version Control & DevOps

  • Git
  • GitHub
  • CI/CD pipelines

Web Technologies

  • HTML
  • CSS
  • JavaScript
  • Bootstrap
  • Flask

Data Analysis & Visualization

  • Pandas
  • Geopy Geocoder
  • DBT (Data Build Tool)

Machine Learning

  • Computer Vision
  • Machine Learning Projects