Below is a list of some of the projects I worked on.


Click an image or hyperlink in the description to learn more.

Picko

Stack : React Native, TypeScript, Expo, Python, FastAPI, AWS, Vercel

► Developed a recommendation engine using ML techniques (TF-IDF, cosine similarity, semantic search) for personalized movie, TV show, and book suggestions and deployed the app on iOS and Android stores.

► Built an over 300,000-item SQL database with tools like SQLModel and SQLAlchemy for high concurrency and efficiency.

Text-to-Action

Stack : Python, Transformers, LLM, PyTorch, sPacy

► Built an open-source system to enable automation for applications by translating natural language queries to programmatic actions.

► Designed vector-store and parameter-extractor components for efficient similarity search using LLM and NER for query processing.

► With over 1500+ installs, it supports use cases like API interfaces, chatbots, automated task systems, and voice-controlled apps.

AutoVE

Stack : C++, wxWidgets, Electron, FastAPI, Python, LLM, GroqInc, Docker

► Developed a video editing tool that enables text-to-video edits by analyzing and executing text-described actions.

► Implemented a backend in Python and FastAPI to process text inputs, detect actions, extract parameters, and determine edits.

Pragyantra

Stack : Python, Whisper, pyttsxt, facebook-mms, Llama3, microsoft-trOCR, vit-gpt2-image-captioning

► PragYantra is a simple software project to simulate a humanoid robot with vision, hearing, speech, and memory functionalities.

► I wanted this project to create a flexible platform for experimenting with artificial intelligence and human-machine interaction.

► It can work fully offline ( may require a stronger device for faster inference) and online for low-end devices.

Enhanced Video Assistant (EVA)

Stack : ReactJS, Python, FastAPI, Docker, Azure, Ffmpeg, Vite, SQL

► Developed a production-ready video assistant web app, EVA, as a part of capstone project which offers hybrid approach to video editing. Utilized ffmpeg for robust video processing.

► Designed with a user-centric approach, EVA can be used by content creators of all levels. It offers a streamlined experience in video editing and audio processing, requiring no prior editing expertise.

► Integrated open-source machine learning models into workflow to enhance features: improve audio, give multi-layered feedback and condenses video to key moments.

Watch our project video here.

FaceSpyder

Stack : C++, CMake, wxWidgets, OpenCV

► a Face Recognition and Privacy Analysis Tool, a software application developed with the primary goal of providing face and object recognition capabilities. While its current focus is on recognizing faces and objects in various media elements.

► The inspiration for this tool is the growing concern regarding data privacy and the potential misuse of facial recognition technology

► Goal is to build lot of features on top of this face recognition tool.

You can download the application here

VidCaptio

Stack : Python, OpenAI's Whisper, ffmpeg, wxPython

► VidCaptio is a free video captioning software

► Multi-Language Captioning: VidCaptio allows users to add captions in multiple languages to their videos.

► Flexible Captioning Options: Users have the option to choose which languages they want to add captions for.

► Speech Recognition: Uses OpenAI's Whisper to accurately transcribes speech from videos for captioning.

► Video Processing: Utilizing ffmpeg, VidCaptio processes videos for captioning, ensuring high-quality output.

PyOrcid

Stack : Python, Orcid API, OAuth2.0

► PyOrcid is a Python package and API client designed to simplify interactions with the ORCID API.

► ORCID (Open Researcher and Contributor ID) is a nonprofit organization that provides unique identifiers to researchers, ensuring their work is accurately attributed and discoverable.

► PyOrcid enables developers to seamlessly integrate ORCID functionality into their software, allowing users to collect, track, and sync their publication materials, research activities, and other related information.

Wordle

Stack : HTML, CSS, Javascript, Ajax, Python, Flask, beautifulsoup4, Words API

► Play a static version of the game here

► Developed an innovative variation of the classic Wordle game, featuring a dynamic and user-engaging interface including a daily leaderboard.

► Utilized Python and Flask for backend development, handling game logic and server-client interactions efficiently.

► Integrated Ajax for smooth and asynchronous data exchange, keeping the gameplay fluid and responsive.

► Implemented user authentication features, requiring players to log in, thus personalizing the gaming experience and enabling score tracking.

Facebook Clone

Stack : React, React-Native, JS

► For learning purposes, I've been working on a facebook-clone app using React-native with expo.

► One of my first full-scale mobile app-dev experience and I've learned much about react-native's fundamental concepts like components, hooks, states, navigation etc..

WildLens

Stack : Python, Sci-kit Learn, Neural Networks

► WildLens is a machine learning model for fauna recognition, for a non-invasive approach to continuous wildlife census

► This system can be invaluable for automating species identification, aiding wildlife researchers, and contributing to biodiversity conservation efforts.

► Our model trained on large datasets of around 60,000 images while the ethicals considerations were prioritized.

► Learn more about the project in our paper.

Bug Squash Game development

Stack : C++, C, CMake, wxWidgets, XML

► Created a multi-level game called 'Bug Squash' involving various types of bugs attempting to infect programs.

► Implemented agile software development practices along with Unit-Testing practices, dynamic Animations, OOP, and concepts of Visitor and Observer patterns, Polymorphism.

► Contributed to a winning team in a Level 3 design contest organized by the CSE department.

Javascript Piano

I created an interactive piano entirely using HTML, CSS, and JavaScript. This project not only highlights my web development skills but also demonstrates my ability to build beautiful and mobile-friendly front-end applications. Click here to explore and play the piano.

Piano

Brainwaive app

Stack : Python, Django, HTML, CSS, Tailwind, Javascript, OpenAI, NodeJS

► Recipient of the Best Emerging Technology Award at SpartaHack.

► Developed a fully operational, full-stack educational web application utilizing the Django framework.

► Empowers personalized learning with a multitude of components such as custom quizzes and flashcards tailored to students educational requirements.

► This provides a range of complementary services that enhance personalized learning products, equipping students with comprehensive tools essential for success throughout their academic journey.

► Currently in the development phase, our goal is to construct highly customized AI models designed to assist students across different grades, subjects, and formats.

Musical Machines

Stack : C++, C, CMake, miniaudio, wxWidgets, XML

► Developed a musical machine that utilized card input (with rows mapped to musical notes) to enable the various components to play.

► Designed a comprehensive UML class diagram, incorporating associations, inheritance, and attributes, as a preliminary step to development.

► Implemented advanced programming concepts such as Tweening, Visitor and Observer patterns, Polymorphism, Adaptation, Inheritance, and Composition.

► Successfully integrated and adapted the animated software into another program, to create an animated movie.

Note: This is a project from one of my classes (CSE335)

Auto scrapper

Automated Scraping & Data Reorganization Software

Stack : Python, selenium, beautifulsoup4, PyZotero, streamlit

► Developed an automated software solution, simplifying the tasks of merging and reformatting data from diverse sources. This resulted in the creation of an efficient and current departmental database.

► Streamlined the navigation process from the old database format to a new, efficient format that provides enhanced and relevant information for each member of the department.

► Utilized PyZotero and web-scraping techniques to integrate data and references, enhancing the accuracy and credibility of the department's database content.

► Contributed to enhancing departmental efficiency by providing a data-driven solution that enabled faster access to accurate information, ultimately supporting better decision-making processes.

Aquarium

Stack : C++, C, CMake, wxWidgets

► Developed a fun program involving a heavy animated aquarium that can be populated.

► This project involves concepts like file saving/loading of formats, Refactoring, Inheritance, and Unit testing.

Tic-Tac-Toe game

Stack : Python, Tkinter

► A fun personalized project where I developed a Tic Tac Toe game utilizing the Tkinter library in Python. This interactive game allows users to enjoy the classic gameplay experience on their desktop.

► Incorporated intelligent logic to handle player turns, validate moves, and determine winning conditions, ensuring a seamless and enjoyable gameplay experience for users of all skill levels.

Want to know more about any of the projects? Let's Connect!