Warehouse

Found 13 projects

Showcase

Showing 13 of 13 projects

Filter by tags (0 selected)

AI & ML

App

Research

Terminal

Web

✨

Driftly

In Development

December 2024

About the Project

Driftly is an AI-driven social media assistant that helps users write, schedule, and automate social media posts. Users can generate engaging content using AI and schedule posts for optimal engagement times. Additionally, users can provide a set of links, and Driftly will scrape data from them to automatically generate and post content daily or weekly.

Built with

React

Vite

Express

MongoDB

Node.js

LangGraph

TailwindCSS

Clerk

OAuth

Jina AI

GPT 4o mini

Key Features

📄

AI-Powered Post Generation

Generate engaging social media posts instantly using LangChain

🔗

Link-Based Content Creation

Provide links, and Driftly will scrape data and use it for content creation

🤖

Auto-Posting & Multi-Platform Support

Integrated with Twitter, LinkedIn, Facebook, and Instagram

🔑

OAuth Authentication

Secure login with Clerk, Passport, and OAuth 1.0a

Workflow

1
User Input: Generate a Twitter thread about AI trends in 2025
2
AI Processing: Uses LangChain to generate text, hashtags, and captions
3
Scheduling & Automation: Review & schedule posts or enable auto-posting

AI-powered social media post generation & scheduling 🚀

Dashboard

AI-powered content generation interface

Post Scheduler

Automated posting calendar

Analytics

Engagement metrics visualization

Analytics

Engagement metrics visualization

Analytics

Engagement metrics visualization

Analytics

Engagement metrics visualization

🤖

Plato

In Development

August 2024

About the Project

Plato is a high-level abstraction over LangChain that makes it easy to build, configure, and deploy AI-powered agents. It provides a structured way to manage:

• LLMs: Seamless integration with multiple language models

• Tools: Simple tool management and integration

• Memory: Efficient conversational memory handling

• Workflow: Simplified orchestration of complex workflows

With just a few lines of code, developers can: • Create multi-agent systems • Integrate external tools • Automate AI-driven tasks

All of this without dealing with the underlying complexity of LangChain.

Built with

LangChain

OpenAI API

LangGraph

Python

Arxiv API

Mermaid

Key Features

🚀

LLM Abstraction

Instantly access multiple models like GPT-4o, GPT-4, and GPT-3.5

🛠

Modular Tools

Easily integrate AI tools like Arxiv search, Python REPL, and more

💬

Memory Management

Supports conversation history with buffer-based memory

🤖

Agent Creation

Define agents with prebuilt prompts and tool integration

🔄

Workflow Orchestration

Route messages between agents dynamically

📊

Graph Visualization

Generate workflow graphs for easy debugging

Workflow

1
Define agents and their available tools
2
Configure memory and conversation handling
3
Set up workflow routing between agents
4
Deploy and monitor agent interactions

View Source

Simplified AI Agent Orchestration with LangChain 🔥🤖

🔍

AI-Powered Visual & Semantic Search

Completed

June 2024

About the Project

AI-Powered Visual & Semantic Search enables users to find products effortlessly by leveraging both images and text. Built with OpenAI's CLIP, it understands product images and descriptions in a shared embedding space, ensuring highly relevant search results. Users can upload images or describe products naturally, and the system intelligently retrieves matching items from the store inventory.

Built with

Flask API

Python

CLIP Embeddings

Pinecone Vector Storage

Shopify Liquid

Javascript

PostgreSQL

Key Features

🖼️

Visual Product Search

Upload an image to find similar products instantly

🔤

Semantic Search

Describe a product in words and get relevant results

🔍

AI-Powered Matching

Utilizes CLIP embeddings for precise search

🛒

Intelligent Recommendations

Suggests products based on user intent

Workflow

1
User uploads an image or enters a text query
2
AI processes the input and matches it with products
3
Displays the most relevant product listings

Revolutionizing e-commerce with AI-driven product search 🛍️🔍

👗

Shop The Look

Completed

January 2024

About the Project

Shop The Look allows users to upload a photo of an influencer or celebrity, and our AI system will identify the clothing & accessories worn in the image. Then, the app will find similar products from the store and display them for purchase.

Built with

LangSAM

Flask API

CLIP Embeddings

Pinecone Vector Storage

Key Features

📸

AI-Powered Outfit Detection

Uses LangSAM (Segment Anything Model) to identify clothing & accessories

🛍️

Shop Similar Items

Finds matching or similar products from the store using Streamlit

🔍

Multi-Category Matching

Detects shirts, dresses, shoes, watches, bags, sunglasses, and more

📱

Seamless User Experience

Simple UI for uploading images & browsing matched products

Workflow

1
Upload a photo containing fashion items
2
AI detects and segments individual clothing items
3
System finds similar products from store inventory
4
Browse and purchase matched items

AI-powered fashion discovery from images 👗🕶️

🎙️

SayNSave

Completed

January 2024

About the Project

SayNSave is an Android app that lets users record their voice, automatically transcribe it into text, and rewrite it into a beautifully formatted PDF. Users can store, revisit, and listen to past recordings, as well as access their saved PDFs anytime.

Built with

Flutter

Dart

PDF Generation

S3 Bucket

MongoDB

Self Hosted OpenAI Whisper Model

Key Features

🎤

Record & Transcribe

Convert speech into text with high accuracy

📄

Beautiful PDF Generation

Automatically formats the transcription into well-structured PDFs

🔄

Revisit & Replay

Listen to past recordings whenever needed

☁️

Cloud Storage

Securely upload recordings and PDFs with S3 Bucket

💡

Custom UI/UX

Uses Font Awesome for icons & Flutter Markdown for rich text

Workflow

1
Record voice using the app interface
2
AI processes speech to text conversion
3
Generate formatted PDF from transcription
4
Save and access content from cloud storage

Voice-to-Text Transcription with PDF Generation 🎙️📄

📰

Noit News App

Completed

January 2024

About the Project

A Flutter-based AI-powered news app that curates and summarizes the latest happenings into 2-minute digestible updates. The app intelligently tracks user interactions, such as which news articles they pause on and read, to refine recommendations over time. By leveraging AI-driven content analysis, it ensures that users stay informed with the most relevant and engaging news stories - based on the trends from x, insta and other social platforms as well. The app features AI-generated audio narration of news summaries, allowing users to listen to the latest news on the go. News content is sourced from newsdata.io API to ensure comprehensive and up-to-date coverage.

Built with

Flutter

Dart

MongoDB

S3 Bucket

Cron

NewsData API

Text To Speech Suno AI Model

GPT 4o Mini

Key Features

🔍

AI-powered news summarization

Delivers personalized, concise news summaries

📰

Personalized content recommendations

Delivers personalized, concise news summaries

📱

Social media trend analysis

Delivers personalized, concise news summaries

🎧

AI-generated audio narration

User can listen to news summaries on the go with most human like audio narration

Workflow

1
User can login using Google OAuth
2
Summarize news articles
3
Personalized content recommendations based on their interests
4
Social media trend analysis
5
Listen to news summaries on the go with most human like audio narration

AI-powered news aggregator that delivers personalized, concise news summaries.

🗺️

Voyage

In Development

August 2023

About the Project

Developed Voyage, a route-planning and booking system that processes OpenStreetMap geo-data to identify train stations, airports, and bus stations. The system finds the most efficient path between any two locations, optimizing across multiple modes of transportation (trains, flights, buses). Additionally, users can book all required transport options in one seamless transaction.

Built with

Neo4j

OpenStreetMap

Node.js

blessed

osm-pbf-parser

rbush

Key Features

🚏

Multi-Modal Routing

Finds optimised paths across trains, buses, and flights

📍

OSM-Based Data Extraction

Uses OpenStreetMap to process real-world geo-coordinates

📈

Graph-Based Search

Implements Neo4j with Dijkstra/A* for efficient routing

🎟

Seamless Booking

Users can book all required tickets in a single step

Workflow

1
User Input: Start from Berlin → Destination: Paris
2
Processing: Identifies nearest transport hubs and constructs route graph
3
Output: Generates optimal multi-modal route
4
User Action: Book all tickets at once

🚄 Train:Berlin → Frankfurt

🛫 Flight:Frankfurt → Paris

🚍 Bus:Airport to final destination

Multi-modal route optimization and booking using OpenStreetMap geo-data and Neo4j graph search.

🎹

AI Music Composer

Completed

March 2023

About the Project

Developed an LSTM-based deep learning model for AI-generated music composition. The model learns musical patterns and structures from training data and generates coherent, melodious sequences. By leveraging sequence-to-sequence modeling, it produces realistic AI-composed music, demonstrating the potential of deep learning in creative applications.

Built with

TensorFlow

Keras

Python

MIDI Processing

LSTM

Key Features

🎵

LSTM-based composition

Generates structured and melodious music

🎹

Pattern learning

Learns from existing music datasets to create original pieces

AI-powered music generation using LSTM-based deep learning models.

AI-Generated Music #1

LSTM

0:000:00

AI-Generated Music #2

LSTM

0:000:00

🌙

Low Light Enhancement using EnlightenGAN

Completed

March 2023

About the Project

Trained an EnlightenGAN-based low-light enhancement model to improve visibility in real-life low-light conditions. The model was tested on various real-world datasets to evaluate its effectiveness in enhancing dark images without overexposure or artifacts. The results demonstrated the potential for real-world applications in photography, surveillance, and night vision enhancement.

Built with

PyTorch

TensorFlow

GANs

Image Processing

Key Features

📸

GAN-powered low-light enhancement

Learns to brighten dark images naturally

🌙

Real-world inference testing

Evaluated on actual low-light scenarios

⚡

No overexposure or artifacts

Preserves image details while improving brightness

Trained a GAN-based model for real-world low-light image enhancement.

👕

Product Recommendation System and TryOn

Completed

November 2021 - December 2021

About the Project

Developed a product recommendation system utilizing sentiment analysis of user comments and implemented T-SNE with cosine similarity to find ingredient similarities. The project also features a virtual try-on system using BeautyGAN, allowing users to visualize how products would look on them. This project was developed as part of the Myntra Hackerramp competition.

Built with

Tensorflow

Python

Keras

BeautyGAN

T-SNE

View Source

🔍

Novel Data Augmentation for NER

Published

June 2021 - January 2022

About the Project

During my internship at Samsung Research, I published research in the International Journal of Speech Technology (SCI journal) focusing on improving ASR text analysis for named entity identification and categorization. The implementation achieved significant improvements in NER models, with Base BERT NER F1 score increasing from 94.73% to 95.37%, and RoBERTa NER from 94.13% to 95.14%.

Published

Journal:International Journal of Speech Technology (SCI)

Date:November 2023

Built with

BERT

RoBERTa

Python

NLP

ASR

Read Publication

Research Overview

NER Data Augmentation Architecture

Samsung Excellence Award

🚗

Safe Ride

Completed

February 2021 - April 2021

About the Project

This prototype aims to demonstrate a driver safety application that ideally should be integrated into vehicles themselves, providing enhanced control over braking and other critical functions. It was selected as one of the top 50 semifinalists in the Google Solution Challenge 2021. The application features drowsiness detection, traffic sign recognition using SSDMobilenet, and an accident detection algorithm that triggers automatic location-based alerts. Performance optimizations have reduced object detection time from 600ms to 96ms per frame, significantly enhancing real-time safety measures.

Built with

Tensorflow

Android Studio

Java

Python

SSD Mobilenet

Drowsiness Detection

Accident Detection

Key Features

🚗

Driver Safety Monitoring

Real-time safety monitoring

🚨

Accident Detection

Automated alerts

🚦

Traffic Sign Recognition

Traffic sign recognition using SSD Mobilenet

👀

Drowsiness Detection

Drowsiness detection

View Source

Google Solution Challenge 2021 Certificate

Top 50 Semifinalist Certificate

🔗

Entity Linking using BERT Similarity

Completed

November 2020

About the Project

This project focuses on enhancing Entity Linking by integrating BERT similarity with the existing ABACO method. Entity Linking is the process of mapping textual mentions to real-world entities, and this work aims to improve its accuracy using deep learning techniques. By incorporating BERT similarity alongside BM25 ranking, the project significantly improves entity disambiguation, making the linking process more robust. The approach leverages DBpedia, Wikipedia APIs, and custom graph-based methods to refine entity selection. This project was submitted as part of the CSE2003 Data Structures and Algorithms coursework at the School of Computer Science and Engineering, Nov 2020.