Found 13 projects

Showcase

Showing 13 of 13 projects

Filter by tags (0 selected)

AI & ML
App
Research
Terminal
Web
✨

Driftly

In Development
December 2024

About the Project

Driftly is an AI-driven social media assistant that helps users write, schedule, and automate social media posts. Users can generate engaging content using AI and schedule posts for optimal engagement times. Additionally, users can provide a set of links, and Driftly will scrape data from them to automatically generate and post content daily or weekly.

Built with

React
Vite
Express
MongoDB
Node.js
LangGraph
TailwindCSS
Clerk
OAuth
Jina AI
GPT 4o mini

Key Features

πŸ“„

AI-Powered Post Generation

Generate engaging social media posts instantly using LangChain

πŸ”—

Link-Based Content Creation

Provide links, and Driftly will scrape data and use it for content creation

πŸ€–

Auto-Posting & Multi-Platform Support

Integrated with Twitter, LinkedIn, Facebook, and Instagram

πŸ”‘

OAuth Authentication

Secure login with Clerk, Passport, and OAuth 1.0a

Workflow

  1. 1

    User Input: Generate a Twitter thread about AI trends in 2025

  2. 2

    AI Processing: Uses LangChain to generate text, hashtags, and captions

  3. 3

    Scheduling & Automation: Review & schedule posts or enable auto-posting

AI-powered social media post generation & scheduling πŸš€

Dashboard

Dashboard

AI-powered content generation interface

Post Scheduler

Post Scheduler

Automated posting calendar

Analytics

Analytics

Engagement metrics visualization

Analytics

Analytics

Engagement metrics visualization

Analytics

Analytics

Engagement metrics visualization

Analytics

Analytics

Engagement metrics visualization

πŸ€–

Plato

In Development
August 2024

About the Project

Plato is a high-level abstraction over LangChain that makes it easy to build, configure, and deploy AI-powered agents. It provides a structured way to manage:

β€’ LLMs: Seamless integration with multiple language models

β€’ Tools: Simple tool management and integration

β€’ Memory: Efficient conversational memory handling

β€’ Workflow: Simplified orchestration of complex workflows

With just a few lines of code, developers can: β€’ Create multi-agent systems β€’ Integrate external tools β€’ Automate AI-driven tasks

All of this without dealing with the underlying complexity of LangChain.

Built with

LangChain
OpenAI API
LangGraph
Python
Arxiv API
Mermaid

Key Features

πŸš€

LLM Abstraction

Instantly access multiple models like GPT-4o, GPT-4, and GPT-3.5

πŸ› 

Modular Tools

Easily integrate AI tools like Arxiv search, Python REPL, and more

πŸ’¬

Memory Management

Supports conversation history with buffer-based memory

πŸ€–

Agent Creation

Define agents with prebuilt prompts and tool integration

πŸ”„

Workflow Orchestration

Route messages between agents dynamically

πŸ“Š

Graph Visualization

Generate workflow graphs for easy debugging

Workflow

  1. 1

    Define agents and their available tools

  2. 2

    Configure memory and conversation handling

  3. 3

    Set up workflow routing between agents

  4. 4

    Deploy and monitor agent interactions

Simplified AI Agent Orchestration with LangChain πŸ”₯πŸ€–

πŸ”

AI-Powered Visual & Semantic Search

Completed
June 2024

About the Project

AI-Powered Visual & Semantic Search enables users to find products effortlessly by leveraging both images and text. Built with OpenAI's CLIP, it understands product images and descriptions in a shared embedding space, ensuring highly relevant search results. Users can upload images or describe products naturally, and the system intelligently retrieves matching items from the store inventory.

Built with

Flask API
Python
CLIP Embeddings
Pinecone Vector Storage
Shopify Liquid
Javascript
PostgreSQL

Key Features

πŸ–ΌοΈ

Visual Product Search

Upload an image to find similar products instantly

πŸ”€

Semantic Search

Describe a product in words and get relevant results

πŸ”

AI-Powered Matching

Utilizes CLIP embeddings for precise search

πŸ›’

Intelligent Recommendations

Suggests products based on user intent

Workflow

  1. 1

    User uploads an image or enters a text query

  2. 2

    AI processes the input and matches it with products

  3. 3

    Displays the most relevant product listings

Revolutionizing e-commerce with AI-driven product search πŸ›οΈπŸ”

Video thumbnail
πŸ‘—

Shop The Look

Completed
January 2024

About the Project

Shop The Look allows users to upload a photo of an influencer or celebrity, and our AI system will identify the clothing & accessories worn in the image. Then, the app will find similar products from the store and display them for purchase.

Built with

LangSAM
Flask API
CLIP Embeddings
Pinecone Vector Storage

Key Features

πŸ“Έ

AI-Powered Outfit Detection

Uses LangSAM (Segment Anything Model) to identify clothing & accessories

πŸ›οΈ

Shop Similar Items

Finds matching or similar products from the store using Streamlit

πŸ”

Multi-Category Matching

Detects shirts, dresses, shoes, watches, bags, sunglasses, and more

πŸ“±

Seamless User Experience

Simple UI for uploading images & browsing matched products

Workflow

  1. 1

    Upload a photo containing fashion items

  2. 2

    AI detects and segments individual clothing items

  3. 3

    System finds similar products from store inventory

  4. 4

    Browse and purchase matched items

AI-powered fashion discovery from images πŸ‘—πŸ•ΆοΈ

Video thumbnail
πŸŽ™οΈ

SayNSave

Completed
January 2024

About the Project

SayNSave is an Android app that lets users record their voice, automatically transcribe it into text, and rewrite it into a beautifully formatted PDF. Users can store, revisit, and listen to past recordings, as well as access their saved PDFs anytime.

Built with

Flutter
Dart
PDF Generation
S3 Bucket
MongoDB
Self Hosted OpenAI Whisper Model

Key Features

🎀

Record & Transcribe

Convert speech into text with high accuracy

πŸ“„

Beautiful PDF Generation

Automatically formats the transcription into well-structured PDFs

πŸ”„

Revisit & Replay

Listen to past recordings whenever needed

☁️

Cloud Storage

Securely upload recordings and PDFs with S3 Bucket

πŸ’‘

Custom UI/UX

Uses Font Awesome for icons & Flutter Markdown for rich text

Workflow

  1. 1

    Record voice using the app interface

  2. 2

    AI processes speech to text conversion

  3. 3

    Generate formatted PDF from transcription

  4. 4

    Save and access content from cloud storage

Voice-to-Text Transcription with PDF Generation πŸŽ™οΈπŸ“„

Video thumbnail
πŸ“°

Noit News App

Completed
January 2024

About the Project

A Flutter-based AI-powered news app that curates and summarizes the latest happenings into 2-minute digestible updates. The app intelligently tracks user interactions, such as which news articles they pause on and read, to refine recommendations over time. By leveraging AI-driven content analysis, it ensures that users stay informed with the most relevant and engaging news stories - based on the trends from x, insta and other social platforms as well. The app features AI-generated audio narration of news summaries, allowing users to listen to the latest news on the go. News content is sourced from newsdata.io API to ensure comprehensive and up-to-date coverage.

Built with

Flutter
Dart
MongoDB
S3 Bucket
Cron
NewsData API
Text To Speech Suno AI Model
GPT 4o Mini

Key Features

πŸ”

AI-powered news summarization

Delivers personalized, concise news summaries

πŸ“°

Personalized content recommendations

Delivers personalized, concise news summaries

πŸ“±

Social media trend analysis

Delivers personalized, concise news summaries

🎧

AI-generated audio narration

User can listen to news summaries on the go with most human like audio narration

Workflow

  1. 1

    User can login using Google OAuth

  2. 2

    Summarize news articles

  3. 3

    Personalized content recommendations based on their interests

  4. 4

    Social media trend analysis

  5. 5

    Listen to news summaries on the go with most human like audio narration

AI-powered news aggregator that delivers personalized, concise news summaries.

Video thumbnail
πŸ—ΊοΈ

Voyage

In Development
August 2023

About the Project

Developed Voyage, a route-planning and booking system that processes OpenStreetMap geo-data to identify train stations, airports, and bus stations. The system finds the most efficient path between any two locations, optimizing across multiple modes of transportation (trains, flights, buses). Additionally, users can book all required transport options in one seamless transaction.

Built with

Neo4j
OpenStreetMap
Node.js
blessed
osm-pbf-parser
rbush

Key Features

🚏

Multi-Modal Routing

Finds optimised paths across trains, buses, and flights

πŸ“

OSM-Based Data Extraction

Uses OpenStreetMap to process real-world geo-coordinates

πŸ“ˆ

Graph-Based Search

Implements Neo4j with Dijkstra/A* for efficient routing

🎟

Seamless Booking

Users can book all required tickets in a single step

Workflow

  1. 1

    User Input: Start from Berlin β†’ Destination: Paris

  2. 2

    Processing: Identifies nearest transport hubs and constructs route graph

  3. 3

    Output: Generates optimal multi-modal route

  4. 4

    User Action: Book all tickets at once

πŸš„ Train:Berlin β†’ Frankfurt
πŸ›« Flight:Frankfurt β†’ Paris
🚍 Bus:Airport to final destination

Multi-modal route optimization and booking using OpenStreetMap geo-data and Neo4j graph search.

Video thumbnail
🎹

AI Music Composer

Completed
March 2023

About the Project

Developed an LSTM-based deep learning model for AI-generated music composition. The model learns musical patterns and structures from training data and generates coherent, melodious sequences. By leveraging sequence-to-sequence modeling, it produces realistic AI-composed music, demonstrating the potential of deep learning in creative applications.

Built with

TensorFlow
Keras
Python
MIDI Processing
LSTM

Key Features

🎡

LSTM-based composition

Generates structured and melodious music

🎹

Pattern learning

Learns from existing music datasets to create original pieces

AI-powered music generation using LSTM-based deep learning models.

AI-Generated Music #1

LSTM

0:000:00

AI-Generated Music #2

LSTM

0:000:00
πŸŒ™

Low Light Enhancement using EnlightenGAN

Completed
March 2023

About the Project

Trained an EnlightenGAN-based low-light enhancement model to improve visibility in real-life low-light conditions. The model was tested on various real-world datasets to evaluate its effectiveness in enhancing dark images without overexposure or artifacts. The results demonstrated the potential for real-world applications in photography, surveillance, and night vision enhancement.

Built with

PyTorch
TensorFlow
GANs
Image Processing

Key Features

πŸ“Έ

GAN-powered low-light enhancement

Learns to brighten dark images naturally

πŸŒ™

Real-world inference testing

Evaluated on actual low-light scenarios

⚑

No overexposure or artifacts

Preserves image details while improving brightness

Trained a GAN-based model for real-world low-light image enhancement.

Video thumbnail
πŸ‘•

Product Recommendation System and TryOn

Completed
November 2021 - December 2021

About the Project

Developed a product recommendation system utilizing sentiment analysis of user comments and implemented T-SNE with cosine similarity to find ingredient similarities. The project also features a virtual try-on system using BeautyGAN, allowing users to visualize how products would look on them. This project was developed as part of the Myntra Hackerramp competition.

Built with

Tensorflow
Python
Keras
BeautyGAN
T-SNE
πŸ”

Novel Data Augmentation for NER

Published
June 2021 - January 2022

About the Project

During my internship at Samsung Research, I published research in the International Journal of Speech Technology (SCI journal) focusing on improving ASR text analysis for named entity identification and categorization. The implementation achieved significant improvements in NER models, with Base BERT NER F1 score increasing from 94.73% to 95.37%, and RoBERTa NER from 94.13% to 95.14%.

Published

Journal:International Journal of Speech Technology (SCI)
Date:November 2023

Built with

BERT
RoBERTa
Python
NLP
ASR
Research Overview

Research Overview

NER Data Augmentation Architecture

Samsung Excellence Award

Samsung Excellence Award

Samsung Excellence Award

πŸš—

Safe Ride

Completed
February 2021 - April 2021

About the Project

This prototype aims to demonstrate a driver safety application that ideally should be integrated into vehicles themselves, providing enhanced control over braking and other critical functions. It was selected as one of the top 50 semifinalists in the Google Solution Challenge 2021. The application features drowsiness detection, traffic sign recognition using SSDMobilenet, and an accident detection algorithm that triggers automatic location-based alerts. Performance optimizations have reduced object detection time from 600ms to 96ms per frame, significantly enhancing real-time safety measures.

Built with

Tensorflow
Android Studio
Java
Python
SSD Mobilenet
Drowsiness Detection
Accident Detection

Key Features

πŸš—

Driver Safety Monitoring

Real-time safety monitoring

🚨

Accident Detection

Automated alerts

🚦

Traffic Sign Recognition

Traffic sign recognition using SSD Mobilenet

πŸ‘€

Drowsiness Detection

Drowsiness detection

Google Solution Challenge 2021 Certificate

Google Solution Challenge 2021 Certificate

Top 50 Semifinalist Certificate

πŸ”—

Entity Linking using BERT Similarity

Completed
November 2020

About the Project

This project focuses on enhancing Entity Linking by integrating BERT similarity with the existing ABACO method. Entity Linking is the process of mapping textual mentions to real-world entities, and this work aims to improve its accuracy using deep learning techniques. By incorporating BERT similarity alongside BM25 ranking, the project significantly improves entity disambiguation, making the linking process more robust. The approach leverages DBpedia, Wikipedia APIs, and custom graph-based methods to refine entity selection. This project was submitted as part of the CSE2003 Data Structures and Algorithms coursework at the School of Computer Science and Engineering, Nov 2020.

Built with

BERT
DBpedia
Python
BM25 Ranking
Graph-based Linking
Tensorflow
Keras

Key Features

πŸ”—

Entity Linking

Entity linking using BERT similarity

πŸ”ƒ

BERT Training

Fine-tuning BERT for entity disambiguation tasks

πŸ“ˆ

Centrality Score

Centrality score for subgraph ranking

πŸ”

Best Route

Find the fastest/shortest route for matching start node to end node for entity linking

Workflow

  1. 1

    Entity Mention Detection

  2. 2

    Candidate Generation

  3. 3

    BM25 Ranking

  4. 4

    Bidirectional Understanding with BERT

  5. 5

    Centrality Score

  6. 6

    Final Entity Linking

Entity Linking Architecture

Entity Linking Architecture

Complete overview of the Entity Linking system architecture

Candidate Generation

Candidate Generation

Process of generating candidate entities for mentions

BM25 Ranking Phase

BM25 Ranking Phase

Initial ranking of candidates using BM25 algorithm

Bidirectional Understanding with BERT

Bidirectional Understanding with BERT

How BERT processes context in both directions for better entity understanding

BERT Training Process

BERT Training Process

Fine-tuning BERT for entity disambiguation tasks

Degree of Centrality

Degree of Centrality

Graph-based approach for entity disambiguation using centrality metrics