Project Analysis
image
FIFA 19 Player Value Prediction & Recruitment Strategy

We decided to analyze the players' data from the game FIFA 19. There is a lot of several players from different countries, playing in different competitions. Their abilities in the game should reflect their real-world skills. The game's creators are about its efforts by creating game attributes, such as sprint speed, force bullets, endings or headers that we can express numerically. In the project, we will try to classify players into a game room based on these attributes, and positions and predict their market value in the game. Since the game does not always show players the market value of the player, but only its attributes, our model can be useful in determining the amount offered for the transfer of the player when negotiating in the game. When in the real world the abilities of the players (e.g., ending) do not bear any numerical. We will try to use the fact that in the game of expression, we can find out which attributes are important for certain gaming positions.

image
Spotify

<h5>Usage</h5> <p>The dataset can be used for:</p> <ul> <li>Building a <strong>Recommendation System</strong> based on user input or preference.</li> <li>Classification purposes based on audio features and available genres.</li> <li>Any other application that you can think of. Feel free to discuss!</li> </ul> <h2>Column Description</h2> <ul> <li><strong>track_id:</strong> The Spotify ID for the track.</li> <li><strong>artists:</strong> The names of the artists who performed the track. If there is more than one artist, they are separated by a semicolon (;).</li> <li><strong>album_name:</strong> The album name in which the track appears.</li> <li><strong>track_name:</strong> The name of the track.</li> <li><strong>popularity:</strong> A value between 0 and 100 indicating the popularity of a track. It is calculated based on the number of plays and recency of plays.</li> <li><strong>duration_ms:</strong> The length of the track in milliseconds.</li> <li><strong>explicit:</strong> Whether the track has explicit lyrics (<em>true</em> = yes, <em>false</em> = no or unknown).</li> <li><strong>danceability:</strong> A measure (0.0 to 1.0) of how suitable a track is for dancing, based on musical elements like tempo and beat strength.</li> <li><strong>energy:</strong> A measure (0.0 to 1.0) of intensity and activity in a track. Higher values indicate fast, loud, and noisy tracks.</li> <li><strong>key:</strong> The musical key of the track, mapped to standard Pitch Class notation (e.g., 0 = C, 1 = C♯/D♭, 2 = D, etc.).</li> <li><strong>loudness:</strong> The overall loudness of a track in decibels (dB).</li> <li><strong>mode:</strong> Indicates the modality of a track (1 = Major, 0 = Minor).</li> <li><strong>speechiness:</strong> Detects the presence of spoken words in a track. Values above 0.66 indicate mostly speech-based tracks.</li> <li><strong>acousticness:</strong> A confidence measure (0.0 to 1.0) of whether the track is acoustic. Higher values indicate acoustic tracks.</li> <li><strong>instrumentalness:</strong> Predicts whether a track contains no vocals. Values closer to 1.0 indicate instrumental tracks.</li> <li><strong>liveness:</strong> Measures the presence of an audience in the recording. Values above 0.8 suggest a live performance.</li> <li><strong>valence:</strong> A measure (0.0 to 1.0) of the musical positiveness of a track. Higher values indicate happier and more cheerful tracks.</li> <li><strong>tempo:</strong> The estimated tempo of a track in beats per minute (BPM).</li> <li><strong>time_signature:</strong> An estimated time signature, ranging from 3 to 7 (e.g., 3/4 to 7/4).</li> <li><strong>track_genre:</strong> The genre of the track.</li> </ul>

image
HR Salary Prediction

The HR salary dataset from Kaggle is a valuable resource for analyzing employee compensation trends. It contains information on employee demographics, job-related details, and compensation figures. This data can be used to identify factors that influence salary, such as experience, education, and job title, and assess organisational pay equity.

image
Spam Detection Application

This project aims to develop an intelligent spam detection system using deep learning techniques. The objective is to accurately classify emails as either "spam" or "not spam" by building and deploying a robust model. The project involves several stages, starting with dataset acquisition, followed by data preprocessing, model training, and evaluation. The best-performing model and preprocessing pipeline are then saved and integrated into an interactive web application using Django and Streamlit.

image
AI-Enhanced Technical Indicator-Based Stock Trading Strategy

<p>Stock market forecasting often suffers from inconsistency due to noisy data, emotional trading, and the isolated use of indicators or sentiment. Traders lack a unified system that combines technical signals, sentiment analysis, and AI models to generate high-confidence trade decisions. Traditional methods based on single indicators (like MACD or RSI alone) often fail in volatile or uncertain markets. </p>

image
YOLOv8-Based Object Detection & Segmentation for Trash

Waste classification and plant disease identification are critical for sustainability and agriculture. Manual sorting is time-consuming, error-prone, and inconsistent. Automating these tasks requires accurate object detection and segmentation, even in noisy, real-world environments with class imbalance and complex backgrounds.

image
GAN Model for MNIST Digit Generation

<p class="text-secondary-light"> Generating realistic handwritten digit images requires a model capable of learning complex data distributions from limited training samples. Traditional image generation methods often lack diversity and tend to overfit the dataset. </p> <div class="mb-16 fw-bold">Project Challenge</div> <p class="text-secondary-light"> The key challenge is to train a GAN (Generative Adversarial Network) that can: </p> <ul class="text-secondary-light" style="list-style-type: disc; padding-left: 20px;"> <li>Generate sharp and visually diverse handwritten digits</li> <li>Maintain training stability throughout epochs</li> <li>Avoid common pitfalls such as mode collapse and vanishing gradients</li> </ul>

image
Stable Diffusion-Based Image Generation and Prompt Engineering

<div class="mb-16 fw-bold">🧩 Problem Statement</div> <p class="text-secondary-light"> Generating high-quality, realistic, and stylistically accurate images using text prompts is a complex task. Models like <strong>Stable Diffusion</strong> can produce widely varied outputs depending on multiple generation parameters. </p> <div class="fw-bold mt-3">Key Factors Influencing Output</div> <ul class="text-secondary-light" style="list-style-type: disc; padding-left: 20px;"> <li><strong>Prompt wording:</strong> Subtle changes can drastically affect composition and detail</li> <li><strong>Negative prompts:</strong> Used to suppress unwanted elements</li> <li><strong>Scheduler selection:</strong> Influences the image generation process (e.g., DDIM, Euler)</li> <li><strong>CFG Scale:</strong> Controls the strength of prompt adherence (higher = more literal)</li> <li><strong>Inference steps:</strong> Affects image quality, detail, and noise reduction</li> </ul> <p class="text-secondary-light"> Understanding how each parameter impacts the output is critical for <strong>controlling quality, style, and realism</strong> in AI art and computer vision applications. </p>

image
CIFAR-10 Image Classification with CNN & Transfer Learning

<div class="mb-16 fw-bold">🚨 Real-World Challenges in Image Classification</div> <ul class="text-secondary-light" style="list-style-type: disc; padding-left: 20px;"> <li><strong>Small input size:</strong> CIFAR-10 images are only 32×32, limiting detail and context</li> <li><strong>Overfitting:</strong> Models often memorize training data rather than generalize</li> <li><strong>Poor robustness:</strong> Struggle with varied lighting, backgrounds, blur, or webcam noise</li> <li><strong>Deployment hurdles:</strong> Real-time webcam predictions suffer from latency and performance issues, especially on low-resource hardware</li> </ul>

image
Recommender System for Amazon Beauty Products

E-commerce platforms like Amazon face the challenge of helping users navigate massive product catalogs. Without personalization, users experience choice overload and irrelevant suggestions. Many products lack complete metadata, and cold-start users/items limit collaborative approaches.

image
Personal Assistant Chatbot – Rule-Based + NLP Hybrid

Users often need a centralized virtual assistant that can handle small tasks like reminders, greetings, simple queries (date/time/weather), or custom commands. Existing solutions (like Alexa or Siri) are either voice-only or not customizable for personal needs, and most bots cannot be extended easily by developers.