Portfolio

This portfolio showcases my Data Science and Data Analysis projects from academic, self-learning, and hobby pursuits, along with my achievements and skills.

Achievements

  • Champion of ATTACKER 2024 - Trading Algorithm Building Competition.
  • Top 2 of The Data Analytics Competition 2024.
  • Top 10 The HUB Innovation and Creativity Competition.
  • Scientific Research on the topic "User Clustering on Social Networking Platforms and Applications in Business".

Projects

User Clustering

Analyze social media user behavior to identify clusters and propose targeted strategies, improving engagement and optimizing business outcomes. Web App
Tools Used: Python (Pandas, Numpy, Scikit-learn, Plotly), Streamlit, HuggingFace.

Game A/B Testing

Using A/B testing, the analysis of the mobile puzzle game Cookie Cats shows that keeping the first gate at level 30 improves 7-day retention compared to moving it to level 40.
Tools Used: Python (Pandas, Matplotlib)

Recruitment Platform Django

The main purpose of the Django application is to assess the suitability between resumes and job descriptions.
Web App
Tools Used: Python, Django, Langchain, Javascript, Bootstrap, Render

MySQL Chatbot

This project develops a chatbot that interprets natural language queries, generates SQL queries, and fetches results from a SQL database in an intuitive, user-friendly way. It leverages the Gemini-Pro model and integrates with the Streamlit GUI for an enhanced interactive experience. Chatbot
Tools Used: MySQL, LangChain, Streamlit, Railway

Vietnamese Stock Prediction

The project aims to predict the stock prices in Vietnam using the Prophet algorithm with data from the 'Vnstock' library and deploying it on Streamlit. Web App
Tools Used: Excel, Python (Pandas, Plotly, Vnstock, Streamlit), Prophet

Data Analysis (Netflix Userbase Dataset)

The project aims to gather insights into user behavior and preferences, informing strategies for an improved Netflix experience, including content recommendation and pricing optimization.
Tools Used: Python (Pandas, Plotly, Matplotlib, Seaborn), Jupyter Notebook.

Data Visualization (Bike Stores Dataset)

The Bike Stores project analyzes revenue growth trends over the years. Using Microsoft SQL Server for data queries and exports, Tableau is employed to create charts, offering insights into top customers and revenue per salesperson. Dashboard
Tools Used: Microsoft SQL Server, Excel, Tableau.

Data Cleaning (FIFA21 Dataset)

In this project, I will focus on cleaning and preparing messy FIFA 21 data for analysis. The data often contains inconsistencies, missing values, and various formatting issues. By addressing these problems, the data will be ready for accurate and meaningful analysis.
Tools Used: Python (Pandas, Numpy), Jupyter Notebook.

Micro Projects

  • Statistics and Machine Learning

    • Genetic Algorithm : In this file, I have implemented simple genetic algorithm that finds out the list of numbers which equal to any specified number when summed together.
    • Linear Regression : In this file, I aim to solve linear regression using analytical method and also by implementing gradient descent.

  • University Course

    • Data Mining : This repository contains code for a large exercise exploring Streamlit in data mining.
    • Artificial Intelligence : This repository contains code for a large exercise that learns how to effectively implement PL-Resolution, DPLL, and WalkSAT in a specific programming language.

  • Competitions


Core Competencies

  • Languages: Python, SQL, R
  • Methodologies: Data Analysis, Machine Learning, Time Series, NLP, Statistics, Data Structures & Algorithms
  • Tools: BigQuery, MySQL, Microsoft SQL Server, Git, MS Excel, Tableau, LangChain
  • Other Tools: Adobe Photoshop, Adobe Premiere Pro