Portfolio
This portfolio showcases my Data Science and Data Analysis projects from academic, self-learning, and hobby pursuits, along with my achievements and skills.
Achievements
- Champion of ATTACKER 2024 - Trading Algorithm Building Competition.
- Top 2 of The Data Analytics Competition 2024.
- Top 10 The HUB Innovation and Creativity Competition.
- Scientific Research on the topic "User Clustering on Social Networking Platforms and Applications in Business".
University Projects
Analyze social media user behavior to identify clusters and propose targeted strategies, improving engagement and optimizing business outcomes. Deploy
Tools Used: Python (pandas, numpy, scikit-learn, plotly, streamlit), HuggingFace.
Tools Used: Python (pandas, numpy, scikit-learn, plotly, streamlit), HuggingFace.
Using A/B testing, the analysis of the mobile puzzle game Cookie Cats shows that keeping the first gate at level 30 improves 7-day retention compared to moving it to level 40.
Tools Used: Python (pandas, matplotlib).
Tools Used: Python (pandas, matplotlib).
This project develops a chatbot that interprets natural language queries, generates SQL queries, and fetches results from a SQL database in an intuitive, user-friendly way. It leverages the Gemini-Pro model and integrates with the Streamlit GUI for an enhanced interactive experience. Chatbot
Tools Used: MySQL, LangChain, Streamlit.
Tools Used: MySQL, LangChain, Streamlit.
The project aims to predict the stock prices in Vietnam using the Prophet algorithm with data from the 'Vnstock' library and deploying it on Streamlit. Deploy
Tools Used: Excel, Python (pandas, plotly, vnstock, streamlit), Prophet.
Tools Used: Excel, Python (pandas, plotly, vnstock, streamlit), Prophet.
Data Analysis (Netflix Userbase Dataset)
The project aims to gather insights into user behavior and preferences, informing strategies for an improved Netflix experience, including content recommendation and pricing optimization.
Tools Used: Python (pandas, plotly, matplotlib, seaborn), Jupyter Notebook.
Tools Used: Python (pandas, plotly, matplotlib, seaborn), Jupyter Notebook.
Data Visualization (Bike Stores Dataset)
The Bike Stores project analyzes revenue growth trends over the years. Using Microsoft SQL Server for data queries and exports, Tableau is employed to create charts, offering insights into top customers and revenue per salesperson.
Dashboard
Tools Used: SQL Server, Excel, Tableau.
Tools Used: SQL Server, Excel, Tableau.
Micro Projects
University Course
- Data Mining : This repository contains code for a large exercise exploring Streamlit in data mining.
- Artificial Intelligence : This repository contains code for a large exercise that learns how to effectively implement PL-Resolution, DPLL, and WalkSAT in a specific programming language.
- Genetic Algorithm : In this file, I have implemented simple genetic algorithm that finds out the list of numbers which equal to any specified number when summed together.
- Linear Regression : In this file, I aim to solve linear regression using analytical method and also by implementing gradient descent.
Competitions
- The Data Analytics 2024: This repository contains code for a data analytics competition on the topic of building machine learning models to predict users' bank credit.
- Trading Algorithm - ATTACKER 2024: This repository contains the code for the trading algorithm building competition that my teammates and I did together.
- The HUB Innovation and Creativity: This repository contains the code for a startup idea contest (stock prediction) that my teammates and I did together.
Technical Skills
- Programming & Frameworks: Python (pandas, numpy, scikit-learn, pytorch), SQL (BigQuery), Vertex AI
- Data Analysis & Visualization: Data Analysis, A/B Testing, Firebase, Looker Studio/Tableau, Python (matplotlib/seaborn)
- Machine Learning Applications: Forecasting, Regression, Classification, Deep Learning (basic), NLP (basic)
- Other Tools: Git, Jupyter Notebook, Excel, ETL automation