This project aims to develop a machine learning model to predict whether individuals can concentrate on their studies based on their social media usage. The goal is to identify key factors that contribute to concentration difficulties and provide insights that could help mitigate these issues.
The dataset used in this project was sourced from Kaggle. It contains demographic information and social media usage data of individuals, along with target variables indicating concentration difficulties.
Data Preprocessing: Cleaning, handling missing values, encoding categorical variables.
Feature Engineering: Selecting relevant features, creating new features.
Model Selection: Training various machine learning models (e.g., Logistic Regression, Decision Trees, Random Forest, SVM) and selecting the best-performing one.
Model Evaluation: Evaluating model performance using metrics such as accuracy, precision, recall, and F1 score.
Deployment: Deploying the model for real-world use.
The best-performing model achieved an accuracy of X% on the test set. Feature importance analysis revealed that time spent on social media and validation seeking behavior were the most influential factors affecting concentration difficulties.
git clone https://github.com/cepdnaclk/e19-co544-Concentration-Difficulty-Prediction-Model.git
cd social-media-concentration-prediction
python preprocess.py
python train_model.py
python evaluate_model.py