Skip to content

azizzoaib786/airbnb-dataset-prediction-multiple-algo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

2 Commits
Β 
Β 
Β 
Β 

Repository files navigation

🏑 Airbnb Rental Income Revenue Prediction

This project focuses on building predictive models to estimate the success of Airbnb property listings, leveraging Logistic Regression and Random Forest algorithms. The aim is to determine whether a given property listing will be successful based on various features extracted and engineered from the Airbnb dataset.

🎯 Objective

To accurately predict the success likelihood of Airbnb property listings by employing machine learning techniques and insightful data analysis.

🚩 Dataset

The dataset utilized for this project is extensive and includes numerous listings with detailed features. It can be downloaded from:

(Alternatively, a direct link to the processed dataset will soon be provided.)

πŸ” Project Workflow

The project follows these structured steps:

  1. Exploratory Data Analysis (EDA)

    • Data cleaning and initial analysis to understand distributions, trends, and relationships between variables.
  2. Data Visualization

    • Heatmaps to visualize correlations between various features.
  3. Manual Feature Selection

    • Utilizing domain knowledge to select relevant and impactful features.
  4. Handling Missing Values

    • General imputation methods to handle incomplete data.
  5. Advanced Imputation (KNN)

    • K-Nearest Neighbors algorithm used specifically for imputing critical missing values in review_scores_rating and reviews_per_month.
  6. Data Preparation for Modeling

    • Encoding categorical features and scaling numeric features to enhance model performance.
  7. Feature Engineering

    • Creating additional meaningful features derived from the dataset to boost predictive accuracy.
  8. Logistic Regression Model

    • Application of Logistic Regression to classify listings as successful or unsuccessful.
  9. Random Forest Model

    • Employing a Random Forest classifier for improved accuracy and handling complex feature interactions.
  10. Model Comparison

    • Evaluating and comparing both models to determine the best performing predictive approach.

πŸ› οΈ Tools and Libraries

  • Python
  • Pandas
  • NumPy
  • Matplotlib
  • Seaborn
  • Scikit-learn

βš™οΈ Usage

Clone the repository:

git clone https://github.com/azizzoaib786/airbnb-dataset-prediction-multiple-algo.git
cd airbnb-dataset-prediction-multiple-algo

Install dependencies:

pip install -r requirements.txt

Run the analysis:

jupyter notebook

🀝 Contributing

Contributions, issues, and feature requests are welcome! Feel free to check issues page.

πŸ“„ License

This project is licensed under the MIT License – see the LICENSE file for details.

πŸ“« Contact

For questions or suggestions, please reach out via:

About

Airbnb Rental Income Revenue Prediction Challenge Using Logistics Regression & Random Forest

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published