Job/Internship Scam Detector

Project Description

This project is an ML application to predict whether a job or internship posting is likely to be a scam. It uses NLP techniques and classification models to analyze job details such as title, description, location, and requirements, and provides a confidence score for the prediction.

Features

User-friendly interface built with Streamlit for easy input of details.
Predicts scam likelihood using a pre-trained machine learning model.
Displays confidence scores for predictions.
Custom evaluation metric combining recall and precision for model performance.

Installation

Clone the repository:

git clone <repository-url>
cd Job-Internship-Fraud-Detection

Create and activate a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install required packages:
```
pip install -r requirements.txt
```
Ensure the dataset fake_job_postings.csv is in the project directory.

Usage

Run the Streamlit app:
```
streamlit run app.py
```
Enter the job or internship details in the provided fields:
- Title
- Description
- Location
- Requirements (optional)
Click "Predict" to see whether the posting is a scam or not.

Model Training and Evaluation

The model is trained using a public dataset taken from Kaggle - Real / Fake Job Posting.
Text features from title, location, description, and requirements are combined and vectorized using TF-IDF.
Synthetic Minority Over-sampling Technique (SMOTE) is applied to handle class imbalance.
Multiple classifiers were evaluated including Random Forest, XGBoost, and Naive Bayes.
Hyperparameter tuning was performed using GridSearchCV.
A custom weighted recall-precision metric was used to select the best model.
The final model is saved as scam_prediction_model.pkl and loaded by the Streamlit app.

Final Classification Report

Metric	Precision	Recall	F1-Score	Support
0	0.99	0.95	0.97	4212
1	0.39	0.79	0.52	185
Accuracy			0.94	4397
Macro Avg	0.69	0.87	0.74	4397
Weighted Avg	0.96	0.94	0.95	4397

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
app.py		app.py
job-internship-scam.ipynb		job-internship-scam.ipynb
metrics.py		metrics.py
requirements.txt		requirements.txt
scam_prediction_model.pkl		scam_prediction_model.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Job/Internship Scam Detector

Project Description

Features

Installation

Usage

Model Training and Evaluation

Final Classification Report

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Job/Internship Scam Detector

Project Description

Features

Installation

Usage

Model Training and Evaluation

Final Classification Report

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages