File size: 1,338 Bytes
38c93e6
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
---
language: en
license: apache-2.0
tags:
- xgboost
- machine-learning
- classification
- cybersecurity
- phishing-detection
datasets:
- custom
metrics:
- accuracy
- precision
- recall
- f1
---

# XGBoost Phishing Detection Models

## Model Description

XGBoost models trained for phishing detection using URL and HTML content features.

This model is trained using XGBoost for binary classification tasks.

## Model Architecture

- **Model Type**: XGBoost Classifier
- **Framework**: XGBoost
- **Task**: Binary Classification

## Usage

```python
import joblib
from huggingface_hub import hf_hub_download

# Download the model
model_path = hf_hub_download(repo_id="th1enq/xgboost_checkpoint", filename="xgboost phishing detection models.joblib")

# Load the model
model = joblib.load(model_path)

# Make predictions
predictions = model.predict(X_test)
```

## Training

The model was trained using the XGBoost library with the following approach:
- Feature extraction from URLs/HTML content
- Binary classification (legitimate vs phishing)
- Cross-validation for model evaluation

## Files

- `xgboost phishing detection models.joblib`: The trained XGBoost model
- `features.py`: Feature extraction functions
- `URLFeatureExtraction.py`: URL-specific feature extraction

## License

This model is released under the Apache 2.0 License.