An advanced machine learning model for detecting phishing URLs with high accuracy
NeoGuardianAI is a sophisticated machine learning model designed to identify and flag potentially dangerous phishing URLs. With the rise in cyber threats, this tool serves as a digital guardian, helping users navigate the web safely.
The model analyzes various features of a URL to determine if it's legitimate or a phishing attempt, providing real-time protection against cyber threats.
The model was trained on the pirocheto/phishing-url dataset from Hugging Face, containing thousands of labeled URLs.
Extracted over 30 features from each URL, including length metrics, domain characteristics, special character counts, and suspicious patterns.
After evaluating multiple algorithms, XGBoost was selected for its superior performance in classification tasks and ability to handle complex feature relationships.
The model was trained with carefully tuned hyperparameters including max depth, learning rate, and regularization to prevent overfitting.
After rigorous testing and validation, the model was deployed to Hugging Face Hub for public access and integrated into a Gradio web interface.
When a URL is submitted, NeoGuardianAI performs a comprehensive analysis:
A gradient boosting framework that uses decision trees and gradient boosting to create a highly accurate prediction model.
XGBClassifier( max_depth=5, learning_rate=0.1, n_estimators=100, subsample=0.8, colsample_bytree=0.8, gamma=0.1, objective='binary:logistic', eval_metric='logloss' )
StandardScaler is used to normalize features, ensuring all inputs have similar scale for optimal model performance.
The model combines multiple decision trees, with each new tree correcting errors made by previous trees, resulting in high accuracy predictions.
96.31%
96.00%
96.66%
96.33%
NeoGuardianAI achieves exceptional performance across all key metrics, making it highly reliable for phishing URL detection:
The close values of precision and recall indicate the model is well-balanced, minimizing both false positives and false negatives.
The high F1 score (96.33%) demonstrates the model's effectiveness in real-world scenarios where both precision and recall are important.
NeoGuardianAI's performance exceeds many commercial phishing detection solutions, which typically achieve 85-90% accuracy.
The easiest way to use NeoGuardianAI is through the Hugging Face Spaces web interface:
For developers, NeoGuardianAI can be integrated into applications using the Hugging Face Inference API:
import requests
API_URL = "https://api-inference.huggingface.co/models/Devishetty100/neoguardianai"
headers = {"Authorization": "Bearer YOUR_API_TOKEN"}
def query(url):
payload = {"inputs": url}
response = requests.post(API_URL, headers=headers, json=payload)
return response.json()
result = query("https://example.com")
print(result)
Replace YOUR_API_TOKEN with your Hugging Face API token. The API returns a prediction and confidence score for the provided URL.