🚀 Launched SafeGuard AI: A BERT-powered app to detect online toxicity

kumardipanshu7777 · July 7, 2025, 3:40pm

Launched SafeGuard AI: A BERT-powered app to detect online toxicity

Hey everyone,

I’m excited to share a project I’ve been working on called SafeGuard AI. It’s a web app built with Streamlit that uses a fine-tuned BERT model to detect cyberbullying and toxic comments in real time. The goal is to create a simple tool that can help promote safer online spaces.

You can try the live demo here: https://safegaurd-ai.streamlit.app/

What it Does

You can enter any sentence or comment into the app, and it will classify it as either “Bullying” or “Not Bullying” with a confidence score. It’s designed to understand the context and nuance of language, not just keywords.

Tech Stack

UI: Streamlit
Machine Learning: PyTorch
Model Framework: Hugging Face Transformers
Data Handling: Pandas

How it Works & Performance

The core of the app is a bert-base-uncased model that was fine-tuned on a balanced dataset of over 115,000 text samples.

After training, the model achieved some really strong results on the test set:

Accuracy: 93%
Recall (for Bullying): 96% (meaning it successfully catches 96% of harmful comments)
F1-Score: 0.93

This high recall shows it’s very effective at its main job of identifying toxic content.

How You Can Recreate It

For anyone interested in how this was built, the whole process is documented in the GitHub repository. The basic steps were:

Data Preparation: Combined several public datasets and created a large, balanced .csv file.
Model Training: Fine-tuned the BERT model on the prepared data using PyTorch in a Google Colab notebook.
App Development: Created the app.py script using Streamlit to build the interface and load the saved model files.
Deployment: Pushed the entire project (app script, model files via Git LFS, and requirements) to GitHub and deployed it on Streamlit Cloud.

You can find the full code, the final model, and a detailed README here: GitHub - Dipanshu7777/bullying-detector: Cyberbullying Detection App using BERT NLP and Streamlit

I’m really passionate about using AI for social good and would love to hear any feedback or suggestions from the community. Thanks for checking it out!

Topic		Replies	Views
🚀 Introducing my YouTube Comment Sentiment Analysis app Show the Community!	0	297	April 7, 2025
Meet Playground - my new Streamlit App Show the Community! heroku , streamlit-cloud	6	2369	May 13, 2022
🔍 Fake News Detection App – Built with ML & NLP! Show the Community! streamlit-cloud	4	176	April 8, 2025
Emotion Detection Application Show the Community! deep-learning , computer-vision , streamlit-cloud	1	1263	May 13, 2022
Streamlit + Pycaret, An End-to-End Machine Learning Web Application Show the Community! cache , docker	2	2618	May 13, 2022

🚀 Launched SafeGuard AI: A BERT-powered app to detect online toxicity