Toxicity in Tweets using a BERT model

This page summarizes the projects mentioned and recommended in the original post on dev.to

InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
  • ToLD-Br

    Toxic Language Detection in Social Media for Brazilian Portuguese: New Dataset and Multilingual Analysis

  • The dataset is based on ToLD-Br, which is a huge dataset of tweets (or is it Xeets now?) that contains some additional info such as a classification if the text contains homophobia, obscenity, insults, racism, misogyny and xenophobia. The dataset for the competition, however, is a simple toxicity column.

  • And that's it! If you want to check it out and train/test this model yourself, feel free to check the code in my GitHub repository!

  • InfluxDB

    Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

    InfluxDB logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • Show HN: Move data from any Vector DB to any other Vector DB

    1 project | news.ycombinator.com | 10 May 2024
  • BMF: Frame extraction acceleration- video similarity search with Pinecone

    3 projects | dev.to | 10 May 2024
  • Farspeak

    1 project | news.ycombinator.com | 10 May 2024
  • Data Science with GitHub Copilot

    1 project | news.ycombinator.com | 10 May 2024
  • LangChain: LLM App Evaluation

    1 project | dev.to | 10 May 2024