Show HN: EmuBert – the first open encoder model for Australian law

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com

Scout Monitoring - Free Django app performance insights with Scout Monitoring
Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.
www.scoutapm.com
featured
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
  • emubert-creator

    The training code behind EmuBert, the largest open-source masked language model for Australian law.

  • ⦁ Text embedding.

    Not only that but, despite only being trained to guess missing words, EmuBert seems to know facts such as that Norfolk Island is an Australian territory (try the prompt, 'Norfolk Island is an Australian .'), it is Section 51 of the Constitution that grants Parliament the power to make laws for the peace, order, and good government of the Commonwealth ('Section of the Constitution grants the Australian Parliament the power to make laws for the peace, order, and good government of the Commonwealth.'), and that the representative of the monarch of Australia is the Governor-General ('The representative of the monarch of Australia is the -General.').

    Finally, EmuBert achieves a perplexity of 2.05 on the Open Australian Legal QA, the first open dataset of Australian legal questions and answers, outperforming all known state-of-the-art masked language models, including Roberta, Bert and Legal-Bert.

    You can check out EmuBert on Hugging Face here: https://huggingface.co/umarbutler/emubert

    The code I used to create EmuBert is also openly available on GitHub: https://github.com/umarbutler/emubert-creator

  • Scout Monitoring

    Free Django app performance insights with Scout Monitoring. Get Scout setup in minutes, and let us sweat the small stuff. A couple lines in settings.py is all you need to start monitoring your apps. Sign up for our free tier today.

    Scout Monitoring logo
NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Related posts

  • So, how do you get music for your self-hosted server?

    2 projects | news.ycombinator.com | 7 Jun 2024
  • LSP-AI: open-source language server serving as back end for AI code assistance

    7 projects | news.ycombinator.com | 8 Jun 2024
  • MT-Bench: Comparing different LLM Judges

    2 projects | dev.to | 8 Jun 2024
  • Gloe: Simplify Complex Pipelines with Type-Safe Transformers

    1 project | news.ycombinator.com | 8 Jun 2024
  • Flash-Linear-Attention

    1 project | news.ycombinator.com | 8 Jun 2024