Python Privacy

Open-source Python projects categorized as Privacy

Top 23 Python Privacy Projects

  • hosts

    🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.

  • Project mention: Does PiHole block porn? | /r/pihole | 2023-12-06

    Not by default but a blocklist can be found here https://github.com/StevenBlack/hosts

  • macOS-Security-and-Privacy-Guide

    Guide to securing and improving privacy on macOS

  • Project mention: Hardening macOS | /r/MacOS | 2023-07-03
  • ungoogled-chromium

    Google Chromium, sans integration with Google

  • Project mention: console.log(DOOM) | news.ycombinator.com | 2024-02-25
  • PySyft

    Perform data science on data that remains in someone else's server

  • Project mention: A Better Mastodon Client | news.ycombinator.com | 2023-12-21

    https://github.com/OpenMined/PySyft - Federated Learning data science

    Incentives are much harder but smart contracts can handle the tech part.

    Going this route eventually you quickly have "quantum AI app store" and your system of government is a 12GB download. Can't even say if it's a good idea compared to e.g. anarcho-primitivism.

  • whoogle-search

  • Project mention: So I deployed Whoogle on my NAS.... | /r/selfhosted | 2023-12-08
  • tribler

    Privacy enhanced BitTorrent client with P2P content discovery

  • Project mention: Tribler: An attack-resilient micro-economy for media | news.ycombinator.com | 2024-04-25

    I noticed that too:

    https://github.com/Tribler/tribler/wiki/%22TrustChain%22-arc...

    But not much else about it. Would be interested to read more. Using torrent seeding as a form of Proof-of-Work that rewards tokens is actually an interesting use case for cryptocurrency, and not as energy-hungry.

  • adversarial-robustness-toolbox

    Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams

  • FedML

    FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, FEDML Nexus AI (https://fedml.ai) is your generative AI platform at scale.

  • Project mention: [Experiment] The future of AI is open-source, and here is the plan | /r/samkoesnadi | 2023-06-05

    FedML https://github.com/FedML-AI/FedML might already provide a lot of tools to do the job

  • ProxyBroker

    Proxy [Finder | Checker | Server]. HTTP(S) & SOCKS 🎭

  • presidio

    Context aware, pluggable and customizable data protection and de-identification SDK for text and images

  • Project mention: You can't build a moat with AI | news.ycombinator.com | 2024-04-11

    Perhaps de-identification before training could be helpful here.

    Microsoft does seem active in this, e.g. https://microsoft.github.io/presidio/

  • Shynet

    Modern, privacy-friendly, and detailed web analytics that works without cookies or JS.

  • Project mention: It Took Me a Decade to Find the Perfect Personal Website Stack – Ghost+Fathom | news.ycombinator.com | 2023-07-09

    +1 on shynet! I use it for my personal website and my blog, and it's been working great.

    I got it up and running with Podman, so no need to install and run the Docker daemon. I also fixed SQLite support [1], so no need for an additional DB server.

    I analyzed available open-source web analytics tools [2] and AFAIK there is no simpler solution for web analytics that doesn't involve a third party.

    [1] https://github.com/milesmcc/shynet/issues/208

    [2] https://blog.fidelramos.net/software/privacy-respecting-self...

  • bleachbit

    BleachBit system cleaner for Windows and Linux

  • Project mention: Change in "Web Data" Autofill file under User Data\Default | /r/techsupport | 2023-08-19

    For those who want the complete story, this is a recent issue with the BleachBit cleaner, tracked on GitHub: https://github.com/bleachbit/bleachbit/issues/1518

  • email2phonenumber

    An OSINT tool to obtain a target's phone number just by having his email address

  • Project mention: FLaNK Stack Weekly for 20 Nov 2023 | dev.to | 2023-11-20
  • privacy

    Library for training machine learning models with privacy for training data

  • noisy

    Simple random DNS, HTTP/S internet traffic noise generator

  • portainer-templates

    🚢 500+ 1-click Portainer app templates

  • Project mention: Is it worth upgrading the RAM/CPU on my micro optiplex 7050 to use for a home server? | /r/HomeServer | 2023-12-08
  • hosts

    Hostfile blocklist for ads and tracking, updated regularly (by lightswitch05)

  • Project mention: DNS server set to Pihole but no traffic | /r/pihole | 2023-06-24

    I've added the Ads & Tracking list and the AMP Hosts list from Developer Dan to the default list; any others you recommend I add? It's hard to tell if the ads coming through are a 'my blocklist isn't good enough' problem or a 'my pihole's not set up properly yet' problem.

  • DataProfiler

    What's in your data? Extract schema, statistics and entities from datasets

  • Project mention: LongRoPE: Extending LLM Context Window Beyond 2M Tokens | news.ycombinator.com | 2024-02-22

    It's been possible to skip tokenization for a long time, my team and I did it here - https://github.com/capitalone/DataProfiler

    For what it's worth, we actually were working with LSTMs with nearly a billion params back in 2016-2017 area. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though slow & require more training data.

  • OpenWPM

    A web privacy measurement framework

  • tf-encrypted

    A Framework for Encrypted Machine Learning in TensorFlow

  • no-google

    Completely block Google and its services

  • Project mention: Is there a filter or otherwise a way to block all google domains except YouTube? | /r/Adguard | 2023-05-12

    Yes

  • Shreddit

    Remove your comment history on Reddit as deleting an account does not do so.

  • Project mention: Reddit Fulfilled My Data Copy Request - What's the best script to use this to nuke? | /r/privacy | 2023-07-11

    Some scripts like https://github.com/x89/Shreddit look promising, and I'm getting ready to pull the trigger on it just once I make sure my whitelist IDs are good. However, it's probably not thorough enough to hit all my content. My reddit data has over 68,000 comments.

  • Social-Amnesia

    Forget the past. Social Amnesia makes sure your social media accounts only show your posts from recent history, not from "that phase" 5 years ago.

NOTE: The open-source projects on this list are ordered by number of GitHub stars. The number of mentions indicates repo mentions in the last 12 months or since we started tracking (Dec 2020).

Python Privacy related posts

  • Tribler: An attack-resilient micro-economy for media

    5 projects | news.ycombinator.com | 25 Apr 2024
  • LongRoPE: Extending LLM Context Window Beyond 2M Tokens

    1 project | news.ycombinator.com | 22 Feb 2024
  • Browsers Are Weird

    2 projects | news.ycombinator.com | 5 Feb 2024
  • Which corporations in your opinion are the most evil for privacy, and the least evil for privacy?

    1 project | /r/privacy | 9 Dec 2023
  • Mozilla CEO received $6,9m salary in 2022, a $2m increase from 2021, meanwhile Firefox has lost 30m of its userbase since 2020.

    1 project | /r/browsers | 6 Dec 2023
  • Open Source Ad Blocker for Mac, Windows, and Linux

    3 projects | news.ycombinator.com | 4 Dec 2023
  • From email to phone number, a new OSINT approach

    1 project | news.ycombinator.com | 16 Nov 2023

Index

What are some of the best open-source Privacy projects in Python? This list will help you:

 #  Project                             Stars
 1  hosts                              25,558
 2  macOS-Security-and-Privacy-Guide   20,907
 3  ungoogled-chromium                 18,979
 4  PySyft                              9,273
 5  whoogle-search                      8,833
 6  tribler                             4,687
 7  adversarial-robustness-toolbox      4,483
 8  FedML                               4,068
 9  ProxyBroker                         3,729
10  presidio                            3,119
11  Shynet                              2,815
12  bleachbit                           2,720
13  email2phonenumber                   1,950
14  privacy                             1,874
15  noisy                               1,622
16  portainer-templates                 1,514
17  hosts                               1,495
18  DataProfiler                        1,363
19  OpenWPM                             1,313
20  tf-encrypted                        1,194
21  no-google                           1,179
22  Shreddit                              989
23  Social-Amnesia                        798
