Top 23 Python Privacy Projects
-
hosts
🔒 Consolidating and extending hosts files from several well-curated sources. Optionally pick extensions for porn, social media, and other categories.
-
adversarial-robustness-toolbox
Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning Security - Evasion, Poisoning, Extraction, Inference - Red and Blue Teams
-
FedML
FEDML - The unified and scalable ML library for large-scale distributed training, model serving, and federated learning. FEDML Launch, a cross-cloud scheduler, further enables running any AI jobs on any GPU cloud or on-premise cluster. Built on this library, FEDML Nexus AI (https://fedml.ai) is your generative AI platform at scale.
-
presidio
Context aware, pluggable and customizable data protection and de-identification SDK for text and images
-
Social-Amnesia
Forget the past. Social Amnesia makes sure your social media accounts only show your posts from recent history, not from "that phase" 5 years ago.
Not by default but a blocklist can be found here https://github.com/StevenBlack/hosts
https://github.com/OpenMined/PySyft - Federated Learning data science
Incentives are much harder but smart contracts can handle the tech part.
Going this route, you eventually end up with a "quantum AI app store" and your system of government is a 12GB download. Can't even say if it's a good idea compared to e.g. anarcho-primitivism.
Project mention: Tribler: An attack-resilient micro-economy for media | news.ycombinator.com | 2024-04-25
I noticed that too:
https://github.com/Tribler/tribler/wiki/%22TrustChain%22-arc...
But not much else about it. Would be interested to read more. Using torrent seeding as a form of Proof-of-Work that rewards tokens is actually an interesting use case for cryptocurrency, and not as energy-hungry.
Project mention: [Experiment] The future of AI is open-source, and here is the plan | /r/samkoesnadi | 2023-06-05
FedML https://github.com/FedML-AI/FedML might already provide a lot of tools to do the job
Perhaps de-identification before training could be helpful here.
Microsoft does seem active in this, e.g. https://microsoft.github.io/presidio/
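To make the idea of de-identifying text before training concrete, here is a minimal sketch using only Python's standard library. It is not Presidio's API: the `PII_PATTERNS` table and the `deidentify` helper are hypothetical names chosen for illustration, and the two regexes are assumptions that cover only simple email and phone formats.

```python
import re

# Illustrative patterns only (assumption): real PII detection needs far
# broader coverage, context awareness, and validation than two regexes.
PII_PATTERNS = {
    "EMAIL": re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}"),
    "PHONE": re.compile(r"\+?\d[\d\s().-]{7,}\d"),
}


def deidentify(text: str) -> str:
    """Replace every match of a known pattern with a <LABEL> placeholder."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text


if __name__ == "__main__":
    sample = "Contact jane.doe@example.com or call +1 (555) 123-4567 today."
    print(deidentify(sample))
```

Running the script on the sample sentence prints `Contact <EMAIL> or call <PHONE> today.`. A dedicated SDK such as Presidio would be the better choice for real data, since pattern matching alone misses names, addresses, and context-dependent identifiers.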
Project mention: It Took Me a Decade to Find the Perfect Personal Website Stack – Ghost+Fathom | news.ycombinator.com | 2023-07-09
+1 on shynet! I use it for my personal website and my blog, and it's been working great.
I got it up and running with Podman, so no need to install and run the Docker daemon. I also fixed SQLite support [1], so no need for an additional DB server.
I analyzed available open-source web analytics tools [2] and AFAIK there is no simpler solution for web analytics that doesn't involve a third party.
[1] https://github.com/milesmcc/shynet/issues/208
[2] https://blog.fidelramos.net/software/privacy-respecting-self...
Project mention: Change in "Web Data" Autofill file under User Data\Default | /r/techsupport | 2023-08-19
For those who want the complete story, this is a recent issue with the BleachBit cleaner, tracked on GitHub: https://github.com/bleachbit/bleachbit/issues/1518
Project mention: Is it worth upgrading the RAM/CPU on my micro optiplex 7050 to use for a home server? | /r/HomeServer | 2023-12-08
I've added the Ads & Tracking list and the AMP Hosts list from Developer Dan to the default list; any others you recommend I add? It's hard to tell if the ads coming through are a 'my blocklist isn't good enough' problem or a 'my pihole's not set up properly yet' problem.
Project mention: LongRoPE: Extending LLM Context Window Beyond 2M Tokens | news.ycombinator.com | 2024-02-22
It's been possible to skip tokenization for a long time; my team and I did it here: https://github.com/capitalone/DataProfiler
For what it's worth, we were actually working with LSTMs with nearly a billion params back around 2016-2017. Transformers made it far more effective to train and execute, but ultimately LSTMs are able to achieve similar results, though they are slower and require more training data.
Project mention: Is there a filter or otherwise a way to block all google domains except YouTube? | /r/Adguard | 2023-05-12
Yes
Project mention: Reddit Fulfilled My Data Copy Request - What's the best script to use this to nuke? | /r/privacy | 2023-07-11
Some scripts like https://github.com/x89/Shreddit look promising, and I'm getting ready to pull the trigger on it as soon as I make sure my whitelist IDs are good. However, it's probably not thorough enough to hit all my content. My reddit data has over 68,000 comments.
Python Privacy related posts
-
Tribler: An attack-resilient micro-economy for media
-
LongRoPE: Extending LLM Context Window Beyond 2M Tokens
-
Browsers Are Weird
-
Which corporations in your opinion are the most evil for privacy, and the least evil for privacy?
-
Mozilla CEO received $6,9m salary in 2022, a $2m increase from 2021, meanwhile Firefox has lost 30m of its userbase since 2020.
-
Open Source Ad Blocker for Mac, Windows, and Linux
-
From email to phone number, a new OSINT approach
-
A note from our sponsor - InfluxDB
www.influxdata.com | 10 May 2024
Index
What are some of the best open-source Privacy projects in Python? This list will help you:
Rank | Project | Stars |
---|---|---|
1 | hosts | 25,558 |
2 | macOS-Security-and-Privacy-Guide | 20,907 |
3 | ungoogled-chromium | 18,979 |
4 | PySyft | 9,273 |
5 | whoogle-search | 8,833 |
6 | tribler | 4,687 |
7 | adversarial-robustness-toolbox | 4,483 |
8 | FedML | 4,068 |
9 | ProxyBroker | 3,729 |
10 | presidio | 3,119 |
11 | Shynet | 2,815 |
12 | bleachbit | 2,720 |
13 | email2phonenumber | 1,950 |
14 | privacy | 1,874 |
15 | noisy | 1,622 |
16 | portainer-templates | 1,514 |
17 | hosts | 1,495 |
18 | DataProfiler | 1,363 |
19 | OpenWPM | 1,313 |
20 | tf-encrypted | 1,194 |
21 | no-google | 1,179 |
22 | Shreddit | 989 |
23 | Social-Amnesia | 798 |