Friends don't let friends export to CSV

InfluxDB - Power Real-Time Data Analytics at Scale

Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.

www.influxdata.com

featured

SaaSHub - Software Alternatives and Reviews

SaaSHub helps you find the best software and product alternatives

www.saashub.com

featured

usv

16 185 9.1 Shell

Unicode Separated Values (USV) data markup for units, records, groups, files, streaming, and more.

I can't remember the last time I, or anyone I've ever worked with for that matter, ever typed up a CSV from scratch. The whole point of USV is that the delimiters can't normally be typed so you don't have to worry about escaping.
USV supports displayable delimiters (see https://github.com/SixArm/usv), so for the much more common case of editing an existing CSV in a text editor, you can just copy and paste.

Sep

2 639 8.8 C#

World's Fastest .NET CSV Parser. Modern, minimal, fast, zero allocation, reading and writing of separated values (`csv`, `tsv` etc.). Cross-platform, trimmable and AOT/NativeAOT compatible.

If you ever need to parse CSV really fast and happen to know C#, there is an incredible vectorized parser for that: https://github.com/nietras/Sep/

InfluxDB

www.influxdata.com featured

Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
csvy

1 55 - R

Import and Export CSV Data With a YAML Metadata Header

There is CSVY, which lets you set a delimiter, schema, column types, etc. and has libraries in many languages and is natively supported in R.
Also is backwards-compatible with most CSV parsers.
https://github.com/leeper/csvy

Fiona

3 1,129 8.5 Python

Fiona reads and writes geographic data files

Your issue is that you're using the default (old) binding to GDAL, based on Fiona [0].
You need to use pyogrio [1], its vectorized counterpart, instead. Make sure you use `engine="pyogrio"` when calling `to_file` [2]. Fiona does a loop in Python, while pyogrio is exclusively compiled. So pyogrio is usually about 10-15x faster than fiona. Soon, in pyogrio version 0.8, it will be another ~2-4x faster than pyogrio is now [3].
[0]: https://github.com/Toblerity/Fiona
[1]: https://github.com/geopandas/pyogrio
[2]: https://geopandas.org/en/stable/docs/reference/api/geopandas...
[3]: https://github.com/geopandas/pyogrio/pull/346

pyogrio

1 240 8.6 Python

Vectorized vector I/O using OGR

Your issue is that you're using the default (old) binding to GDAL, based on Fiona [0].
You need to use pyogrio [1], its vectorized counterpart, instead. Make sure you use `engine="pyogrio"` when calling `to_file` [2]. Fiona does a loop in Python, while pyogrio is exclusively compiled. So pyogrio is usually about 10-15x faster than fiona. Soon, in pyogrio version 0.8, it will be another ~2-4x faster than pyogrio is now [3].
[0]: https://github.com/Toblerity/Fiona
[1]: https://github.com/geopandas/pyogrio
[2]: https://geopandas.org/en/stable/docs/reference/api/geopandas...
[3]: https://github.com/geopandas/pyogrio/pull/346

geoparquet

3 725 5.5 Python

Specification for storing geospatial vector data (point, line, polygon) in Parquet

That's why I'm working on the GeoParquet spec [0]! It gives you both compression-by-default and super fast reads and writes! So it's usually as small as gzipped CSV, if not smaller, while being faster to read and write than GeoPackage.
Try using `GeoDataFrame.to_parquet` and `GeoPandas.read_parquet`
[0]: https://github.com/opengeospatial/geoparquet

NOTE: The number of mentions on this list indicates mentions on common posts plus user suggested alternatives. Hence, a higher number means a more popular project.

Suggest a related project

Eli Bendersky: Faster XML Stream Processing in Go

1 project | news.ycombinator.com | 7 May 2024
Data Science at the Command Line, 2nd Edition (2021)

5 projects | news.ycombinator.com | 6 May 2024
Show HN: TextQuery – Query and Visualize Your CSV Data in Minutes

3 projects | news.ycombinator.com | 2 Apr 2024
Revolutionizing Real-Time Alerts with AI, NATs and Streamlit

6 projects | dev.to | 18 Feb 2024
Show HN: Srgn, AST-aware text manipulation

1 project | news.ycombinator.com | 27 Jan 2024

Friends don't let friends export to CSV

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Gis Python C# cloud-native CSV
Post date: 25 Mar 2024

usv

Sep

InfluxDB

csvy

Fiona

pyogrio

geoparquet

Related posts

Eli Bendersky: Faster XML Stream Processing in Go

Data Science at the Command Line, 2nd Edition (2021)

Show HN: TextQuery – Query and Visualize Your CSV Data in Minutes

Revolutionizing Real-Time Alerts with AI, NATs and Streamlit

Show HN: Srgn, AST-aware text manipulation

Friends don't let friends export to CSV

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com Gis Python C# cloud-native CSV Post date: 25 Mar 2024

usv

Sep

InfluxDB

csvy

Fiona

pyogrio

geoparquet

Related posts

Eli Bendersky: Faster XML Stream Processing in Go

Data Science at the Command Line, 2nd Edition (2021)

Show HN: TextQuery – Query and Visualize Your CSV Data in Minutes

Revolutionizing Real-Time Alerts with AI, NATs and Streamlit

Show HN: Srgn, AST-aware text manipulation

This page summarizes the projects mentioned and recommended in the original post on news.ycombinator.com
Gis Python C# cloud-native CSV
Post date: 25 Mar 2024