-
Data-Engineering-Workflow-DUCKDB
A Complete Data Engineering Workflow, Data Modelling and Advanced Analytics using Python, DuckDB
-
InfluxDB
Power Real-Time Data Analytics at Scale. Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
#load data from `parquet` file df =pd.read_parquet('https://github.com/DanielOX/Data-Engineering-Workflow-DUCKDB/raw/main/nyc_taxi_trip_data/2023/2023.parquet') # display dataframe display(df.head()) # meta info on dataframe - df.shape outputs (rows, columns). print(f'Dataset Contains: {df.shape[0]} rows, {df.shape[1]} columns')
NOTE:
The number of mentions on this list indicates mentions on common posts plus user suggested alternatives.
Hence, a higher number means a more popular project.