buckaroo VS sketch

Compare buckaroo vs sketch and see what are their differences.

buckaroo

Buckaroo - the data wrangling assistant for pandas. Quickly explore dataframes, and run pandas commands via a GUI. Works inside the jupyter notebook. (by paddymul)
InfluxDB - Power Real-Time Data Analytics at Scale
Get real-time insights from all types of time series data with InfluxDB. Ingest, query, and analyze billions of data points in real-time with unbounded cardinality.
www.influxdata.com
featured
SaaSHub - Software Alternatives and Reviews
SaaSHub helps you find the best software and product alternatives
www.saashub.com
featured
buckaroo sketch
10 20
160 2,195
- 0.7%
8.9 4.4
25 days ago 3 months ago
Jupyter Notebook Python
BSD 3-clause "New" or "Revised" License MIT License
The number of mentions indicates the total number of mentions that we've tracked plus the number of user suggested alternatives.
Stars - the number of stars that a project has on GitHub. Growth - month over month growth in stars.
Activity is a relative number indicating how actively a project is being developed. Recent commits have higher weight than older ones.
For example, an activity of 9.0 indicates that a project is amongst the top 10% of the most actively developed projects that we are tracking.

buckaroo

Posts with mentions or reviews of buckaroo. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-04-28.
  • PySheets – Spreadsheet UI for Python
    3 projects | news.ycombinator.com | 28 Apr 2024
    I created buckaroo [1] as a better dataframe viewer for jupyter with built in summary stats. It's built to bring a better dataframe experience to people already using pandas/polars. All of it is extensible [2] so that you can customize stats and transformations to your workflow.

    [1] https://github.com/paddymul/buckaroo

  • The Design Philosophy of Great Tables (Software Package)
    7 projects | news.ycombinator.com | 4 Apr 2024
    Great tables has done some really nice work on python/jupyter tables. It looks like they are almost building a "grammar of tables" similar to a grammar of graphics. More projects should write about their philosophy and aims like this.

    I have built a different table library for jupyter called buckaroo. My approach has been different. Buckaroo aims to allow you to interactively cycle through different formats and post-processing functions to quickly glean important insights from a table while working interactively. I took the view that I type the same commands over and over to perform rudimentary exploratory data analysis, those commands and insights should be built into a table.

    Great tables seems built so that you can manually format a table for presentation.

    https://github.com/paddymul/buckaroo

    https://youtu.be/GPl6_9n31NE

  • Ask HN: Problems worth solving with a low-code back end?
    2 projects | news.ycombinator.com | 12 Mar 2024
    JLisp.

    3. It was very easy to define new lowcode commands, and have the frontend add them to the palette. Each command defines two methods "transform" which manipulates the dataframe, and "transform_to_py" which takes the same arguments but emits python code.

    Adoption of my library in general, and the low code UI specifically has been very limited. I'm in the middle of plumbing the lowcode support back in after a refactor of other parts.

    I would like to build a whole ecosystem around JLisp and Buckaroo. Specifically I have some "auto-cleaning" functionality that emits JLisp cleaning and normalization commands, these commands can then be editted in the UI (delete, edit parameters). It's easier to emit JLisp than raw python syntax, it's also much easier to make a UI to manipulate it.

    Do you have a repo to look at? What usecase did you have in mind when you were building it?

    If I were evaluating a low-code backend builder I'd be interested in the examples, and tests. Hopefully the tests would double as examples. For a Workflow type low-code-builder I'd be most interested in the cron functionality.

    [1] https://github.com/paddymul/buckaroo

    [2] http://norvig.com/lispy2.html

  • How to Write a (Lisp) Interpreter (In Python)
    18 projects | news.ycombinator.com | 11 Mar 2024
    I used Norvig’s lisp2.py to build a low code UI. I modified the interpreter to accept JSON flavored lisp, basically replace parens with brackets. The upside is that it was very very easy to make a react front end that manipulates JSON (JLisp). My thinking was, I need a serialization format for operations from the front end, and a way to interpret them. I could write my own language that no one has heard of, or use lisp, which few have used.

    https://github.com/paddymul/buckaroo/blob/main/buckaroo/jlis...

  • Show HN: The Buckaroo Data Table for Jupyter
    1 project | news.ycombinator.com | 26 Nov 2023
  • Buckaroo – the data wrangling assistant for Pandas
    1 project | news.ycombinator.com | 7 Nov 2023
  • Ask HN: Who wants to be hired? (October 2023)
    18 projects | news.ycombinator.com | 2 Oct 2023
    Location: Boston

    Remote: Yes

    Willing to relocate: Yes

    Technologies: talking to users, python, pandas/numpy, jupyter, js/ts

    Résumé/CV: https://www.linkedin.com/in/paddymullen/

    Email: [email protected]

    In my next role, I want a broad mandate to make a meaningful impact within an organization by developing products that address genuine business challenges, with a preference for data related problems.

    Recently I built the data table for Jupyter/Pandas that I have wanted for over a decade. The open source Buckaroo (https://github.com/paddymul/buckaroo) data table combines a performant table, summary statistics, and a low code UI to expedite common data analysis tasks.

  • Ask HN: Who wants to be hired? (September 2023)
    16 projects | news.ycombinator.com | 1 Sep 2023
    wanted for over a decade. The open source Buckaroo https://github.com/paddymul/buckaroo data table combines a performant table, summary statistics, and a low code UI to
  • Ask HN: Who wants to be hired? (June 2023)
    21 projects | news.ycombinator.com | 1 Jun 2023
  • Pandas AI – The Future of Data Analysis
    7 projects | news.ycombinator.com | 17 May 2023
    This morning I added a "Related Projects" [3] Section to the Buckaroo docs. If Buckaroo doesn't solve your problem, look at one of the other linked projects (like Mito).

    [1] https://github.com/approximatelabs/sketch

    [2] https://github.com/paddymul/buckaroo

    [3] https://buckaroo-data.readthedocs.io/en/latest/FAQ.html

sketch

Posts with mentions or reviews of sketch. We have used some of these posts to build our list of alternatives and similar projects. The last one was on 2024-02-05.
  • Ask HN: What have you built with LLMs?
    43 projects | news.ycombinator.com | 5 Feb 2024
    We've made a lot of data tooling things based on LLMs, and are in the process of rebranding and launching our main product.

    1. sketch (in notebook, ai for pandas) https://github.com/approximatelabs/sketch

    2. datadm (open source, "chat with data", with support for the open source LLMs (https://github.com/approximatelabs/datadm)

    3. Our main product: julyp. https://julyp.com/ (currently under very active rebrand and cleanup) -- but a "chat with data" style app, with a lot of specialized features. I'm also streaming me using it (and sometimes building it) every weekday on twitch to solve misc data problems (https://www.twitch.tv/bluecoconut)

    For your next question, about the stack and deploy:

  • Pandas AI – The Future of Data Analysis
    7 projects | news.ycombinator.com | 17 May 2023
    This morning I added a "Related Projects" [3] Section to the Buckaroo docs. If Buckaroo doesn't solve your problem, look at one of the other linked projects (like Mito).

    [1] https://github.com/approximatelabs/sketch

    [2] https://github.com/paddymul/buckaroo

    [3] https://buckaroo-data.readthedocs.io/en/latest/FAQ.html

  • Ask HN: What's your favorite GPT powered tool?
    16 projects | news.ycombinator.com | 15 May 2023
    For GPT/Copilot style help for pandas, in notebooks REPL flow (without needing to install plugins), I built sketch. I genuinely use it every-time I'm working on pandas dataframes for a quick one-off analysis. Just makes the iteration loop so much faster. (Specifically the `.sketch.howto`, anecdotally I actually don't use `.sketch.ask` anymore)

    https://github.com/approximatelabs/sketch

  • RasaGPT: First headless LLM chatbot built on top of Rasa, Langchain and FastAPI
    13 projects | news.ycombinator.com | 8 May 2023
    https://github.com/approximatelabs/lambdaprompt It has served all of my personal use-cases since making it, including powering `sketch` (copilot for pandas) https://github.com/approximatelabs/sketch

    Core things it does: Uses jinja templates, does sync and async, and most importantly treats LLM completion endpoints as "function calls", which you can compose and build structures around just with simple python. I also combined it with fastapi so you can just serve up any templates you want directly as rest endpoints. It also offers callback hooks so you can log & trace execution graphs.

    All together its only ~600 lines of python.

    I haven't had a chance to really push all the different examples out there, but most "complex behaviors", so there aren't many patterns to copy. But if you're comfortable in python, then I think it offers a pretty good interface.

    I hope to get back to it sometime in the next week to introduce local-mode (eg. all the open source smaller models are now available, I want to make those first-class)

  • [D] The best way to train an LLM on company data
    7 projects | /r/MachineLearning | 29 Mar 2023
    Please look at sketch and langchain pandas/SQL plugins. I have seen excellent results with both of these approaches. Both of these approaches will require you to send metadata to openAI.
  • Meet Sketch: An AI code Writing Assistant For Pandas
    1 project | /r/machinelearningnews | 1 Feb 2023
    👉 Understand your data through questions 👉 Create code from plain text Quick Read: https://www.marktechpost.com/2023/02/01/meet-sketch-an-ai-code-writing-assistant-for-pandas/ Github: https://github.com/approximatelabs/sketch
  • Replacing a SQL analyst with 26 recursive GPT prompts
    5 projects | news.ycombinator.com | 25 Jan 2023
    (3) Asking for re-writes of failed queries (happens occasionally) also helps

    The main challenge I think with a lot of these "look it works" tools for data applications, is how do you get an interface that actually will be easy to adopt. The chat-bot style shown here (discord and slack integration) I can see being really valuable, as I believe there has been some traction with these style integrations with data catalog systems recently. People like to ask data questions to other people in slack, adding a bot that tries to answer might short-circuit a lot of this!

    We built a prototype where we applied similar techniques to the pandas-code-writing part of the stack, trying to help keep data scientists / data analysts "in flow", integrating the code answers in notebooks (similar to how co-pilot puts suggestions in-line) -- and released https://github.com/approximatelabs/sketch a little while ago.

  • FLiP Stack Weekly for 21 Jan 2023
    19 projects | dev.to | 23 Jan 2023
    Python AI Helper https://github.com/approximatelabs/sketch
  • LangChain: Build AI apps with LLMs through composability
    8 projects | news.ycombinator.com | 17 Jan 2023
  • Show HN: Sketch – AI code-writing assistant that understands data content
    1 project | /r/patient_hackernews | 16 Jan 2023

What are some alternatives?

When comparing buckaroo and sketch you can also consider the following projects:

electron-orbitals - Hydrogen electron orbitals, and the software to render them.

RasaGPT - 💬 RasaGPT is the first headless LLM chatbot platform built on top of Rasa and Langchain. Built w/ Rasa, FastAPI, Langchain, LlamaIndex, SQLModel, pgvector, ngrok, telegram

resume

lmql - A language for constraint-guided and efficient LLM programming.

resume - My résumé.

gpt_index - LlamaIndex (GPT Index) is a project that provides a central interface to connect your LLM's with external data. [Moved to: https://github.com/jerryjliu/llama_index]

applin-rails-demo - Example of how to use applin-rails.

pandas-ai - Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.

resume - My latest resume

langchain - ⚡ Building applications with LLMs through composability ⚡ [Moved to: https://github.com/langchain-ai/langchain]

resume

rasa - 💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants