datafusion

SQL query engine for Apache Arrow data with distributed execution support.

brew, macOS, Linux

About

Apache Arrow DataFusion and Ballista query engines

Commands

datafusion-cli

Examples

Run a SQL query against a CSV file
$ datafusion-cli -c "SELECT * FROM 'data.csv' WHERE age > 30;"

Query a Parquet file with SQL
$ datafusion-cli -c "SELECT COUNT(*) FROM 'large_dataset.parquet';"

Join and aggregate data from multiple sources
$ datafusion-cli -c "SELECT t1.id, COUNT(*) FROM 'table1.csv' t1 JOIN 'table2.parquet' t2 ON t1.id = t2.id GROUP BY t1.id;"

Start an interactive session (type \? at the prompt for help, \q to quit)
$ datafusion-cli

Execute SQL from a file
$ datafusion-cli -f query.sql
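
To try the examples end to end, the sketch below creates a small sample file and a query file, then runs the query if the tool is available. The file contents and column names are made up for illustration, and it assumes a datafusion-cli version that supports the -f flag and querying files directly by path.

```shell
# Create a small sample CSV (hypothetical data, for illustration only).
cat > data.csv <<'EOF'
name,age
alice,34
bob,27
carol,45
EOF

# Put the query in a file so it can be executed with -f.
cat > query.sql <<'EOF'
SELECT name, age FROM 'data.csv' WHERE age > 30;
EOF

# Run the query only if datafusion-cli is installed on this machine.
if command -v datafusion-cli >/dev/null 2>&1; then
  datafusion-cli -f query.sql
else
  echo "datafusion-cli not found; skipping query"
fi
```

Keeping the query in a file avoids shell quoting problems, since the single quotes around the file path would otherwise need to be nested inside a double-quoted -c argument.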