SQL query engine for Apache Arrow data with distributed execution support.
Apache Arrow DataFusion and Ballista query engines
datafusion-cli
$ datafusion-cli
SELECT * FROM 'data.csv' WHERE age > 30;
$ datafusion-cli
SELECT COUNT(*) FROM 'large_dataset.parquet';
$ datafusion-cli
SELECT t1.id, COUNT(*) FROM 'table1.csv' t1 JOIN 'table2.parquet' t2 ON t1.id = t2.id GROUP BY t1.id;
$ datafusion-cli
\?
$ datafusion-cli < query.sql
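The last command reads its statements from query.sql, which is not shown here. A minimal sketch of preparing that file follows. The sample rows and column names in data.csv are hypothetical, chosen only so that the age filter from the earlier query has something to match. The script skips the run if datafusion-cli is not on PATH.

```shell
# Create a small sample CSV (hypothetical data) for the query to read.
cat > data.csv <<'EOF'
name,age
alice,34
bob,27
carol,41
EOF

# Put the filter query from the examples above into a file.
cat > query.sql <<'EOF'
SELECT * FROM 'data.csv' WHERE age > 30;
EOF

# Run the file non-interactively, but only if the CLI is installed.
if command -v datafusion-cli >/dev/null 2>&1; then
  datafusion-cli < query.sql
else
  echo "datafusion-cli not found on PATH; skipping run" >&2
fi
```

Feeding the file on standard input keeps the invocation identical to the interactive one, so the same statements can be tested in the prompt first and then saved for scripted runs.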