Problem
We are using Cerebro tables to analyze our data lake. I access the data from Python, but my script returns no data: I can generate the token and list the dataset names, but I cannot retrieve the actual data.
I can view the data in the UI and pull it using curl commands.
Answer
Please use another mechanism (Hive, Spark, etc.) to retrieve the data, for two reasons:
1) pyOkera is not designed for very large tables with large numbers of partitions. The Spark or Presto clients are much better suited to these;
2) Prior to v1.5.2, pyOkera did not support complex data types (such as arrays).
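To check whether point 2 applies to you, compare your installed pyOkera version against 1.5.2. The following is only a minimal sketch: the helper names are hypothetical, and the distribution name "pyokera" passed to importlib.metadata is an assumption about how the package is installed in your environment.

```python
from importlib import metadata

# First pyOkera release with complex-type support, per the answer above.
MIN_COMPLEX_TYPE_VERSION = (1, 5, 2)


def parse_version(version):
    """Turn a version string such as '1.5.2' into a tuple of ints, (1, 5, 2)."""
    parts = []
    for piece in version.split(".")[:3]:
        # Keep only the leading digits of each component (e.g. '2rc1' -> 2).
        digits = ""
        for ch in piece:
            if not ch.isdigit():
                break
            digits += ch
        parts.append(int(digits) if digits else 0)
    # Pad short versions such as '1.4' to three components.
    while len(parts) < 3:
        parts.append(0)
    return tuple(parts)


def supports_complex_types(version):
    """Return True if this version is 1.5.2 or later."""
    return parse_version(version) >= MIN_COMPLEX_TYPE_VERSION


def installed_pyokera_version(dist_name="pyokera"):
    """Return the installed version string, or None if the package is not found.

    Assumption: the package is installed under the distribution name 'pyokera'.
    """
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    version = installed_pyokera_version()
    if version is None:
        print("pyOkera was not found under the assumed distribution name.")
    elif supports_complex_types(version):
        print(f"pyOkera {version}: complex data types should be supported.")
    else:
        print(f"pyOkera {version}: upgrade to 1.5.2 or later for complex data types.")
```

If the version is already 1.5.2 or later and the script still returns no data, point 1 (table size and partition count) is the more likely cause.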