@Prussia
Forked from Antwnis/Row count of Parquet files
Created December 14, 2017 09:53
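// Sum row counts from Parquet footer metadata; no data pages are read.
// conf is a Hadoop Configuration; fs is the FileStatus of the Parquet file or directory.
long rowCount = 0;
for (Footer f : ParquetFileReader.readFooters(conf, fs, false)) {
  for (BlockMetaData b : f.getParquetMetadata().getBlocks()) {
    rowCount += b.getRowCount();
  }
}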