We want to load data from S3 into Teradata. There are two variations, which may or may not require different handling:
Data in S3 has been placed there independent of any previous Kylo operation
Data is in S3 due to an earlier ingest flow in Kylo
In either case, we want to:
Select bucket locations (with a way to select 'this bucket and all below it in the hierarchy')
Identify and/or select file format (CSV, AVRO, Parquet, etc)
At that point, there are standard Kylo operations that can take place (schema reading, validation, etc).