Kylo benchmark and optimisation on a 4-node HDP cluster for ingesting 25 GB of data spread over 1,000 files