batch processing - Take extremely long time in kettle pentaho job -
good day,
i have kettle pentaho file run batch job.
basically, files contain of 2 main steps, first step, read input file (txt file) , store inside table1. second step, same first step, read same input file , store inside table2.
this batch working fine until put in 20mb input file.it require more 7hours finish job.
below test case have done:
15360 records, 1.4mb, 2 minutes , 20 seconds (140 seconds total). 30720 records, 2.8mb , 7 minutes , 30 seconds (450 seconds total) 61440 records, 5.5mb, 26 minutes , 55 seconds (1615 seconds total). 250000 records, 20mb, 7 hours , 30 minutes in log, found there steps occupied of time consuming. follow: 1. text file input. 2. select values. 3. modified java script value. 
both main steps contain 3 kettle pentaho function. 20mb input file, first step take around 7 minutes, second step take more 7 hours.
try @ in quite long time, still cant find out problem.
kindly advise.
there might multiple reasons (i assume). first of all, try optimize steps "select values" , "modified javascript". of performance tuning tips given in here.
also may try increase java memory in pan.sh. check image below:

change javamaxmem somevalue higher 1024.
hope these changes might :)
Comments
Post a Comment