Flushing just means that those writes aren't done synchronously (async is faster), so there's a backlog of them that still needs to be written out. For bulk data like this, that's normally the best way to do it. I don't think the async-ness is the cause of your slowness here; rather, the huge backlog is...
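
To make the sync vs. async distinction concrete, here is a rough sketch using plain file writes in Python. It is only an analogy and assumes nothing about your actual setup: `write_chunks`, `demo`, and the chunk sizes are made-up names and numbers for illustration. It times the write loop and the final flush separately, so you can see where the deferred backlog gets paid for when writes are not synced one by one.

```python
import os
import tempfile
import time


def write_chunks(path, chunks, sync_each):
    """Write chunks to path; optionally force each one to disk before continuing.

    Returns (seconds spent in the write loop, seconds spent in the final flush).
    """
    start = time.perf_counter()
    with open(path, "wb") as f:
        for chunk in chunks:
            f.write(chunk)
            if sync_each:
                f.flush()             # push Python's buffer to the OS
                os.fsync(f.fileno())  # block until the OS has written it out
        write_done = time.perf_counter()
        # Final flush: any backlog that was deferred above is written out here.
        f.flush()
        os.fsync(f.fileno())
    end = time.perf_counter()
    return write_done - start, end - write_done


def demo(n_chunks=200, chunk_size=64 * 1024):
    """Run the same workload async (flush once at the end) and sync (flush every write)."""
    chunks = [b"x" * chunk_size] * n_chunks
    results = {}
    with tempfile.TemporaryDirectory() as d:
        for name, sync_each in (("async", False), ("sync", True)):
            path = os.path.join(d, name + ".bin")
            results[name] = write_chunks(path, chunks, sync_each)
            # Both modes must end up with the same bytes on disk.
            assert os.path.getsize(path) == n_chunks * chunk_size
    return results


if __name__ == "__main__":
    for name, (write_time, flush_time) in demo().items():
        print(f"{name}: writes {write_time:.4f}s, final flush {flush_time:.4f}s")
```

The actual numbers depend heavily on the disk, filesystem, and OS cache, so treat this as a way to see the shape of the behaviour rather than a benchmark.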