Do you have the same performance problem without "--enable-jemalloc"?
We enable it mainly for ceph/librbd performance in qemu 2.4; I just wonder whether this new commit could change the behaviour.
This bugzilla entry
https://bugzilla.redhat.com/show_bug.cgi?id=1251353
talks about jemalloc and tcmalloc before this...