Git Repo - linux.git/commit

slab: remove synchronous synchronize_sched() from memcg cache deactivation path

With kmem cgroup support enabled, kmem_caches can be created and
destroyed frequently and a great number of near empty kmem_caches can
accumulate if there are a lot of transient cgroups and the system is not
under memory pressure.  When memory reclaim starts under such
conditions, it can lead to consecutive deactivation and destruction of
many kmem_caches, easily hundreds of thousands on moderately large
systems, exposing scalability issues in the current slab management
code.  This is one of the patches to address the issue.

slub uses synchronize_sched() to deactivate a memcg cache.
synchronize_sched() is an expensive and slow operation and doesn't scale
when a huge number of caches are destroyed back-to-back.  While there
used to be a simple batching mechanism, the batching was too restricted
to be helpful.

This patch implements slab_deactivate_memcg_cache_rcu_sched() which slub
can use to schedule sched RCU callback instead of performing
synchronize_sched() synchronously while holding cgroup_mutex.  While
this adds online cpus, mems and slab_mutex operations, operating on
these locks back-to-back from the same kworker, which is what's gonna
happen when there are many to deactivate, isn't expensive at all and
this gets rid of the scalability problem completely.

Link: http://lkml.kernel.org/r/[email protected]
Signed-off-by: Tejun Heo <[email protected]>
Reported-by: Jay Vana <[email protected]>
Acked-by: Vladimir Davydov <[email protected]>
Cc: Christoph Lameter <[email protected]>
Cc: Pekka Enberg <[email protected]>
Cc: David Rientjes <[email protected]>
Cc: Joonsoo Kim <[email protected]>
Signed-off-by: Andrew Morton <[email protected]>
Signed-off-by: Linus Torvalds <[email protected]>

author	Tejun Heo <[email protected]>
	Wed, 22 Feb 2017 23:41:30 +0000 (15:41 -0800)
committer	Linus Torvalds <[email protected]>
	Thu, 23 Feb 2017 00:41:27 +0000 (16:41 -0800)
commit	01fb58bcba63f8fba37581c24c99e9a515dd0335
tree	475ebac1b656204783280c52acf315dfd3caea03	tree \| snapshot
parent	c9fc586403e7c85eee06b2d5dea14ce71c00fcd8	commit \| diff

include/linux/slab.h		diff \| blob \| blame \| history
mm/slab.h		diff \| blob \| blame \| history
mm/slab_common.c		diff \| blob \| blame \| history
mm/slub.c		diff \| blob \| blame \| history