linux-stable

History

Linus Torvalds ff887eb07c workqueue: Changes for v6.9 This cycle, a lot of workqueue changes including some that are significant and invasive. - During v6.6 cycle, unbound workqueues were updated so that they are more topology aware and flexible, which among other things improved workqueue behavior on modern multi-L3 CPUs. In the process, `636b927eba` ("workqueue: Make unbound workqueues to use per-cpu pool_workqueues") switched unbound workqueues to use per-CPU frontend pool_workqueues as a part of increasing front-back mapping flexibility. An unwelcome side effect of this change was that this made max concurrency enforcement per-CPU blowing up the maximum number of allowed concurrent executions. I incorrectly assumed that this wouldn't cause practical problems as most unbound workqueue users are self-regulate max concurrency; however, there definitely are which don't (e.g. on IO paths) and the drastic increase in the allowed max concurrency led to noticeable perf regressions in some use cases. This is now addressed by separating out max concurrency enforcement to a separate struct - wq_node_nr_active - which makes @max_active consistently mean system-wide max concurrency regardless of the number of CPUs or (finally) NUMA nodes. This is a rather invasive and, in places, a bit clunky; however, the clunkiness rises from the the inherent requirement to handle the disagreement between the execution locality domain and max concurrency enforcement domain on some modern machines. See `5797b1c189` ("workqueue: Implement system-wide nr_active enforcement for unbound workqueues") for more details. - BH workqueue support is added. They are similar to per-CPU workqueues but execute work items in the softirq context. This is expected to replace tasklet. However, currently, it's missing the ability to disable and enable work items which is needed to convert many tasklet users. To avoid crowding this merge window too much, this will be included in the next merge window. A separate pull request will be sent for the couple conversion patches that are currently pending. - Waiman plugged a long-standing hole in workqueue CPU isolation where ordered workqueues didn't follow wq_unbound_cpumask updates. Ordered workqueues now follow the same rules as other unbound workqueues. - More CPU isolation improvements: Juri fixed another deficit in workqueue isolation where unbound rescuers don't respect wq_unbound_cpumask. Leonardo fixed delayed_work timers firing on isolated CPUs. - Other misc changes. -----BEGIN PGP SIGNATURE----- iIQEABYKACwWIQTfIjM1kS57o3GsC/uxYfJx3gVYGQUCZe7JCQ4cdGpAa2VybmVs Lm9yZwAKCRCxYfJx3gVYGcnqAP9UP8zEM1la19cilhboDumxmRWyRpV/egFOqsMP Y5PuoAEAtsBJtQWtm5w46+y+fk3nK2ugXlQio2gH0qQcxX6SdgQ= =/ovv -----END PGP SIGNATURE----- Merge tag 'wq-for-6.9' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq Pull workqueue updates from Tejun Heo: "This cycle, a lot of workqueue changes including some that are significant and invasive. - During v6.6 cycle, unbound workqueues were updated so that they are more topology aware and flexible, which among other things improved workqueue behavior on modern multi-L3 CPUs. In the process, commit `636b927eba` ("workqueue: Make unbound workqueues to use per-cpu pool_workqueues") switched unbound workqueues to use per-CPU frontend pool_workqueues as a part of increasing front-back mapping flexibility. An unwelcome side effect of this change was that this made max concurrency enforcement per-CPU blowing up the maximum number of allowed concurrent executions. I incorrectly assumed that this wouldn't cause practical problems as most unbound workqueue users are self-regulate max concurrency; however, there definitely are which don't (e.g. on IO paths) and the drastic increase in the allowed max concurrency led to noticeable perf regressions in some use cases. This is now addressed by separating out max concurrency enforcement to a separate struct - wq_node_nr_active - which makes @max_active consistently mean system-wide max concurrency regardless of the number of CPUs or (finally) NUMA nodes. This is a rather invasive and, in places, a bit clunky; however, the clunkiness rises from the the inherent requirement to handle the disagreement between the execution locality domain and max concurrency enforcement domain on some modern machines. See commit `5797b1c189` ("workqueue: Implement system-wide nr_active enforcement for unbound workqueues") for more details. - BH workqueue support is added. They are similar to per-CPU workqueues but execute work items in the softirq context. This is expected to replace tasklet. However, currently, it's missing the ability to disable and enable work items which is needed to convert many tasklet users. To avoid crowding this merge window too much, this will be included in the next merge window. A separate pull request will be sent for the couple conversion patches that are currently pending. - Waiman plugged a long-standing hole in workqueue CPU isolation where ordered workqueues didn't follow wq_unbound_cpumask updates. Ordered workqueues now follow the same rules as other unbound workqueues. - More CPU isolation improvements: Juri fixed another deficit in workqueue isolation where unbound rescuers don't respect wq_unbound_cpumask. Leonardo fixed delayed_work timers firing on isolated CPUs. - Other misc changes" * tag 'wq-for-6.9' of git://git.kernel.org/pub/scm/linux/kernel/git/tj/wq: (54 commits) workqueue: Drain BH work items on hot-unplugged CPUs workqueue: Introduce from_work() helper for cleaner callback declarations workqueue: Control intensive warning threshold through cmdline workqueue: Make @flags handling consistent across set_work_data() and friends workqueue: Remove clear_work_data() workqueue: Factor out work_grab_pending() from __cancel_work_sync() workqueue: Clean up enum work_bits and related constants workqueue: Introduce work_cancel_flags workqueue: Use variable name irq_flags for saving local irq flags workqueue: Reorganize flush and cancel[_sync] functions workqueue: Rename __cancel_work_timer() to __cancel_timer_sync() workqueue: Use rcu_read_lock_any_held() instead of rcu_read_lock_held() workqueue: Cosmetic changes workqueue, irq_work: Build fix for !CONFIG_IRQ_WORK workqueue: Fix queue_work_on() with BH workqueues async: Use a dedicated unbound workqueue with raised min_active workqueue: Implement workqueue_set_min_active() workqueue: Fix kernel-doc comment of unplug_oldest_pwq() workqueue: Bind unbound workqueue rescuer to wq_unbound_cpumask kernel/workqueue: Let rescuers follow unbound wq cpumask changes ...		2024-03-11 12:50:42 -07:00
..
irq	Documentation: irqdomain: Fix typo of "at least once"	2022-08-18 11:11:52 -06:00
wrappers	docs: put atomic*.txt and memory-barriers.txt into the core-api book	2022-09-29 12:55:06 -06:00
asm-annotations.rst	docs: move x86 documentation into Documentation/arch/	2023-03-30 12:58:51 -06:00
assoc_array.rst	…
boot-time-mm.rst	…
cachetlb.rst	mm: remove ARCH_IMPLEMENTS_FLUSH_DCACHE_FOLIO	2023-08-24 16:20:19 -07:00
circular-buffers.rst	…
cpu_hotplug.rst	arch: Remove Itanium (IA-64) architecture	2023-09-11 08:13:17 +00:00
debug-objects.rst	…
debugging-via-ohci1394.rst	Documentation: Drop or replace remaining mentions of IA64	2023-09-11 08:13:18 +00:00
dma-api-howto.rst	docs: dma: update a reference to a moved document	2023-11-17 08:46:01 -07:00
dma-api.rst	docs: dma-api: Fix description of the sync_sg API	2023-11-17 08:52:13 -07:00
dma-attributes.rst	Reinstate some of "swiotlb: rework "fix info leak with DMA_FROM_DEVICE""	2022-03-28 11:37:05 -07:00
dma-isa-lpc.rst	…
entry.rst	…
errseq.rst	…
genalloc.rst	…
generic-radix-tree.rst	…
genericirq.rst	Docu: genericirq.rst: fix irq-example	2023-08-28 12:45:31 -06:00
gfp_mask-from-fs-io.rst	…
idr.rst	IDR: Note that the IDR API is deprecated	2022-07-10 21:17:30 -04:00
index.rst	docs: add more netlink docs (incl. spec docs)	2023-01-24 10:58:11 +01:00
kernel-api.rst	Documentation: core-api: Drop :export: for int_log.h	2023-07-25 17:40:25 +01:00
kobject.rst	…
kref.rst	…
librs.rst	…
local_ops.rst	timers: Update the documentation to reflect on the new timer_shutdown() API	2022-11-24 15:09:12 +01:00
maple_tree.rst	maple_tree: update the documentation of maple tree	2023-12-10 16:51:32 -08:00
memory-allocation.rst	mm/slab: document kfree() as allowed for kmem_cache_alloc() objects	2023-03-29 10:35:41 +02:00
memory-hotplug.rst	…
mm-api.rst	mm/slab, docs: switch mm-api docs generation from slab.c to slub.c	2023-12-05 11:11:34 +01:00
netlink.rst	doc/netlink: Update genetlink-legacy documentation	2023-08-27 17:17:09 -07:00
packing.rst	Documentation: core-api: packing: correct spelling	2023-02-15 21:40:54 -08:00
padata.rst	Documentation: core-api: padata: correct spelling	2023-02-16 16:58:01 -07:00
pin_user_pages.rst	Documentation/gpu: VM_BIND locking document	2023-11-29 20:54:43 +01:00
printk-basics.rst	…
printk-formats.rst	printk changes for 6.6	2023-09-04 13:20:19 -07:00
printk-index.rst	printk/index: Printk index feature documentation	2022-04-13 14:25:31 +02:00
protection-keys.rst	Documentation/protection-keys: Clean up documentation for User Space pkeys	2022-06-07 16:06:22 -07:00
rbtree.rst	…
refcount-vs-atomic.rst	…
symbol-namespaces.rst	doc: module: update file references	2022-07-01 14:50:01 -07:00
this_cpu_ops.rst	arch: Remove cmpxchg_double	2023-06-05 09:36:39 +02:00
timekeeping.rst	timekeeping: Introduce fast accessor to clock tai	2022-04-14 16:19:30 +02:00
tracepoint.rst	…
unaligned-memory-access.rst	…
watch_queue.rst	Documentation: move watch_queue to core-api	2022-04-22 09:47:25 -06:00
workqueue.rst	workqueue: Changes for v6.9	2024-03-11 12:50:42 -07:00
xarray.rst	…