From patchwork Thu May 28 04:26:54 2026 Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit X-Patchwork-Submitter: "H.J. Lu" X-Patchwork-Id: 135854 Return-Path: X-Original-To: patchwork@sourceware.org Delivered-To: patchwork@sourceware.org Received: from vm01.sourceware.org (localhost [IPv6:::1]) by sourceware.org (Postfix) with ESMTP id 787ED4BAE7FE for ; Thu, 28 May 2026 04:28:14 +0000 (GMT) DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org 787ED4BAE7FE Authentication-Results: sourceware.org; dkim=pass (2048-bit key, unprotected) header.d=gmail.com header.i=@gmail.com header.a=rsa-sha256 header.s=20251104 header.b=hN7iaVrX X-Original-To: libc-alpha@sourceware.org Delivered-To: libc-alpha@sourceware.org Received: from mail-pj1-x1030.google.com (mail-pj1-x1030.google.com [IPv6:2607:f8b0:4864:20::1030]) by sourceware.org (Postfix) with ESMTPS id E86194BAE7ED for ; Thu, 28 May 2026 04:27:32 +0000 (GMT) DMARC-Filter: OpenDMARC Filter v1.4.2 sourceware.org E86194BAE7ED Authentication-Results: sourceware.org; dmarc=pass (p=none dis=none) header.from=gmail.com Authentication-Results: sourceware.org; spf=pass smtp.mailfrom=gmail.com ARC-Filter: OpenARC Filter v1.0.0 sourceware.org E86194BAE7ED Authentication-Results: sourceware.org; arc=pass smtp.remote-ip=2607:f8b0:4864:20::1030 ARC-Seal: i=2; a=rsa-sha256; d=sourceware.org; s=key; t=1779942453; cv=pass; b=tueboITD43xSJIWvyNKx+KkkH+LpEwGgmRsig+GfvYBaPNLnzg1rl8yxyEjv1Kg7OltfnICFrnynPSRWZGOsSLdhvUCYj5rm+ey5Oa0w1vf3XWEW4m7dJw2R/iSec9nsEXV0WOiw3KFgW0cVN4uwCH81KaKv7U+qBT4ouozkOVU= ARC-Message-Signature: i=2; a=rsa-sha256; d=sourceware.org; s=key; t=1779942453; c=relaxed/simple; bh=hqH3bow0J+X4Tt4AewTf4ccUssukk8zDJYratE7gYDQ=; h=DKIM-Signature:MIME-Version:From:Date:Message-ID:Subject:To; b=WfpiHufQ+SLZ/2X7oYynrphJ9qh43d3HdF8SX9xnBEZRH2K8FHqvasxFBQB3DcpZta7G7590jht3DDMc6CV2KtMkNEMpZPS5pVdr5HnoIsrX4lw0ZNH2hpyyWKFCRdyX9gvSNp0QBnyOyzS48SIzf1zYUW9o7EEOeapEQAw3MsY= ARC-Authentication-Results: i=2; sourceware.org; dkim=pass (2048-bit key, unprotected) header.d=gmail.com header.i=@gmail.com header.a=rsa-sha256 header.s=20251104 header.b=hN7iaVrX DKIM-Filter: OpenDKIM Filter v2.11.0 sourceware.org E86194BAE7ED Received: by mail-pj1-x1030.google.com with SMTP id 98e67ed59e1d1-36ac67f489aso3020380a91.0 for ; Wed, 27 May 2026 21:27:32 -0700 (PDT) ARC-Seal: i=1; a=rsa-sha256; t=1779942452; cv=none; d=google.com; s=arc-20240605; b=QX7JCuHJdY7zxI89z0VHuecE/gOX5LxBlOrFsT6Qs3vgGT7xQ6SZFiWfcPikjMW9ez edJAy+AYY8HaezxCu7mkvtRTMvTP1JEzoXgPcaoQk6ufxZscwNpG80REKA5PsJzpvXxp 2gPkx6/qe5S6BENqP+oooFpAgfpRayaDeaYC181ubLR177PqRLteKeBv6xos+oKwAdPv C/VStdP5KfGMhHn6QP4Gv3dvR2blae8XlUNvBV/Ca/FqOA9/FOFMxxPjL7o6ju+dT33V FAqTePHA4qkehLVg+hzZhJe3YtIMeAhahIi0kNmPod0Uts8h48eeGgzMrxzQV5lsne8G f7Cw== ARC-Message-Signature: i=1; a=rsa-sha256; c=relaxed/relaxed; d=google.com; s=arc-20240605; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :dkim-signature; bh=T9KpuSxCU8D8Q9DEkBEhRYPNodH3Va87nP36akVAL9w=; fh=Q6xDGpYyVVcqCAp5bMfFShKXn2jRyt11/a7LlkfFLvY=; b=Q5iWxwzSCIM1jARvyUQpTAict/DIZ8LkgU/3APZxoGa3Ab1CKzRvtmorgsKbxt38iO E8aN1Wzo4jog/TqeZokbdgQPgAa2Ivhvnk9sVkuPBtVdpE+73JGK9jE9qc+8IbmD5Cq+ bfW/UgTg0iTB5UPeAyLko/opd6pKuCHQBI9B5gZ89Jar/rH5rhJSr0ghHGDaX+X33uBl 8Uv8CVy1HFqQJqWPZgeqdBcOHo7V6xxFPY9eCM35OYGHrI+EL5TFLsqv3vKi5+Ny5a8A 2pyu3uZO38F3RnIn5fvxFwQIMviWtlK4RcKry0k0xhJ0Q3rW2BOiyi43peGUuxVblehS YqFA==; darn=sourceware.org ARC-Authentication-Results: i=1; mx.google.com; arc=none DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20251104; t=1779942452; x=1780547252; darn=sourceware.org; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :from:to:cc:subject:date:message-id:reply-to; bh=T9KpuSxCU8D8Q9DEkBEhRYPNodH3Va87nP36akVAL9w=; b=hN7iaVrXiSC3tQkFYPSgj0U3JECZHVKO7w28cZX1tZ+xlp+YoruToNAtzdVx22dE0z CK8/TKASZysoT9S8dmMikVeseM1lGZ3SiDaF9idABMQhRbOPdnbOL7O4J2x4HNz0B5Ie Xx6JYkR9ZOqlmGT/gpkEt9cPXwJM7XDGaWPoe2jPB51XQG5EPWhr+wi0jp127UmZbhyB 1SUCwDUYD16YC4x+iyQ8l+/C1GTRqZ/HYUd9vHyd1wkf2EVVwxwuWEVJzVFCqeKyHRF2 YRjrva+6zxdjPWu8UuwIeh3NqHL3hV6tI7qWyTXIvFMb1RcQXo/w6NqqpuHiNuhsQMC3 7xDw== X-Google-DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=1e100.net; s=20251104; t=1779942452; x=1780547252; h=to:subject:message-id:date:from:in-reply-to:references:mime-version :x-gm-gg:x-gm-message-state:from:to:cc:subject:date:message-id :reply-to; bh=T9KpuSxCU8D8Q9DEkBEhRYPNodH3Va87nP36akVAL9w=; b=Ulykp6wYzSXspOFTHp/knIGjQso5IUQ7u6SMBh3L10wLh8joBkH+RRAkEgxHT39RHX P3S03fYzpRCb6iM2szDF531rQkPKGBRINfyothzz7Oc78p0brWrBy6+NGAreQgKZWOkT 9OqmsZhJfzKVJnmID2V/UaAbRzn+OtKEonmqoYl+wTF7yn7JJJHSAOSeLYEL4AMf6lGW n4f4wMBnEZsdnfMdbWR57ncWNIntCUaqpgdJQIQKCjWAOpxMX8+C9Z25t4Hmd82vCvsj /91vMNBBcDF2TPDLB4pSBNVO3zRiW5SZbroxA9ZVqao4yzlYj6xXfZVOmRP2ed4w0pHQ II8w== X-Gm-Message-State: AOJu0YxX7UvT7n5MF90mO7ccXZ826ZfZk2pZOMo/rm4cmIzshM+VHJ38 qPstccdNGmlDxR60u3v6vCXsN+kPuDbCxDSd4hFkrr2RiiFCaUOuQj1Un2gVs1Dxvk5IDP4YdT3 y8z1aErfc/Vi8XbRfvy+JilJrLeJGOEGoEhLiBfx6Ow== X-Gm-Gg: Acq92OHbMgFfczTywDJRXoG6wdAXh69M6aq3O7StvtqDoDWrOOojDmUgN71kpdoEFdb Joz/RW/LMsWIseIN+N6eAI6+cnheMVcIWDdIoCVyhor4v2CelvXXIbPRogA4U8J9EhM4YtKAHLt VBMQdIM/jupfeEPU5V6ol+MwJ9U574MtKlslrFkfMUpKTjlqUJR7bd4Gai1XPI3LQCvX4BpYoin rInT6fxklmcJbbPFqel3XV8mRdIICBXvbeXDRECVujBoH2JFpdku8Yhg4ib4hQF4xUDf945wexG h2jIqLt1YJi2GN7K3A== X-Received: by 2002:a17:90b:544b:b0:36b:9798:4f68 with SMTP id 98e67ed59e1d1-36b9c9717d1mr5596a91.9.1779942451770; Wed, 27 May 2026 21:27:31 -0700 (PDT) MIME-Version: 1.0 References: In-Reply-To: From: "H.J. Lu" Date: Thu, 28 May 2026 12:26:54 +0800 X-Gm-Features: AVHnY4IkdzXMGsXT8qwDCbrwxmZC6oBU_nrU443ATndlNXxpfoDW8S6OryrcH7s Message-ID: Subject: [PATCH v11] elf: Support THP segment load with madvise enabled THP To: GNU C Library , DJ Delorie , Adhemerval Zanella , Wilco Dijkstra , "Carlos O'Donell" X-Spam-Status: No, score=-3009.5 required=5.0 tests=BAYES_00, DKIM_SIGNED, DKIM_VALID, DKIM_VALID_AU, DKIM_VALID_EF, FREEMAIL_FROM, GIT_PATCH_0, KAM_SHORT, RCVD_IN_DNSWL_NONE, SPF_HELO_NONE, SPF_PASS, TXREP, URIBL_BLOCKED shortcircuit=no autolearn=ham autolearn_force=no version=3.4.6 X-Spam-Checker-Version: SpamAssassin 3.4.6 (2021-04-09) on sourceware.org X-BeenThere: libc-alpha@sourceware.org X-Mailman-Version: 2.1.30 Precedence: list List-Id: Libc-alpha mailing list List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , Errors-To: libc-alpha-bounces~patchwork=sourceware.org@sourceware.org Changes in v11: 1. Build THP PDE tests with $(load-address-ldflag)=$(THP-PAGE-SIZE) to work around: https://sourceware.org/bugzilla/show_bug.cgi?id=34184 2. Update strace-tst-thp.sh to run static THP tests directly. From 75a068e17c8104beb1f5a982fa3330cc2b3ef06b Mon Sep 17 00:00:00 2001 From: "H.J. Lu" Date: Mon, 13 Apr 2026 08:23:05 +0800 Subject: [PATCH v11] elf: Support THP segment load with madvise enabled THP The current THP segment load approach works only when THP is enabled with always in the kernel. If THP is enabled with madvise in the kernel, to enable THP segment load in an application, madvise should be called with MADV_HUGEPAGE on all THP eligible PT_LOAD segments: 1. Define DL_MAP_DEFAULT_THP_PAGESIZE in hugepages.h and default it to 0. If DL_MAP_DEFAULT_THP_PAGESIZE is defined, assume kernel THP madvise mode. If kernel THP mode is always or never, there is an extra madvise call which has no impact. DL_MAP_DEFAULT_THP_PAGESIZE is defined for x86 and 64-bit loongarch. 2. Update _dl_map_segment_align to support madvise THP mode. This fixes BZ #34079. 3. Call _dl_executable_postprocess in rtld_setup_main_map for dynamic executables and in LIBC_START_MAIN for static executables, which calls madvise with MADV_HUGEPAGE on all THP eligible PT_LOAD segments in executable. This fixes BZ #34080 for both dynamic and static executables. 4. Call _dl_postprocess_loadcmd_extra in _dl_postprocess_loadcmd, which calls madvise with MADV_HUGEPAGE on all THP eligible PT_LOAD segments when loading an object after they have been mapped in. This fixes BZ #34080 for shared objects. 5. Set the maximum page alignment on THP tests to THP page size as the default maximum page alignment may be smaller than THP page size. 6. Add tests to verify that large executable PT_LOAD segments in executables are mapped at addresses aligned to THP page size when the kernel is configured to use THP in "always" mode or "madvise" mode by inspecting /proc/self/maps to check that the mapping address is aligned to THP page size reported by the kernel. Also verify that madvise is called with MADV_HUGEPAGE when the glibc tunable glibc.elf.thp=1 is used and madvise isn't called with MADV_HUGEPAGE when the glibc tunable glibc.elf.thp=0 is used. Skip these tests if THP page size cannot be determined or if THP is not enabled in "always" mode nor "madvise" mode. Quote WANG Rui : From benchmarking a clang build of the Linux kernel on x86_64 with your patch in THP madvise mode, I observed that iTLB misses were reduced, similar to what we see in THP always mode. NB: Some THP tests fail on arm due to limitations of arm32 kABI: https://sourceware.org/bugzilla/show_bug.cgi?id=34096 Signed-off-by: H.J. Lu --- csu/libc-start.c | 4 + elf/dl-load.h | 3 + elf/dl-support.c | 6 + elf/rtld.c | 17 +- sysdeps/generic/dl-exec-post.h | 34 ++++ sysdeps/generic/dl-load-post.h | 23 +++ sysdeps/generic/hugepages.h | 8 +- sysdeps/generic/ldsodefs.h | 12 ++ sysdeps/unix/sysv/linux/Makefile | 165 +++++++++++++++++- sysdeps/unix/sysv/linux/arm/Makefile | 7 + sysdeps/unix/sysv/linux/dl-exec-post.h | 126 +++++++++++++ sysdeps/unix/sysv/linux/dl-load-post.h | 32 ++++ .../unix/sysv/linux/dl-map-segment-align.c | 33 +--- .../unix/sysv/linux/dl-map-segment-align.h | 17 +- sysdeps/unix/sysv/linux/ldsodefs.h | 3 + sysdeps/unix/sysv/linux/loongarch/Makefile | 3 + .../{dl-map-segment-align.h => hugepages.h} | 4 +- sysdeps/unix/sysv/linux/strace-tst-thp.sh | 80 +++++++++ .../unix/sysv/linux/tst-thp-1-no-s-code-pde.c | 19 ++ .../sysv/linux/tst-thp-1-no-s-code-static.c | 19 ++ sysdeps/unix/sysv/linux/tst-thp-1-no-s-code.c | 19 ++ sysdeps/unix/sysv/linux/tst-thp-1-pde.c | 19 ++ sysdeps/unix/sysv/linux/tst-thp-1-static.c | 19 ++ sysdeps/unix/sysv/linux/tst-thp-1.c | 28 +++ sysdeps/unix/sysv/linux/tst-thp-align-check.h | 124 +++++++++++++ sysdeps/unix/sysv/linux/tst-thp-align.c | 123 +------------ sysdeps/unix/sysv/linux/x86/hugepages.h | 22 +++ 27 files changed, 805 insertions(+), 164 deletions(-) create mode 100644 sysdeps/generic/dl-exec-post.h create mode 100644 sysdeps/generic/dl-load-post.h create mode 100644 sysdeps/unix/sysv/linux/dl-exec-post.h create mode 100644 sysdeps/unix/sysv/linux/dl-load-post.h rename sysdeps/unix/sysv/linux/loongarch/lp64/{dl-map-segment-align.h => hugepages.h} (90%) create mode 100644 sysdeps/unix/sysv/linux/strace-tst-thp.sh create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-pde.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-static.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1-no-s-code.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1-pde.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1-static.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-1.c create mode 100644 sysdeps/unix/sysv/linux/tst-thp-align-check.h create mode 100644 sysdeps/unix/sysv/linux/x86/hugepages.h diff --git a/csu/libc-start.c b/csu/libc-start.c index 03d770ef15..bb106b524a 100644 --- a/csu/libc-start.c +++ b/csu/libc-start.c @@ -205,6 +205,7 @@ call_fini (void *unused) #endif /* !SHARED */ #include +#include STATIC int LIBC_START_MAIN (int (*main) (int, char **, char ** MAIN_AUXVEC_DECL), @@ -300,6 +301,9 @@ LIBC_START_MAIN (int (*main) (int, char **, char ** MAIN_AUXVEC_DECL), __pointer_chk_guard_local = pointer_chk_guard; # endif + struct link_map *main_map = _dl_get_dl_main_map (); + _dl_executable_postprocess (main_map, GL(dl_phdr), GL(dl_phnum)); + /* Now that the TCB, canary, and pointer guard are in place, run the deferred IFUNC relocations. For non-PIE static binaries this is ARCH_SETUP_IREL (apply_irel); for static-pie it is the IRELATIVE diff --git a/elf/dl-load.h b/elf/dl-load.h index 80ae5db4b3..84b82e183d 100644 --- a/elf/dl-load.h +++ b/elf/dl-load.h @@ -112,6 +112,7 @@ struct loadcmd int prot; /* PROT_* bits. */ }; +#include /* Iterator for program header segments. Initialize with _dl_pt_load_iterator_init, then either walk PT_LOAD segments via @@ -215,6 +216,8 @@ _dl_postprocess_loadcmd (struct link_map *l, const ElfW(Ehdr) *header, /* Found the program header in this segment. */ l->l_phdr = (void *) (uintptr_t) (c->mapstart + header->e_phoff - c->mapoff); + + _dl_postprocess_loadcmd_extra (l, c); } diff --git a/elf/dl-support.c b/elf/dl-support.c index 0508d6113b..a8114de003 100644 --- a/elf/dl-support.c +++ b/elf/dl-support.c @@ -179,6 +179,12 @@ int _dl_stack_cache_lock; #endif struct dl_scope_free_list *_dl_scope_free_list; +#ifdef HAVE_THP +int _dl_elf_thp_control = -1; +enum thp_mode_t _dl_thp_mode; +size_t _dl_elf_thp_pagesize; +#endif + #ifdef NEED_DL_SYSINFO /* Needed for improved syscall handling on at least x86/Linux. NB: Don't initialize it here to avoid RELATIVE relocation in static PIE. */ diff --git a/elf/rtld.c b/elf/rtld.c index 12e1b4dd71..e1ec899756 100644 --- a/elf/rtld.c +++ b/elf/rtld.c @@ -52,6 +52,7 @@ #include #include #include +#include #include @@ -323,6 +324,9 @@ struct rtld_global _rtld_global = /* Generally the default presumption without further information is an * executable stack but this is not true for all platforms. */ ._dl_stack_prot_flags = DEFAULT_STACK_PROT_PERMS, +#ifdef HAVE_THP + ._dl_elf_thp_control = -1, +#endif #ifdef _LIBC_REENTRANT ._dl_load_lock = _RTLD_LOCK_RECURSIVE_INITIALIZER, ._dl_load_write_lock = _RTLD_LOCK_RECURSIVE_INITIALIZER, @@ -1209,14 +1213,8 @@ rtld_setup_main_map (struct link_map *main_map) main_map->l_relro_size = ph->p_memsz; break; } - /* Process program headers again, but scan them backwards since - PT_GNU_PROPERTY is close to the end of program headers. */ - for (const ElfW(Phdr) *ph = &phdr[phnum]; ph != phdr; --ph) - if (ph[-1].p_type == PT_GNU_PROPERTY) - { - _dl_process_pt_gnu_property (main_map, -1, &ph[-1]); - break; - } + + _dl_executable_postprocess (main_map, phdr, phnum); /* Adjust the address of the TLS initialization image in case the executable is actually an ET_DYN object. */ @@ -1589,6 +1587,9 @@ dl_main (const ElfW(Phdr) *phdr, { RTLD_TIMING_VAR (start); rtld_timer_start (&start); +#ifdef HAVE_THP + _dl_get_thp_config (); +#endif _dl_map_object (NULL, rtld_progname, lt_executable, 0, __RTLD_OPENEXEC, LM_ID_BASE); rtld_timer_stop (&load_time, start); diff --git a/sysdeps/generic/dl-exec-post.h b/sysdeps/generic/dl-exec-post.h new file mode 100644 index 0000000000..f5dcdc093a --- /dev/null +++ b/sysdeps/generic/dl-exec-post.h @@ -0,0 +1,34 @@ +/* _dl_executable_postprocess. Generic version. + Copyright (C) 2026 Free Software Foundation, Inc. + Copyright The GNU Toolchain Authors. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +static inline void +_dl_executable_postprocess (struct link_map *main_map, + const ElfW(Phdr) *phdr, ElfW(Word) phnum) +{ +#ifdef SHARED + /* Process program headers again, but scan them backwards since + PT_GNU_PROPERTY is close to the end of program headers. */ + for (const ElfW(Phdr) *ph = &phdr[phnum]; ph != phdr; --ph) + if (ph[-1].p_type == PT_GNU_PROPERTY) + { + _dl_process_pt_gnu_property (main_map, -1, &ph[-1]); + break; + } +#endif +} diff --git a/sysdeps/generic/dl-load-post.h b/sysdeps/generic/dl-load-post.h new file mode 100644 index 0000000000..1a6d205e33 --- /dev/null +++ b/sysdeps/generic/dl-load-post.h @@ -0,0 +1,23 @@ +/* _dl_postprocess_loadcmd_extra. Generic version. + Copyright (C) 2026 Free Software Foundation, Inc. + Copyright The GNU Toolchain Authors. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +static inline void +_dl_postprocess_loadcmd_extra (struct link_map *l, const struct loadcmd *c) +{ +} diff --git a/sysdeps/generic/hugepages.h b/sysdeps/generic/hugepages.h index 5fc9b5c8de..f7f4957e79 100644 --- a/sysdeps/generic/hugepages.h +++ b/sysdeps/generic/hugepages.h @@ -26,10 +26,10 @@ unsigned long int __get_thp_size (void) attribute_hidden; enum thp_mode_t { + thp_mode_not_supported = 0, thp_mode_always, thp_mode_madvise, - thp_mode_never, - thp_mode_not_supported + thp_mode_never }; enum thp_mode_t __get_thp_mode (void) attribute_hidden; @@ -45,6 +45,10 @@ void __get_hugepage_config (size_t requested, size_t *pagesize, int *flags) # define MALLOC_DEFAULT_THP_PAGESIZE 0 #endif +#ifndef DL_MAP_DEFAULT_THP_PAGESIZE +# define DL_MAP_DEFAULT_THP_PAGESIZE 0 +#endif + #ifndef MAX_THP_PAGESIZE # define MAX_THP_PAGESIZE (32 * 1024 * 1024) #endif diff --git a/sysdeps/generic/ldsodefs.h b/sysdeps/generic/ldsodefs.h index 24529db8a1..ee58e2778a 100644 --- a/sysdeps/generic/ldsodefs.h +++ b/sysdeps/generic/ldsodefs.h @@ -39,6 +39,7 @@ #include #include #include +#include __BEGIN_DECLS @@ -477,6 +478,17 @@ struct rtld_global EXTERN struct __pthread **_dl_pthread_threads; __mach_rwlock_define (EXTERN, _dl_pthread_threads_lock) #endif +#ifdef HAVE_THP + /* The THP segment load control: + > 0: Enabled by GLIBC_TUNABLES=glibc.elf.thp=1. + 0: Disabled by GLIBC_TUNABLES=glibc.elf.thp=0. + < 0: To be enabled or disabled by GLIBC_TUNABLES. */ + EXTERN int _dl_elf_thp_control; + /* The kernel THP mode. */ + EXTERN enum thp_mode_t _dl_thp_mode; + /* Page size used for THP segment load. */ + EXTERN size_t _dl_elf_thp_pagesize; +#endif #ifdef SHARED }; # define __rtld_global_attribute__ diff --git a/sysdeps/unix/sysv/linux/Makefile b/sysdeps/unix/sysv/linux/Makefile index 63e7046cb3..d825fe853a 100644 --- a/sysdeps/unix/sysv/linux/Makefile +++ b/sysdeps/unix/sysv/linux/Makefile @@ -698,11 +698,15 @@ $(objpfx)pldd: $(objpfx)xmalloc.o tests += \ tst-rseq-tls-range \ tst-rseq-tls-range-4096 \ + tst-thp-1 \ + tst-thp-1-pde \ + tst-thp-1-static \ tst-thp-align \ # tests tests-static += \ tst-rseq-tls-range-4096-static \ tst-rseq-tls-range-static \ + tst-thp-1-static \ # tests-static modules-names += \ tst-rseq-tls-range-mod \ @@ -712,15 +716,12 @@ CFLAGS-tst-rseq-tls-range.c += -DMAIN_TLS_ALIGN=4 CFLAGS-tst-rseq-tls-range-4096.c += -DMAIN_TLS_ALIGN=4096 CFLAGS-tst-rseq-tls-range-static.c += -DMAIN_TLS_ALIGN=4 CFLAGS-tst-rseq-tls-range-4096-static.c += -DMAIN_TLS_ALIGN=4096 -LDFLAGS-tst-thp-size-mod.so += -Wl,-z,noseparate-code $(objpfx)tst-rseq-tls-range.out: $(objpfx)tst-rseq-tls-range-mod.so $(objpfx)tst-rseq-tls-range-4096.out: $(objpfx)tst-rseq-tls-range-mod.so $(objpfx)tst-rseq-tls-range-static.out: $(objpfx)tst-rseq-tls-range-mod.so $(objpfx)tst-rseq-tls-range-4096-static.out: $(objpfx)tst-rseq-tls-range-mod.so -$(objpfx)tst-thp-align.out: $(objpfx)tst-thp-size-mod.so tst-rseq-tls-range-static-ENV = LD_LIBRARY_PATH=$(objpfx):$(common-objpfx) tst-rseq-tls-range-4096-static-ENV = LD_LIBRARY_PATH=$(objpfx):$(common-objpfx) -tst-thp-align-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 test-internal-extras += tst-nolink-libc ifeq ($(run-built-tests),yes) @@ -729,6 +730,164 @@ tests-special += \ $(objpfx)tst-nolink-libc-2.out \ # tests-special endif + +ifndef THP-PAGE-SIZE +# Align PT_LOAD segments in THP tests to THP page size so that kernel will +# map PIE to the address aligned to THP page size. Default THP page size +# to 2MB which can be overridden in Makefile in subdirectories. +THP-PAGE-SIZE = 0x200000 +endif + +THP-PAGE-SIZE-LDFLAGS = -Wl,-z,max-page-size=$(THP-PAGE-SIZE) + +# -Wl,-z,max-page-size=$(THP-PAGE-SIZE) alone doesn't work for PDE when +# text-segment address is lower than the maximum page size: +# https://sourceware.org/bugzilla/show_bug.cgi?id=34184 +ifneq (,$(load-address-ldflag)) +LOAD-THP-ADDRESS-LDFLAGS = $(load-address-ldflag)=$(THP-PAGE-SIZE) +endif + +LDFLAGS-tst-thp-size-mod.so = -Wl,-z,noseparate-code \ + $(THP-PAGE-SIZE-LDFLAGS) +tst-thp-align-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 +$(objpfx)tst-thp-align.out: $(objpfx)tst-thp-size-mod.so + +tests += \ + tst-thp-1-no-s-code \ + tst-thp-1-no-s-code-pde \ + tst-thp-1-no-s-code-static \ +# tests +tests-static += \ + tst-thp-1-no-s-code-static \ +# tests-static + +LDFLAGS-tst-thp-1 = -Wl,-z,separate-code $(THP-PAGE-SIZE-LDFLAGS) +LDFLAGS-tst-thp-1-pde = -Wl,-z,separate-code $(THP-PAGE-SIZE-LDFLAGS) \ + $(LOAD-THP-ADDRESS-LDFLAGS) +LDFLAGS-tst-thp-1-static = -Wl,-z,separate-code $(THP-PAGE-SIZE-LDFLAGS) +ifneq (yes,$(enable-static-pie)) +LDFLAGS-tst-thp-1-static += $(LOAD-THP-ADDRESS-LDFLAGS) +endif +LDFLAGS-tst-thp-1-no-s-code = -Wl,-z,noseparate-code \ + $(THP-PAGE-SIZE-LDFLAGS) +LDFLAGS-tst-thp-1-no-s-code-pde = -Wl,-z,noseparate-code \ + $(THP-PAGE-SIZE-LDFLAGS) \ + $(LOAD-THP-ADDRESS-LDFLAGS) +LDFLAGS-tst-thp-1-no-s-code-static = -Wl,-z,noseparate-code \ + $(THP-PAGE-SIZE-LDFLAGS) +ifneq (yes,$(enable-static-pie)) +LDFLAGS-tst-thp-1-no-s-code-static += $(LOAD-THP-ADDRESS-LDFLAGS) +endif + +$(objpfx)tst-thp-1-no-s-code: $(objpfx)tst-thp-size-mod.o +$(objpfx)tst-thp-1-no-s-code-pde: $(objpfx)tst-thp-size-mod.o +$(objpfx)tst-thp-1-no-s-code-static: $(objpfx)tst-thp-size-mod.o + +tst-thp-1-no-s-code-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 +tst-thp-1-no-s-code-pde-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 +tst-thp-1-no-s-code-static-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 + +tst-thp-1-no-s-code-pde-no-pie = yes + +tst-thp-1-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 +tst-thp-1-pde-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 +tst-thp-1-static-ENV = GLIBC_TUNABLES=glibc.elf.thp=1 + +$(objpfx)tst-thp-1: $(objpfx)tst-thp-size-mod.o +$(objpfx)tst-thp-1-pde: $(objpfx)tst-thp-size-mod.o +$(objpfx)tst-thp-1-static: $(objpfx)tst-thp-size-mod.o + +tst-thp-1-pde-no-pie = yes + +# Don't run strace tests for cross-compiling. +ifeq (no,$(cross-compiling)) +thp-kernel-status = $(shell grep madvise /sys/kernel/mm/transparent_hugepage/enabled) +# Verify that madvise is called with MADV_HUGEPAGE when THP is enabled +# under madvise THP kernel. +ifneq ($(findstring [madvise],$(thp-kernel-status)),) +tests-special += \ + $(objpfx)strace-tst-thp-1-disabled.out \ + $(objpfx)strace-tst-thp-1-enabled.out \ + $(objpfx)strace-tst-thp-1-pde-disabled.out \ + $(objpfx)strace-tst-thp-1-pde-enabled.out \ + $(objpfx)strace-tst-thp-1-static-disabled.out \ + $(objpfx)strace-tst-thp-1-static-enabled.out \ + $(objpfx)strace-tst-thp-align-default.out \ + $(objpfx)strace-tst-thp-align-disabled.out \ + $(objpfx)strace-tst-thp-align-enabled.out \ +# tests-special + +$(objpfx)strace-tst-thp-1-enabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-1 + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=1' \ + '$(rpath-link)' $(objpfx)tst-thp-1 > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-1-disabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-1 + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=0' \ + '$(rpath-link)' $(objpfx)tst-thp-1 > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-1-pde-enabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-1-pde + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=1' \ + '$(rpath-link)' $(objpfx)tst-thp-1-pde > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-1-pde-disabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-1-pde + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=0' \ + '$(rpath-link)' $(objpfx)tst-thp-1-pde > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-1-static-enabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh \ + $(objpfx)tst-thp-1-static + $(SHELL) $< $(objpfx)tst-thp-1-static '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=1' > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-1-static-disabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh \ + $(objpfx)tst-thp-1-static + $(SHELL) $< $(objpfx)tst-thp-1-static '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=0' > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-align-default.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-align + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env)' \ + '$(rpath-link)' $(objpfx)tst-thp-align > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-align-enabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-align + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=1' \ + '$(rpath-link)' $(objpfx)tst-thp-align > $@; \ + $(evaluate-test) + +$(objpfx)strace-tst-thp-align-disabled.out: \ + $(..)sysdeps/unix/sysv/linux/strace-tst-thp.sh $(objpfx)ld.so \ + $(objpfx)tst-thp-align + $(SHELL) $< $(objpfx)ld.so '$(test-wrapper-env)' \ + '$(run-program-env) GLIBC_TUNABLES=glibc.elf.thp=0' \ + '$(rpath-link)' $(objpfx)tst-thp-align > $@; \ + $(evaluate-test) +endif # [madvise] +endif # $(cross-compiling) endif # $(subdir) == elf ifeq ($(subdir),rt) diff --git a/sysdeps/unix/sysv/linux/arm/Makefile b/sysdeps/unix/sysv/linux/arm/Makefile index e73ce4f811..1ee8bec9b9 100644 --- a/sysdeps/unix/sysv/linux/arm/Makefile +++ b/sysdeps/unix/sysv/linux/arm/Makefile @@ -3,6 +3,13 @@ sysdep-rtld-routines += aeabi_read_tp libc-do-syscall # The test uses INTERNAL_SYSCALL_CALL. In thumb mode, this uses # an undefined reference to __libc_do_syscall. CFLAGS-tst-nolink-libc.c += -marm + +# These tests fail on arm due to limitations of arm32 kABI: +# https://sourceware.org/bugzilla/show_bug.cgi?id=34096 +test-xfail-tst-thp-1-no-s-code-pde = yes +test-xfail-tst-thp-1-no-s-code-static = yes +test-xfail-tst-thp-1-pde = yes +test-xfail-tst-thp-1-static = yes endif ifeq ($(subdir),misc) diff --git a/sysdeps/unix/sysv/linux/dl-exec-post.h b/sysdeps/unix/sysv/linux/dl-exec-post.h new file mode 100644 index 0000000000..69505df402 --- /dev/null +++ b/sysdeps/unix/sysv/linux/dl-exec-post.h @@ -0,0 +1,126 @@ +/* _dl_executable_postprocess. Linux version. + Copyright (C) 2026 Free Software Foundation, Inc. + Copyright The GNU Toolchain Authors. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +static inline void +_dl_get_thp_config (void) +{ + /* Check if there is GLIBC_TUNABLES=glibc.elf.thp=[0|1]. */ + GL(dl_elf_thp_control) = TUNABLE_GET_FULL (glibc, elf, thp, int32_t, + NULL); + + /* Return if the tunable is not set or THP is disabled by the + tunable. */ + if (GL(dl_elf_thp_control) == 0) + return; + + _Static_assert (DL_MAP_DEFAULT_THP_PAGESIZE <= MAX_THP_PAGESIZE, + "DL_MAP_DEFAULT_THP_PAGESIZE <= MAX_THP_PAGESIZE"); + + /* NB: Accessing /sys/kernel/mm files is quite expensive and the file + may not be accessible in containers. If DL_MAP_DEFAULT_THP_PAGESIZE + is non-zero, assume THP mode is madvise and always call madvise. + Since madvise is a fast system call, it adds only a small overhead + compared to the cost of accessing /sys/kernel/mm files. */ + if (DL_MAP_DEFAULT_THP_PAGESIZE != 0) + { + GL(dl_elf_thp_pagesize) = DL_MAP_DEFAULT_THP_PAGESIZE; + GL(dl_thp_mode) = thp_mode_madvise; + } + else + { + GL(dl_thp_mode) = __get_thp_mode (); + if (GL(dl_thp_mode) == thp_mode_always + || GL(dl_thp_mode) == thp_mode_madvise) + { + GL(dl_elf_thp_pagesize) = __get_thp_size (); + /* We cap the huge page size at MAX_THP_PAGESIZE to avoid + over-aligning on systems with very large normal pages + (like 64K pages with 512M huge pages). */ + if (GL(dl_elf_thp_pagesize) > MAX_THP_PAGESIZE) + GL(dl_elf_thp_pagesize) = 0; + } + else + GL(dl_elf_thp_pagesize) = 0; + + if (GL(dl_elf_thp_pagesize) == 0) + { + GL(dl_elf_thp_control) = 0; + GL(dl_thp_mode) = thp_mode_not_supported; + } + } +} + +static inline void +_dl_executable_postprocess (struct link_map *main_map, + const ElfW(Phdr) *phdr, ElfW(Word) phnum) +{ + /* NB: In static executable, PT_GNU_PROPERTY is processed in target + libc-start.h if it is needed by target. When ld.so is used, if + a target doesn't need PT_GNU_PROPERTY, _dl_process_pt_gnu_property + is an empty function. */ +#ifdef SHARED + /* Process program headers again, but scan them backwards since + PT_GNU_PROPERTY is close to the end of program headers. */ + for (const ElfW(Phdr) *ph = &phdr[phnum]; ph != phdr; --ph) + if (ph[-1].p_type == PT_GNU_PROPERTY) + { + _dl_process_pt_gnu_property (main_map, -1, &ph[-1]); + break; + } +#endif + + /* If THP state was not yet initialized, the main executable was mapped + by the kernel; in that case this function is the only place that can + apply MADV_HUGEPAGE to the main executable's segments. Otherwise, + _dl_get_thp_config has already run earlier in dl_main and + _dl_map_segments has just mapped the main executable, so + _dl_postprocess_loadcmd_extra has already done the madvise pass; do + not repeat it here. */ + if (GL(dl_elf_thp_control) != -1) + return; + + _dl_get_thp_config (); + + /* Return if THP segment load isn't enabled. */ + if (GL(dl_elf_thp_control) <= 0) + return; + + /* NB: If DL_MAP_DEFAULT_THP_PAGESIZE is non-zero, dl_thp_mode is set + to thp_mode_madvise. */ + if (DL_MAP_DEFAULT_THP_PAGESIZE == 0 + && GL(dl_thp_mode) != thp_mode_madvise) + return; + + /* When we get here, the main executable have been mapped in. Call + madvise with MADV_HUGEPAGE for all THP eligible PT_LOAD segments. */ + + const ElfW(Phdr) *ph; + + size_t thp_pagesize = GL(dl_elf_thp_pagesize); + + /* Call __madvise if offset and address of the PT_LOAD segment are + aligned to THP page size and it is read-only. */ + for (ph = phdr; ph < &phdr[phnum]; ++ph) + if (ph->p_type == PT_LOAD + && ph->p_memsz >= thp_pagesize + && ((ph->p_vaddr | ph->p_offset) & (thp_pagesize - 1)) == 0 + && (ph->p_flags & (PF_W | PF_R)) == PF_R) + __madvise ((void *) (main_map->l_addr + ph->p_vaddr), + ph->p_memsz, MADV_HUGEPAGE); +} diff --git a/sysdeps/unix/sysv/linux/dl-load-post.h b/sysdeps/unix/sysv/linux/dl-load-post.h new file mode 100644 index 0000000000..88467764d4 --- /dev/null +++ b/sysdeps/unix/sysv/linux/dl-load-post.h @@ -0,0 +1,32 @@ +/* _dl_postprocess_loadcmd_extra. Linux version. + Copyright (C) 2026 Free Software Foundation, Inc. + Copyright The GNU Toolchain Authors. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +static bool _dl_segment_thp_eligible (const struct loadcmd *, size_t); + +/* After L has been mapped in, call madvise with MADV_HUGEPAGE for THP + madvise mode if L is THP eligible. */ + +static inline void +_dl_postprocess_loadcmd_extra (struct link_map *l, const struct loadcmd *c) +{ + if (GL(dl_thp_mode) == thp_mode_madvise + && _dl_segment_thp_eligible (c, GL(dl_elf_thp_pagesize))) + __madvise ((void *) (l->l_addr + c->mapstart), + c->mapend - c->mapstart, MADV_HUGEPAGE); +} diff --git a/sysdeps/unix/sysv/linux/dl-map-segment-align.c b/sysdeps/unix/sysv/linux/dl-map-segment-align.c index a39e74d91b..1260b26c22 100644 --- a/sysdeps/unix/sysv/linux/dl-map-segment-align.c +++ b/sysdeps/unix/sysv/linux/dl-map-segment-align.c @@ -17,38 +17,23 @@ License along with the GNU C Library; if not, see . */ +#include #include -#include -#include + +/* Return the alignment of the PT_LOAD segment for THP. P_ALIGN_MAX is + the maximum p_align value in the PT_LOAD segment. */ ElfW (Addr) _dl_map_segment_align (const struct loadcmd *c, ElfW (Addr) p_align_max) { - static enum thp_mode_t thp_mode = thp_mode_not_supported; - static unsigned long int thp_pagesize; + size_t thp_pagesize = GL(dl_elf_thp_pagesize); - if (TUNABLE_GET (glibc, elf, thp, int32_t, NULL) == 0) + if (GL(dl_elf_thp_control) <= 0 || p_align_max >= thp_pagesize) return p_align_max; - if (__glibc_unlikely (thp_mode == thp_mode_not_supported - || thp_pagesize == 0)) - { - unsigned long int default_thp_pagesize = DL_MAP_DEFAULT_THP_PAGESIZE; - thp_mode = default_thp_pagesize ? thp_mode_always : __get_thp_mode (); - thp_pagesize = default_thp_pagesize ? : __get_thp_size (); - } - - /* Aligning load segments that are large enough to the PMD size helps - improve THP eligibility and reduces TLB pressure. - We cap the huge page size at MAX_THP_PAGESIZE to avoid over-aligning - on systems with very large normal pages (like 64K pages with 512M - huge pages). */ - if (thp_mode == thp_mode_always - && thp_pagesize <= MAX_THP_PAGESIZE - && ((c->mapstart | c->mapoff) & (thp_pagesize - 1)) == 0 - && (c->mapend - c->mapstart) >= thp_pagesize - && p_align_max < thp_pagesize - && (c->prot & PROT_WRITE) == 0) + /* Return true if the segment is THP eligible. It helps improve THP + eligibility and reduces TLB pressure. */ + if (_dl_segment_thp_eligible (c, thp_pagesize)) return thp_pagesize; return p_align_max; diff --git a/sysdeps/unix/sysv/linux/dl-map-segment-align.h b/sysdeps/unix/sysv/linux/dl-map-segment-align.h index d9b05181b7..b904e128d8 100644 --- a/sysdeps/unix/sysv/linux/dl-map-segment-align.h +++ b/sysdeps/unix/sysv/linux/dl-map-segment-align.h @@ -19,9 +19,18 @@ #include -#ifndef DL_MAP_DEFAULT_THP_PAGESIZE -# define DL_MAP_DEFAULT_THP_PAGESIZE 0 -#endif - extern ElfW (Addr) _dl_map_segment_align (const struct loadcmd *, ElfW (Addr)) attribute_hidden; + +/* Return true only if the loadcmd C is THP eligible with THP page size + THP_PAGESIZE, which means that it is read-only, its size >= THP page + size, its offset and address of the loadcmd C are aligned to THP page + size. */ + +static inline bool +_dl_segment_thp_eligible (const struct loadcmd *c, size_t thp_pagesize) +{ + return ((c->prot & PROT_WRITE) == 0 + && (c->mapend - c->mapstart) >= thp_pagesize + && ((c->mapstart | c->mapoff) & (thp_pagesize - 1)) == 0); +} diff --git a/sysdeps/unix/sysv/linux/ldsodefs.h b/sysdeps/unix/sysv/linux/ldsodefs.h index c63b649432..e39d9afe34 100644 --- a/sysdeps/unix/sysv/linux/ldsodefs.h +++ b/sysdeps/unix/sysv/linux/ldsodefs.h @@ -21,6 +21,9 @@ /* We have the auxiliary vector. */ #define HAVE_AUX_VECTOR +/* We have transparent huge page. */ +#define HAVE_THP + /* Get the real definitions. */ #include_next diff --git a/sysdeps/unix/sysv/linux/loongarch/Makefile b/sysdeps/unix/sysv/linux/loongarch/Makefile index 0d5f087862..d5beb62440 100644 --- a/sysdeps/unix/sysv/linux/loongarch/Makefile +++ b/sysdeps/unix/sysv/linux/loongarch/Makefile @@ -12,3 +12,6 @@ abi-ilp32s-condition := __WORDSIZE == 32 && defined __loongarch_soft_float abi-ilp32d-condition := __WORDSIZE == 32 && defined __loongarch_double_float abi-lp64s-condition := __WORDSIZE == 64 && defined __loongarch_soft_float abi-lp64d-condition := __WORDSIZE == 64 && defined __loongarch_double_float + +# Align THP tests to 32MB. +THP-PAGE-SIZE = 0x2000000 diff --git a/sysdeps/unix/sysv/linux/loongarch/lp64/dl-map-segment-align.h b/sysdeps/unix/sysv/linux/loongarch/lp64/hugepages.h similarity index 90% rename from sysdeps/unix/sysv/linux/loongarch/lp64/dl-map-segment-align.h rename to sysdeps/unix/sysv/linux/loongarch/lp64/hugepages.h index c51ee4ac47..30252a9b86 100644 --- a/sysdeps/unix/sysv/linux/loongarch/lp64/dl-map-segment-align.h +++ b/sysdeps/unix/sysv/linux/loongarch/lp64/hugepages.h @@ -1,4 +1,4 @@ -/* _dl_map_segment_align. LoongArch64 Linux version. +/* Huge Page support. LoongArch64 Linux version. Copyright (C) 2026 Free Software Foundation, Inc. Copyright The GNU Toolchain Authors. This file is part of the GNU C Library. @@ -19,4 +19,4 @@ #define DL_MAP_DEFAULT_THP_PAGESIZE (32 * 1024 * 1024) -#include_next +#include_next diff --git a/sysdeps/unix/sysv/linux/strace-tst-thp.sh b/sysdeps/unix/sysv/linux/strace-tst-thp.sh new file mode 100644 index 0000000000..3ffb256c71 --- /dev/null +++ b/sysdeps/unix/sysv/linux/strace-tst-thp.sh @@ -0,0 +1,80 @@ +#!/bin/bash +# Run THP test under strace to verify control of the THP segment load. +# Copyright (C) 2026 Free Software Foundation, Inc. +# This file is part of the GNU C Library. + +# The GNU C Library is free software; you can redistribute it and/or +# modify it under the terms of the GNU Lesser General Public +# License as published by the Free Software Foundation; either +# version 2.1 of the License, or (at your option) any later version. + +# The GNU C Library is distributed in the hope that it will be useful, +# but WITHOUT ANY WARRANTY; without even the implied warranty of +# MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU +# Lesser General Public License for more details. + +# You should have received a copy of the GNU Lesser General Public +# License along with the GNU C Library; if not, see +# . + +set -e + +case x"${1}" in +*-static) + rtld= + test_wrapper_env="$2" + run_program_env="$3" + library_path= + test_prog="$1" + ;; +*) + rtld="$1" + test_wrapper_env="$2" + run_program_env="$3" + library_path="$4" + test_prog="$5" + ;; +esac + + +if test x"${rtld}" = x; then + cmd="${test_wrapper_env} ${run_program_env} strace ${test_prog}" +else + cmd="${test_wrapper_env} ${run_program_env} strace ${rtld} \ + --library-path ${library_path} ${test_prog}" +fi + +TIMEOUTFACTOR=${TIMEOUTFACTOR:-1} + +case x"${run_program_env}" in +*glibc.elf.thp=1*) + strace_expected=yes + ;; +*) + strace_expected=no + ;; +esac + +# Verify strace is not just present, but works in this environment. If +# not, skip the test. +/bin/sh -c \ + "${test_wrapper_env} ${run_program_env} \ + strace -e trace=none -- /bin/true" > /dev/null 2>&1 || exit 77 + +# Finally the actual test inside the test environment, using the just +# build ld.so and new libraries to run the THP test under strace. +if /bin/sh -c \ + "timeout -k 4 $((3*$TIMEOUTFACTOR)) ${cmd} --direct 2>&1 \ + | grep -E \"madvise.*, MADV_HUGEPAGE\""; then + if test ${strace_expected} = yes; then + exit 0 + else + exit 1 + fi +else + if test ${strace_expected} = no; then + exit 0 + else + exit 1 + fi +fi diff --git a/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-pde.c b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-pde.c new file mode 100644 index 0000000000..3fd01e9bfe --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-pde.c @@ -0,0 +1,19 @@ +/* Test PDE with THP segment load linked with -Wl,-z,noseparate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-1.c" diff --git a/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-static.c b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-static.c new file mode 100644 index 0000000000..d0ae0f1ff0 --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code-static.c @@ -0,0 +1,19 @@ +/* Test static with THP segment load linked with -Wl,-z,noseparate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-1.c" diff --git a/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code.c b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code.c new file mode 100644 index 0000000000..5eb1e005ed --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1-no-s-code.c @@ -0,0 +1,19 @@ +/* Test THP segment load linked with -Wl,-z,noseparate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-1.c" diff --git a/sysdeps/unix/sysv/linux/tst-thp-1-pde.c b/sysdeps/unix/sysv/linux/tst-thp-1-pde.c new file mode 100644 index 0000000000..d854dd43da --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1-pde.c @@ -0,0 +1,19 @@ +/* Test PDE with THP segment load linked with -Wl,-z,separate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-1.c" diff --git a/sysdeps/unix/sysv/linux/tst-thp-1-static.c b/sysdeps/unix/sysv/linux/tst-thp-1-static.c new file mode 100644 index 0000000000..66d7e12954 --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1-static.c @@ -0,0 +1,19 @@ +/* Test static with THP segment load linked with -Wl,-z,separate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-1.c" diff --git a/sysdeps/unix/sysv/linux/tst-thp-1.c b/sysdeps/unix/sysv/linux/tst-thp-1.c new file mode 100644 index 0000000000..49eea7069c --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-1.c @@ -0,0 +1,28 @@ +/* Test THP segment load linked with -Wl,-z,separate-code. + Copyright (C) 2026 Free Software Foundation, Inc. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include "tst-thp-align-check.h" + +static int +do_test (void) +{ + check_align ("tst-thp-1"); + return 0; +} + +#include diff --git a/sysdeps/unix/sysv/linux/tst-thp-align-check.h b/sysdeps/unix/sysv/linux/tst-thp-align-check.h new file mode 100644 index 0000000000..8f1efc0ef1 --- /dev/null +++ b/sysdeps/unix/sysv/linux/tst-thp-align-check.h @@ -0,0 +1,124 @@ +/* Test the THP compatible alignment of PT_LOAD segments. + + Copyright (C) 2026 Free Software Foundation, Inc. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#include +#include +#include +#include +#include +#include +#include +#include +#include +#undef attribute_hidden +#define attribute_hidden +#include /* For enum thp_mode_t and MAX_THP_PAGESIZE. */ +#undef attribute_hidden + +static unsigned long int +get_thp_size (void) +{ + int fd = open ("/sys/kernel/mm/transparent_hugepage/hpage_pmd_size", + O_RDONLY, 0); + if (fd == -1) + return 0; + + char str[INT_BUFSIZE_BOUND (unsigned long int)]; + ssize_t s = read (fd, str, sizeof (str)); + close (fd); + if (s < 0) + return 0; + + unsigned long int r = 0; + for (ssize_t i = 0; i < s; i++) + { + if (str[i] == '\n') + break; + r *= 10; + r += str[i] - '0'; + } + return r; +} + +static enum thp_mode_t +get_thp_mode (void) +{ + int fd = open ("/sys/kernel/mm/transparent_hugepage/enabled", O_RDONLY, 0); + if (fd == -1) + return thp_mode_not_supported; + + static const char mode_always[] = "[always] madvise never\n"; + static const char mode_madvise[] = "always [madvise] never\n"; + static const char mode_never[] = "always madvise [never]\n"; + + char str[sizeof(mode_always)]; + ssize_t s = read (fd, str, sizeof (str)); + if (s >= sizeof str || s < 0) + return thp_mode_not_supported; + str[s] = '\0'; + close (fd); + + if (s == sizeof (mode_always) - 1) + { + if (strcmp (str, mode_always) == 0) + return thp_mode_always; + else if (strcmp (str, mode_madvise) == 0) + return thp_mode_madvise; + else if (strcmp (str, mode_never) == 0) + return thp_mode_never; + } + return thp_mode_not_supported; +} + +static void +check_align (const char *name) +{ + unsigned long int thp_size = get_thp_size (); + enum thp_mode_t thp_mode = get_thp_mode (); + + if (thp_size == 0) + FAIL_UNSUPPORTED ("unable to get THP size.\n"); + + if (thp_size > MAX_THP_PAGESIZE) + FAIL_UNSUPPORTED ("THP size exceeds MAX_THP_PAGESIZE.\n"); + + if (thp_mode != thp_mode_always && thp_mode != thp_mode_madvise) + FAIL_UNSUPPORTED ("THP mode is not always nor madvise.\n"); + + FILE *f = xfopen ("/proc/self/maps", "r"); + char *line = NULL; + size_t len; + + while (xgetline (&line, &len, f)) + { + uintptr_t from, to; + char *prot = NULL, *path = NULL; + int r = sscanf (line, "%" SCNxPTR "-%" SCNxPTR "%ms%*s%*s%*s%ms", + &from, &to, &prot, &path); + + TEST_VERIFY (r == 3 || r == 4); + + if (strstr (prot, "x") && strstr (path, name)) + TEST_COMPARE (from % thp_size, 0); + + free (path); + } + + free (line); + xfclose (f); +} diff --git a/sysdeps/unix/sysv/linux/tst-thp-align.c b/sysdeps/unix/sysv/linux/tst-thp-align.c index 0b3f18e000..2e44109ba6 100644 --- a/sysdeps/unix/sysv/linux/tst-thp-align.c +++ b/sysdeps/unix/sysv/linux/tst-thp-align.c @@ -16,129 +16,10 @@ License along with the GNU C Library; if not, see . */ -#include -#include -#include -#include -#include -#include -#include #include -#include -#include +#include "tst-thp-align-check.h" #define THP_SIZE_MOD_NAME "tst-thp-size-mod.so" -#define MAX_THP_PAGESIZE (32 * 1024 * 1024) - -enum thp_mode_t -{ - thp_mode_always, - thp_mode_madvise, - thp_mode_never, - thp_mode_not_supported -}; - -static unsigned long int -get_thp_size (void) -{ - int fd = open ("/sys/kernel/mm/transparent_hugepage/hpage_pmd_size", - O_RDONLY, 0); - if (fd == -1) - return 0; - - char str[INT_BUFSIZE_BOUND (unsigned long int)]; - ssize_t s = read (fd, str, sizeof (str)); - close (fd); - if (s < 0) - return 0; - - unsigned long int r = 0; - for (ssize_t i = 0; i < s; i++) - { - if (str[i] == '\n') - break; - r *= 10; - r += str[i] - '0'; - } - return r; -} - -static enum thp_mode_t -get_thp_mode (void) -{ - int fd = open ("/sys/kernel/mm/transparent_hugepage/enabled", O_RDONLY, 0); - if (fd == -1) - return thp_mode_not_supported; - - static const char mode_always[] = "[always] madvise never\n"; - static const char mode_madvise[] = "always [madvise] never\n"; - static const char mode_never[] = "always madvise [never]\n"; - - char str[sizeof(mode_always)]; - ssize_t s = read (fd, str, sizeof (str)); - if (s >= sizeof str || s < 0) - return thp_mode_not_supported; - str[s] = '\0'; - close (fd); - - if (s == sizeof (mode_always) - 1) - { - if (strcmp (str, mode_always) == 0) - return thp_mode_always; - else if (strcmp (str, mode_madvise) == 0) - return thp_mode_madvise; - else if (strcmp (str, mode_never) == 0) - return thp_mode_never; - } - return thp_mode_not_supported; -} - -static void -check_align (void) -{ - unsigned long int thp_size = get_thp_size (); - enum thp_mode_t thp_mode = get_thp_mode (); - - if (thp_size == 0) - { - FAIL_UNSUPPORTED ("unable to get THP size.\n"); - return; - } - - if (thp_size > MAX_THP_PAGESIZE) - { - FAIL_UNSUPPORTED ("THP size exceeds MAX_THP_PAGESIZE.\n"); - return; - } - - if (thp_mode != thp_mode_always) - { - FAIL_UNSUPPORTED ("THP mode is not always.\n"); - return; - } - - FILE *f = xfopen ("/proc/self/maps", "r"); - char *line = NULL; - size_t len; - - while (xgetline (&line, &len, f)) - { - uintptr_t from, to; - char *prot = NULL, *path = NULL; - int r = sscanf (line, "%" SCNxPTR "-%" SCNxPTR "%ms%*s%*s%*s%ms", - &from, &to, &prot, &path); - - TEST_VERIFY (r == 3 || r == 4); - - if (strstr (prot, "x") && strstr (path, THP_SIZE_MOD_NAME)) - TEST_COMPARE (from % thp_size, 0); - - free (path); - } - - free (line); - xfclose (f); -} static int do_test (void) @@ -146,7 +27,7 @@ do_test (void) void *dl; dl = xdlopen (THP_SIZE_MOD_NAME, RTLD_NOW); - check_align (); + check_align (THP_SIZE_MOD_NAME); xdlclose (dl); return 0; diff --git a/sysdeps/unix/sysv/linux/x86/hugepages.h b/sysdeps/unix/sysv/linux/x86/hugepages.h new file mode 100644 index 0000000000..1a8c370969 --- /dev/null +++ b/sysdeps/unix/sysv/linux/x86/hugepages.h @@ -0,0 +1,22 @@ +/* Huge Page support. Linux/x86 version. + Copyright (C) 2026 Free Software Foundation, Inc. + Copyright The GNU Toolchain Authors. + This file is part of the GNU C Library. + + The GNU C Library is free software; you can redistribute it and/or + modify it under the terms of the GNU Lesser General Public + License as published by the Free Software Foundation; either + version 2.1 of the License, or (at your option) any later version. + + The GNU C Library is distributed in the hope that it will be useful, + but WITHOUT ANY WARRANTY; without even the implied warranty of + MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU + Lesser General Public License for more details. + + You should have received a copy of the GNU Lesser General Public + License along with the GNU C Library; if not, see + . */ + +#define DL_MAP_DEFAULT_THP_PAGESIZE (2 * 1024 * 1024) + +#include_next -- 2.54.0