itsscb/rust - rust - Gitea: Git with a cup of tea

mirror of https://github.com/rust-lang/rust.git synced 2025-11-18 09:26:28 +00:00

Author	SHA1	Message	Date
Trevor Gross	53a055049c	Add `roundeven{,f,f16,f128}` C23 specifies a new set of `roundeven` functions that round to the nearest integer, with ties to even. They do not raise any floating point exceptions. This behavior is similar to two other functions: 1. `rint`, which rounds to the nearest integer respecting rounding mode and possibly raising exceptions. 2. `nearbyint`, which is identical to `rint` except it may not raise exceptions. Technically `rint`, `nearbyint`, and `roundeven` all behave the same in Rust because we assume the default floating point environment. The backends are allowed to lower to `roundeven`, however, so we should provide it in case the fallback is needed. Add the `roundeven` family here and convert `rint` to a function that takes a rounding mode. This currently has no effect.	2025-02-11 00:55:22 -06:00
Trevor Gross	669731335e	Add `fminimum`, `fmaximum`, `fminimum_num`, and `fmaximum_num` These functions represent new operations from IEEE 754-2019. Introduce them for all float sizes.	2025-02-10 16:17:33 -06:00
Trevor Gross	2f0685a9a2	Implement `u256` with two `u128`s rather than `u64` This produces better assembly, e.g. on aarch64: .globl libm::u128_wmul .p2align 2 libm::u128_wmul: Lfunc_begin124: .cfi_startproc mul x9, x2, x0 umulh x10, x2, x0 umulh x11, x3, x0 mul x12, x3, x0 umulh x13, x2, x1 mul x14, x2, x1 umulh x15, x3, x1 mul x16, x3, x1 adds x10, x10, x14 cinc x13, x13, hs adds x13, x13, x16 cinc x14, x15, hs adds x10, x10, x12 cinc x11, x11, hs adds x11, x13, x11 stp x9, x10, [x8] cinc x9, x14, hs stp x11, x9, [x8, #16] ret The original was ~70 instructions so the improvement is significant. With these changes, the result is reasonably close to what LLVM generates using `u256` operands [1]. [1]: https://llvm.godbolt.org/z/re1aGdaqY	2025-02-09 23:41:51 -06:00
Trevor Gross	900b61f363	Change how operators are `black_box`ed For some reason, the upcoming limb changes in [1] seem to ignore the black boxing when applied to the operator function. Changing to instead black box the inputs appears to fix this. [1]: https://github.com/rust-lang/libm/pull/503	2025-02-08 04:49:44 -06:00
Trevor Gross	0a43f24a30	Add simple icount benchmarks for `u256` operations	2025-02-08 02:02:45 -06:00
Trevor Gross	9223d60dfa	Add `fmaf128` Resolve all remaining `f64`-specific items in the generic version of `fma`, then expose `fmaf128`.	2025-02-06 18:41:45 -06:00
Trevor Gross	e01ce5d53a	Commonize the signature for all instances of `get_test_cases` In order to make these more interchangeable in more places, always return `(impl Iterator, u64)`. This will facilitate using other generators for extensive tests.	2025-02-05 16:30:11 -06:00
Trevor Gross	eee632ee1b	Add checks via annotation that lists are sorted or exhaustive This crate has a handful of lists that need to list all API and can't easily be verified. Additionally, some longer lists should be kept sorted so they are easier to look through. Resolve both of these by adding a check in `update-api-list.py` that looks for annotations and verifies the contents are as expected. Annotations are `verify-apilist-start`, `verify-apilist-end`, `verify-sorted-start`, and `verify-sorted-end`. This includes fixes for anything that did not meet the criteria.	2025-02-05 15:18:05 +00:00
Trevor Gross	cc2874c9a9	Add `scalbnf16`, `scalbnf128`, `ldexpf16`, and `ldexpf128` Use the generic `scalbn` to provide `f16` and `f128` versions, which also work for `ldexp`. This involves a new algorithm for `f16` because the default does not converge fast enough with a limited number of rounds.	2025-02-05 13:37:54 +00:00
Trevor Gross	173a48ce8c	Enable missing icount benchmarks A few new functions were added but this list did not get updated. Do so here.	2025-01-24 03:39:49 -06:00
Trevor Gross	71200bc3ce	Add `fmodf128` This function is significantly slower than all others so includes an override in `EXTREMELY_SLOW_TESTS`. Without it, PR CI takes ~1hour and the extensive tests in CI take ~1day.	2025-01-24 08:23:15 +00:00
Trevor Gross	67218cbaa5	Add `fmodf16` using the generic implementation	2025-01-24 06:03:59 +00:00
Trevor Gross	6d5105c006	Add `fminf16`, `fmaxf16`, `fminf128`, and `fmaxf128`	2025-01-24 03:01:36 +00:00
Trevor Gross	d20a5e82a5	Add `roundf16` and `roundf128`	2025-01-24 01:59:10 +00:00
Trevor Gross	b22398d658	Add `rintf16` and `rintf128` Use the generic algorithms to provide implementations for these routines.	2025-01-22 11:04:39 +00:00
Trevor Gross	6a8bb0fa80	Add `floorf16` and `floorf128` Use the generic algorithms to provide implementations for these routines.	2025-01-22 08:50:06 +00:00
Trevor Gross	9064c42abe	Add `ceilf16` and `ceilf128` Use the generic algorithms to provide implementations for these routines.	2025-01-22 07:22:32 +00:00
Trevor Gross	186eac9227	Add `sqrtf16` and `sqrtf128` Use the generic algorithms to provide implementations for these routines.	2025-01-22 05:31:13 +00:00
Trevor Gross	5139ba6f46	Reduce the warm up and measurement time for `short-benchmarks` The icount benchmarks are what we will be relying on in CI more than the existing benchmarks. There isn't much reason to keep these around, but there isn't much point in dropping them either. So, just reduce the runtime.	2025-01-16 09:07:46 +00:00
Trevor Gross	490ebbb187	Add benchmarks using iai-callgrind Running walltime benchmarks in CI is notoriously unstable. Introduce benchmarks that instead use instruction count and other more reproducible metrics, using `iai-callgrind` [1], which we are able to run in CI with a high degree of reproducibility. Inputs to this benchmark are a logspace sweep, which gives an approximation for real-world use, but may fail to indicate outlier cases. [1]: https://github.com/iai-callgrind/iai-callgrind	2025-01-16 09:07:19 +00:00
Trevor Gross	2d857e1c21	Replace `HasDomain` to enable multi-argument edge case and domain tests This also allows reusing the same generator logic between logspace tests and extensive tests, so comes with a nice bit of cleanup. Changes: * Make the generator part of `CheckCtx` since a `Generator` and `CheckCtx` are almost always passed together. * Rename `domain_logspace` to `spaced` since this no longer only operates within a domain and we may want to handle integer spacing. * Domain is now calculated at runtime rather than using traits, which is much easier to work with. * With the above, domains for multidimensional functions are added. * The extensive test generator code has been combined with the domain_logspace generator code. With this, the domain tests have just become a subset of extensive tests. These were renamed to "quickspace" since, technically, the extensive tests are also "domain" or "domain logspace" tests. * Edge case generators now handle functions with multiple inputs. * The test runners can be significantly cleaned up and deduplicated.	2025-01-16 01:10:26 +00:00
Trevor Gross	13b5bf3959	Add `fdimf16` and `fdimf128` Use the generic algorithms to provide implementations for these routines.	2025-01-13 14:04:54 +00:00
Trevor Gross	b558b365d3	Add `truncf16` and `truncf128` Use the generic algorithms to provide implementations for these routines.	2025-01-13 10:12:09 +00:00
Trevor Gross	6b5e8b20f0	Add test infrastructure for `f16` and `f128` Update test traits to support `f16` and `f128`, as applicable. Add the new routines (`fabs` and `copysign` for `f16` and `f128`) to the list of all operations.	2025-01-06 04:10:51 -05:00
Trevor Gross	37dbc534cb	Rewrite the random test generator Currently, all inputs are generated and then cached. This works reasonably well but it isn't very configurable or extensible (adding `f16` and `f128` is awkward). Replace this with a trait for generating random sequences of tuples. This also removes possible storage limitations of caching all inputs.	2025-01-06 00:32:21 +00:00
Trevor Gross	3fb16fbdbe	macros: Always emit `f16_enabled` and `f128_enabled` attributes Once we start adding `f16` and `f128` routines, we will need to have this cfg for almost all uses of `for_each_function`. Rather than needing to specify this each time, always emit `#[cfg(f16_enabled)]` or `#[cfg(f128_enabled)]` for each function that uses `f16` or `f128`, respectively.	2025-01-02 17:38:09 -05:00
Trevor Gross	ba1d271158	Rename associated type helpers, add `OpITy` Change the names to make them less ambiguous. Additionally add `OpITy` for accessing the same-sized integer of an operation's float type.	2024-12-22 23:56:45 +00:00
Trevor Gross	4cdb9ec674	Introduce helper types for accessing trait items The ambiguous associated types error sometimes fires in cases where it shouldn't be ambiguous ([1]), which can make things clunky when working with chained associated types (e.g. `Op::FTy::Int::*` does not work). Add helper types that we can use instead of the full syntax. There aren't too many cases in-crate now but this is relevant for some open PRs. [1]: https://github.com/rust-lang/rust/issues/38078	2024-12-22 23:33:02 +00:00
Trevor Gross	5032fcf139	Change default ULP to use enum matching Migrate from string to enum matching and tie this to `CheckCtx::new`, so no tests need to explicitly set ULP.	2024-11-02 23:22:09 -05:00
Trevor Gross	f113f2be1e	Rename `Name` to `Identifier` to avoid some ambiguity of "name"	2024-11-02 22:42:05 -05:00
Trevor Gross	2fab4f4580	Change the `CheckCtx` constructor to take a `Name` enum This prepares to eliminate some reliance on string matching but does not yet make those changes.	2024-11-02 22:35:30 -05:00
Trevor Gross	f7f24a4ed8	Rework tests to make use of the new `MathOp` trait	2024-11-02 16:36:17 -05:00
Trevor Gross	abff7cd82a	Add benchmarks against musl libm Add a benchmark for each function that checks against `musl_math_sys`.	2024-10-31 22:40:30 -05:00
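
Commit 2f0685a9a2 describes storing a `u256` as two `u128` halves and mentions a widening multiply (`u128_wmul`). The following is a minimal sketch of that idea, not the crate's actual code: the names `U256Sketch` and `widen_mul_sketch` are hypothetical, and the schoolbook 64-bit limb decomposition is an assumption chosen for clarity.

```rust
/// Hypothetical 256-bit unsigned value stored as two `u128` halves.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
struct U256Sketch {
    lo: u128,
    hi: u128,
}

/// Multiply two `u128` values and return the full 256-bit product.
/// Illustrative only; the real implementation may differ.
fn widen_mul_sketch(a: u128, b: u128) -> U256Sketch {
    const MASK: u128 = u64::MAX as u128;

    // Split each operand into 64-bit limbs held in u128 so limb products cannot overflow.
    let (a_lo, a_hi) = (a & MASK, a >> 64);
    let (b_lo, b_hi) = (b & MASK, b >> 64);

    let ll = a_lo * b_lo; // contributes at bit 0
    let lh = a_lo * b_hi; // contributes at bit 64
    let hl = a_hi * b_lo; // contributes at bit 64
    let hh = a_hi * b_hi; // contributes at bit 128

    // Sum the column at bit 64: at most 3 * (2^64 - 1), so it fits in a u128.
    let mid = (ll >> 64) + (lh & MASK) + (hl & MASK);

    // Low half: low limb of `ll` plus the low 64 bits of `mid` (the shift drops the carry).
    let lo = (ll & MASK) | (mid << 64);
    // High half: `hh`, the upper limbs of the cross terms, and the carry out of `mid`.
    let hi = hh + (lh >> 64) + (hl >> 64) + (mid >> 64);

    U256Sketch { lo, hi }
}
```

For example, `widen_mul_sketch(1 << 64, 1 << 64)` yields `lo == 0` and `hi == 1`, since the product is exactly 2^128.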

33 Commits
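
Commit 53a055049c describes `roundeven` as rounding to the nearest integer with ties going to the even neighbor. A rough sketch of that behavior for `f64` is shown below. The function name `round_even_sketch` is hypothetical, and the floor-and-compare approach is an assumption made for readability, not the algorithm the commit adds.

```rust
/// Hypothetical illustration of round-half-to-even for `f64`.
fn round_even_sketch(x: f64) -> f64 {
    // 2^52: every finite f64 at or above this magnitude is already an integer.
    const TWO_POW_52: f64 = 4503599627370496.0;

    // NaN and infinities pass through unchanged, as do large integral values.
    if !x.is_finite() || x.abs() >= TWO_POW_52 {
        return x;
    }

    let floor = x.floor();
    let diff = x - floor; // distance above the lower neighbor, in [0, 1]

    let rounded = if diff < 0.5 {
        floor
    } else if diff > 0.5 {
        floor + 1.0
    } else if floor % 2.0 == 0.0 {
        // Exact tie and the lower neighbor is even: keep it.
        floor
    } else {
        // Exact tie and the lower neighbor is odd: the upper neighbor is even.
        floor + 1.0
    };

    // Keep the sign of the input so that, e.g., -0.3 rounds to -0.0.
    rounded.copysign(x)
}
```

Under this sketch, `0.5` and `2.5` round down to `0.0` and `2.0`, while `1.5` and `3.5` round up to `2.0` and `4.0`. Recent Rust versions also provide `f64::round_ties_even` in the standard library, which can serve as a reference when checking a sketch like this.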