itsscb/rust - rust - Gitea: Git with a cup of tea

mirror of https://github.com/rust-lang/rust.git synced 2025-11-03 22:49:17 +00:00

Author	SHA1	Message	Date
Trevor Gross	a7cd13b6a3	Make `Float::exp` return an unsigned integer `exp` does not perform any form of unbiasing, so there isn't any reason it should be signed. Change this. Additionally, add `EPSILON` to the `Float` trait.	2025-01-22 07:15:39 +00:00
Trevor Gross	5ac2f99954	Shift then mask, rather than mask then shift This may allow for small optimizations with larger float types since `u32` math can be used after shifting. LLVM may be already getting this anyway.	2025-01-22 07:09:37 +00:00
Trevor Gross	186eac9227	Add `sqrtf16` and `sqrtf128` Use the generic algorithms to provide implementations for these routines.	2025-01-22 05:31:13 +00:00
Trevor Gross	03041a0371	Copy the u256 implementation from compiler_builtins	2025-01-22 05:31:13 +00:00
Trevor Gross	573ded2ee8	Port the most recent version of Musl's `sqrt` as a generic algorithm Musl commit 97e9b73d59 ("math: new software sqrt") adds a new algorithm using Goldschmidt division. Port this algorithm to Rust and make it generic, which shows a notable performance improvement over the existing algorithm. This also allows adding square root routines for `f16` and `f128`.	2025-01-22 05:31:13 +00:00
Trevor Gross	8927014e91	Enable `force-soft-floats` for extensive tests Any architecture-specific float operations are likely to consist of only a few instructions, but the softfloat implementations are much more complex. Ensure this is what gets tested.	2025-01-22 05:31:13 +00:00
Trevor Gross	9c98c46147	Don't set `opt_level` in the musl build script `cc` automatically reads this from Cargo's `OPT_LEVEL` variable so we don't need to set it explicitly. Remove this so running in a debugger makes more sense.	2025-01-22 05:22:12 +00:00
Trevor Gross	6ac9c14933	Add a retry to the musl download This download has occasionally been failing in CI recently. Add a retry so this is less likely to cause the workflow to fail.	2025-01-21 22:02:48 -05:00
Trevor Gross	b3d57f8c28	Remove trailing whitespace in scripts, run JuliaFormatter	2025-01-21 20:30:11 -05:00
Trevor Gross	e21618c73e	Ignore files relevant to benchmarking	2025-01-21 07:58:05 +00:00
Trevor Gross	d3328a0dab	Add a way to ignore benchmark regression checks Introduce a way to ignore the results of icount regression tests, by specifying `allow-regressions` in the pull request body. This should apply to both pull requests and the merges based on them, since `gh pr view` automatically handles both.	2025-01-21 07:58:05 +00:00
Trevor Gross	c5dc1b8ca0	Run wall time benchmarks with `--features force-soft-floats` Similar to changes for `icount` benchmarks, this ensures we aren't testing the throughput of architecture instructions.	2025-01-21 07:58:05 +00:00
Trevor Gross	ba0cfe58dd	Run icount benchmarks once with softfloat and once with hardfloat These benchmarks are fast to run, so the time cost here is pretty minimal. Running softfloat benchmarks just ensures that we don't e.g. test the performance of `_mm_sqrt_ss` rather than our implementation, and running without softfloat gives us a way to see the effect of arch intrinsics.	2025-01-21 07:58:05 +00:00
Trevor Gross	f9041943f1	Switch to the arm-linux runner and enable MPFR The free arm64 Linux runners are now available [1]. Switch to using this image in CI, and enable tests against MPFR since this is now a native platform. [1]: https://github.blog/changelog/2025-01-16-linux-arm64-hosted-runners-now-available-for-free-in-public-repositories-public-preview/	2025-01-20 16:28:39 -05:00
Trevor Gross	f39af6cb97	Remove the limit for querying a baseline `--limit=1` seems to apply before `jq` filtering, meaning our `WORKFLOW_NAME` ("CI") workflow may not appear in the input to the jq query. Removing `--limit` provides a default amount of inputs that jq can then filter from, so this works better.	2025-01-16 15:24:37 -06:00
Trevor Gross	3986206ce0	Add an xfail for recent ynf failures This failed a couple of times recently in CI, once on i686 and once on aarch64-apple: thread 'main' panicked at crates/libm-test/benches/random.rs:76:65: called `Result::unwrap()` on an `Err` value: ynf Caused by: 0: input: (681, 509.90924) (0x000002a9, 0x43fef462) expected: -3.2161271e38 0xff71f45b actual: -inf 0xff800000 1: mismatched infinities thread 'main' panicked at crates/libm-test/benches/random.rs:76:65: called `Result::unwrap()` on an `Err` value: ynf Caused by: 0: input: (132, 50.46604) (0x00000084, 0x4249dd3a) expected: -3.3364996e38 0xff7b02a5 actual: -inf 0xff800000 1: mismatched infinities Add a new override to account for this.	2025-01-16 09:47:00 +00:00
Trevor Gross	5139ba6f46	Reduce the warm up and measurement time for `short-benchmarks` The icount benchmarks are what we will be relying on in CI more than the existing benchmarks. There isn't much reason to keep these around, but there isn't much point in dropping them either. So, just reduce the runtime.	2025-01-16 09:07:46 +00:00
Trevor Gross	cdb1e680e0	Run iai-callgrind benchmarks in CI Add support in `ci-util.py` for finding the most recent baseline and downloading it, which new tests can then be compared against. Arbitrarily select nightly-2025-01-16 as the rustc version to pin to in benchmarks.	2025-01-16 09:07:46 +00:00
Trevor Gross	490ebbb187	Add benchmarks using iai-callgrind Running walltime benchmarks in CI is notoriously unstable, Introduce benchmarks that instead use instruction count and other more reproducible metrics, using `iai-callgrind` [1], which we are able to run in CI with a high degree of reproducibility. Inputs to this benchmark are a logspace sweep, which gives an approximation for real-world use, but may fail to indicate outlier cases. [1]: https://github.com/iai-callgrind/iai-callgrind	2025-01-16 09:07:19 +00:00
Trevor Gross	f56b41dbbd	Provide a way to override iteration count Benchmarks need a way to limit how many iterations get run. Introuce a way to inject this information here.	2025-01-16 08:53:50 +00:00
Trevor Gross	17c86e4e7f	Increase the CI timeout	2025-01-16 01:10:26 +00:00
Trevor Gross	ecca4879a2	Adjust precision and add xfails based on new tests	2025-01-16 01:10:26 +00:00
Trevor Gross	2d857e1c21	Replace `HasDomain` to enable multi-argument edge case and domain tests This also allows reusing the same generator logic between logspace tests and extensive tests, so comes with a nice bit of cleanup. Changes: * Make the generator part of `CheckCtx` since a `Generator` and `CheckCtx` are almost always passed together. * Rename `domain_logspace` to `spaced` since this no longer only operates within a domain and we may want to handle integer spacing. * Domain is now calculated at runtime rather than using traits, which is much easier to work with. * With the above, domains for multidimensional functions are added. * The extensive test generator code tests has been combined with the domain_logspace generator code. With this, the domain tests have just become a subset of extensive tests. These were renamed to "quickspace" since, technically, the extensive tests are also "domain" or "domain logspace" tests. * Edge case generators now handle functions with multiple inputs. * The test runners can be significantly cleaned up and deduplicated.	2025-01-16 01:10:26 +00:00
Trevor Gross	45e3b98165	Add an override for a recent failure Failed on i686: ──── STDERR: libm-test::bench/random y1f/crate thread 'main' panicked at crates/libm-test/benches/random.rs:76:65: called `Result::unwrap()` on an `Err` value: ynf Caused by: 0: input: (213, 109.15641) (0x000000d5, 0x42da5015) expected: -3.3049217e38 0xff78a27a actual: -inf 0xff800000 1: mismatched infinities	2025-01-15 01:05:38 +00:00
Trevor Gross	b251f74843	Pass --max-fail to nextest so it doesn't fail fast	2025-01-15 00:57:23 +00:00
Trevor Gross	f63ef37218	Slightly restructure `ci/calculate-exhaustive-matrix.py` Change this script into a generic CI utility that we will be able to expand in the future.	2025-01-15 00:57:23 +00:00
Trevor Gross	5e65179a39	Change `.yml` files to the canonical extension `.yaml`	2025-01-14 03:24:14 +00:00
Trevor Gross	26df5d6689	Use cargo-nextest for running tests in CI The test suite for this repo has quite a lot of tests, and it is difficult to tell which contribute the most to the long CI runtime. libtest does have an unstable flag to report test times, but that is inconvenient to use because it needs to be passed only to libtest binaries. Switch to cargo-nextest [1] which provides time reporting and, overall, a better test UI. It may also improve test runtime, though this seems unlikely since we have larger test binaries with many small tests (nextest benefits the most when there are larger binaries that can be run in parallel). For anyone running locally without, `run.sh` should still fall back to `cargo test` if `cargo-nextest` is not available. This diff includes some cleanup and consistency changes to other CI-related files. [1]: https://nexte.st	2025-01-13 21:32:54 -05:00
quaternic	bfbe919adf	Simplify and optimize `fdim` (#442 ) The cases with NaN arguments can be handled by the same x - y expression, and this generates much better code: https://godbolt.org/z/f3rnT8jx4.	2025-01-14 01:55:26 +00:00
Trevor Gross	bcd9d8a5c3	Reduce indentation in `run.sh` using early return	2025-01-13 23:01:48 +00:00
Trevor Gross	fd7a45f7f6	Don't set `codegen-units=1` by default in CI We can set this only for the release profile, there isn't any reason to have it set for debug tests.	2025-01-13 23:01:44 +00:00
Trevor Gross	13b5bf3959	Add `fdimf16` and `fdimf128` Use the generic algorithms to provide implementations for these routines.	2025-01-13 14:04:54 +00:00
Trevor Gross	0f285df716	Add a generic version of `fdim`	2025-01-13 13:49:46 +00:00
Trevor Gross	23dfe62a9d	Format the MPFR manual implementation list	2025-01-13 06:22:25 -05:00
Trevor Gross	23c9f74b58	Disable `util` and `libm-macros` for optimized tests These crates take time building in CI, especially with the release profile having LTO enabled, but there isn't really any reason to test them with different features or in release mode. Disable this to save some CI runtime.	2025-01-13 05:54:43 -05:00
Trevor Gross	b558b365d3	Add `truncf16` and `truncf128` Use the generic algorithms to provide implementations for these routines.	2025-01-13 10:12:09 +00:00
Trevor Gross	6ac06a97e5	Add a generic version of `trunc` The algorithm is identical for both types, so this is a straightforward routine to port.	2025-01-13 09:59:20 +00:00
Trevor Gross	dfa694a8e4	Add a utility crate for quick evaluation Introduce a simple binary that can run arbitrary input against any of the available implementations (musl, MPFR, our libm). This provides an easy way to check results, or run specific cases against a debugger. Examples: $ cargo run -p util -- eval libm pow 1.6 2.4 3.089498284311124 $ cargo run -p util -- eval mpfr pow 1.6 2.4 3.089498284311124 $ cargo run -p util -- eval musl tgamma 1.2344597839132 0.9097442657960874 $ cargo run -p util -- eval mpfr tgamma 1.2344597839132 0.9097442657960874 $ cargo run -p util -- eval libm tgamma 1.2344597839132 0.9097442657960871 $ cargo run -p util -- eval musl sincos 3.1415926535 (8.979318433952318e-11, -1.0)	2025-01-13 04:43:48 -05:00
Trevor Gross	3de783c40f	Enable `build-mpfr` and `build-musl` by default Most users who are developing this crate are likely running on a Unix system, since there isn't much to test against otherwise. For convenience, enable the features required to run these tests by default.	2025-01-13 03:53:46 +00:00
Trevor Gross	504616d5a0	Rename the `test-multiprecision` feature to `build-mpfr` Currently the features that control what we test against are `build-musl` and `test-multiprecision`. I didn't name them very consistently and there isn't really any reason for that. Rename `test-multiprecision` to `build-mpfr` to better reflect what it actually does and to be more consistent with `build-musl`.	2025-01-13 03:25:19 +00:00
Hanna Kruppe	87cc064e35	Introduce arch::aarch64 and use it for rint{,f}	2025-01-12 11:26:09 +01:00
Hanna Kruppe	7defd9b429	Use wasm32 arch intrinsics for rint{,f}	2025-01-12 11:25:51 +01:00
Trevor Gross	b50ea809ff	Add a new precision adjustment for i586 `exp2f` There was a recent failure from the random tests: ---- mp_random_exp2f stdout ---- Random Mpfr exp2f arg 1/1: 10000 iterations (10000 total) using `LIBM_SEED=fqgMuzs6eqH1VZSEmQpLnThnaIyRUOWe` thread 'mp_random_exp2f' panicked at crates/libm-test/tests/multiprecision.rs:41:49: called `Result::unwrap()` on an `Err` value: input: (127.97238,) (0x42fff1dc,) expected: 3.3383009e38 0x7f7b2556 actual: inf 0x7f800000 Caused by: mismatched infinities Add an xfail for mismatched infinities on i586.	2025-01-12 03:48:13 -05:00
Trevor Gross	228c716a43	Add a new precision adjustment for i586 `rint` `rint` had a couple recent failures from the random tests: ---- mp_random_rint stdout ---- Random Mpfr rint arg 1/1: 10000 iterations (10000 total) using `LIBM_SEED=Fl1f69DaJnwkHN2FeuCXaBFRvJYsPvEY` thread 'mp_random_rint' panicked at crates/libm-test/tests/multiprecision.rs:41:49: called `Result::unwrap()` on an `Err` value: input: (-849751480.5001163,) (0xc1c95316dc4003d0,) expected: -849751481.0 0xc1c95316dc800000 actual: -849751480.0 0xc1c95316dc000000 Caused by: ulp 8388608 > 100000 And: ---- mp_random_rint stdout ---- Random Mpfr rint arg 1/1: 10000 iterations (10000 total) using `LIBM_SEED=XN7VCGhX3Wu6Mzn8COvJPITyZlGP7gN7` thread 'mp_random_rint' panicked at crates/libm-test/tests/multiprecision.rs:41:49: called `Result::unwrap()` on an `Err` value: input: (-12493089.499809155,) (0xc167d4242ffe6fc5,) expected: -12493089.0 0xc167d42420000000 actual: -12493090.0 0xc167d42440000000 Caused by: ulp 536870912 > 100000 It seems we just implement an incorrect rounding mode. Replace the existing `rint` override with an xfail if the difference is 0.0 <= ε <= 1.0.	2025-01-12 03:48:13 -05:00
Trevor Gross	5b1c1fca03	Expose C versions of `libm` functions in the `cb` crate `compiler_builtins` exposes an `extern "C"` version of `libm` routines, so add the same here. There really isn't much to test here (unless we later add tests against C `libm` suites), but one nice benefit is this gives us a library with unmangled names that is easy to `objdump`. In accordance with that, also update `cb` to be a `staticlib`. Unfortunately this also means we have to remove it from the workspace, since Cargo doesn't allow setting `panic = "abort"` for a single crate.	2025-01-12 03:29:40 -05:00
Trevor Gross	2912076e50	Always use the same seed for benchmarking It would be preferable to switch to a different generator, or at least set the seed within the benchmark, but this is the most straightforward way to make things simple.	2025-01-11 23:36:11 -05:00
Trevor Gross	e1749bdb6d	Add `biteq` and `exp_unbiased` to `Float` These are two convenience methods. Additionally, add tests for the trait methods, and an `assert_biteq!` macro to check and print the output.	2025-01-11 22:58:00 -05:00
Trevor Gross	5c94cce6b2	Add a `release-checked` profile with debug and overflow assertions A failing debug assertion or overflow without correctly wrapping or saturating is a bug, but the `debug` profile that has these enabled does not run enough test cases to hit edge cases that may trigger these. Add a new `release-checked` profile that enables debug assertions and overflow checks. This seems to only extend per-function test time by a few seconds (or around a minute on longer extensive tests), so enable this as the default on CI. In order to ensure `no_panic` still gets checked, add a build-only step to CI.	2025-01-11 20:35:30 -05:00
Trevor Gross	721960c172	Remove `ExpInt` from `Float`, always use `i32` instead `ExpInt` is likely to only have performance benefits on 16-bit platforms, but makes working with the exponent more difficult. It seems like a worthwhile tradeoff to instead just use `i32`, so do that here.	2025-01-11 19:22:01 -05:00
Trevor Gross	5e13eeca01	Reorder tests in `run.sh` I do not believe Cargo separately caches crates with different sets of features enabled. So, ensuring that tests run with `unstable-intrinsics` are always grouped should slightly reduce runtime. As an added benefit, all the debug mode tests run first so initial feedback is available faster.	2025-01-11 18:48:08 -05:00

1 2 3 4 5 ...

592 Commits