Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add array_chunks #1023

Open
wants to merge 8 commits into
base: master
Choose a base branch
from
Open

Conversation

ronnodas
Copy link
Contributor

@ronnodas ronnodas commented Mar 2, 2025

Added an implementation of arrays based on the work in next_array(). Currently the N = 0 produces a post-monomorphization error, see the discussion in #1012.

Some other choices that I'm not sure about:

  • If the number of items yielded by the input iterator is not a multiple of N then there is exactly one allocation the first time next() returns None. I don't think this is a performance issue but this means the method has to be gated behind use_alloc. The alternative would be something like implementing IntoIterator for ArrayBuilder, which seems possible but the safety invariants get more complicated. A middle ground would be to only gate the remainder() method, but this seems worse.
  • Should ArrayChunks::remainder() return a custom type, so that it can be ExactSizeIterator?
  • Is it okay to not specify the behavior if the input iterator is not fused? I think the current implementation behaves slightly differently depending on whether the None is on a multiple of N.

Copy link

codecov bot commented Mar 2, 2025

Codecov Report

Attention: Patch coverage is 91.25000% with 7 lines in your changes missing coverage. Please review.

Project coverage is 94.32%. Comparing base (6814180) to head (db21b2f).
Report is 137 commits behind head on master.

Files with missing lines Patch % Lines
src/arrays.rs 82.92% 7 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1023      +/-   ##
==========================================
- Coverage   94.38%   94.32%   -0.07%     
==========================================
  Files          48       51       +3     
  Lines        6665     6963     +298     
==========================================
+ Hits         6291     6568     +277     
- Misses        374      395      +21     

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

ronnodas added 3 commits March 3, 2025 00:28
Iterator::array_chunks exists on nightly and it's better to avoid the name collision especially if and when that's stabilized
@ronnodas
Copy link
Contributor Author

ronnodas commented Mar 2, 2025

Re the MSRV failure, I'm not sure what the idiomatic way is to trigger a PME on 1.63 based on N > 0.

@jswrenn
Copy link
Member

jswrenn commented Mar 3, 2025

Re the MSRV failure, I'm not sure what the idiomatic way is to trigger a PME on 1.63 based on N > 0.

You can do it like this: https://play.rust-lang.org/?version=stable&mode=debug&edition=2024&gist=233fb2f6ca3df52d15e3df96b8c4cb61

src/arrays.rs Outdated
Comment on lines 88 to 102
/// Effectively assert!(N > 0) post-monomorphization
fn assert_positive<const N: usize>() {
trait StaticAssert<const N: usize> {
const ASSERT: bool;
}

impl<const N: usize> StaticAssert<N> for () {
const ASSERT: bool = {
assert!(N > 0);
true
};
}

assert!(<() as StaticAssert<N>>::ASSERT);
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Rust doesn't provide stacktraces for monomorphization errors, so folks will only see that an error occured in assert_positive — and not the code that actually required the assertion.

Define this as a macro and call that instead.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Is the change in db21b2f what you meant?

/// assert_eq!(Some([]), it.next());
/// ```
#[cfg(feature = "use_alloc")]
fn arrays<const N: usize>(self) -> Arrays<Self, N>
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO array_chunks is the better name, since it differentiates it from array_windows.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

To add context, I was trying to avoid triggering the unstable-name-collisions lint, because of rust-lang/rust#100450. Should I change it back to array_chunks?

Comment on lines +9 to +12
pub struct Arrays<I: Iterator, const N: usize> {
iter: I,
partial: Vec<I::Item>,
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It strikes me as a code smell that N doesn't appear in the definition here. Shouldn't partial be an ArrayBuilder<T, N>?

Copy link
Contributor Author

@ronnodas ronnodas Mar 5, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Making it ArrayBuilder<T, N> a priori makes sense, but I had trouble with the bounds for the impl Clone in that case. Note that MaybeUninit<T>: Clone requires T: Copy.

Maybe it could also be array::IntoIter<T, N> or a hypothetical ArrayBuilder::IntoIter<T, N> (this is close to what the unstable Iterator::ArrayChunks does) but I think the unsafe code for that would need to be more careful.

The state you need to keep track of doesn't really depend on N (except partial.len() <= N but this is sort of incidental). However Arrays needs to have N as a parameter for Arrays::Item to be well-defined.

Suggestions?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants