Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Tracking Issue for slice::array_chunks #74985

Open
5 tasks
lcnr opened this issue Jul 31, 2020 · 101 comments
Open
5 tasks

Tracking Issue for slice::array_chunks #74985

lcnr opened this issue Jul 31, 2020 · 101 comments
Labels
A-array Area: `[T; N]` A-const-generics Area: const generics (parameters and arguments) A-slice Area: `[T]` C-tracking-issue Category: An issue tracking the progress of sth. like the implementation of an RFC disposition-merge This issue / PR is in PFCP or FCP with a disposition to merge it. F-generic_const_exprs `#![feature(generic_const_exprs)]` final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. Libs-Tracked Libs issues that are tracked on the team's project board. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue.

Comments

@lcnr
Copy link
Contributor

lcnr commented Jul 31, 2020

The feature gate for the issue is #![feature(array_chunks)].

Also tracks as_(r)chunks(_mut) in #![feature(slice_as_chunks)].

About tracking issues

Tracking issues are used to record the overall progress of implementation.
They are also uses as hubs connecting to other relevant issues, e.g., bugs or open design questions.
A tracking issue is however not meant for large scale discussion, questions, or bug reports about a feature.
Instead, open a dedicated issue for the specific matter and add the relevant feature gate label.

Steps

Unresolved Questions

Implementation history

@lcnr lcnr added C-tracking-issue Category: An issue tracking the progress of sth. like the implementation of an RFC A-const-generics Area: const generics (parameters and arguments) F-const_generics `#![feature(const_generics)]` T-libs-api Relevant to the library API team, which will review and decide on the PR/issue. labels Jul 31, 2020
@KodrAus KodrAus added Libs-Tracked Libs issues that are tracked on the team's project board. A-slice Area: `[T]` labels Aug 6, 2020
bors added a commit to rust-lang-ci/rust that referenced this issue Sep 12, 2020
Add `slice::array_chunks_mut`

This follows `array_chunks` from rust-lang#74373 with a mutable version, `array_chunks_mut`. The implementation is identical apart from mutability. The new tests are adaptations of the `chunks_exact_mut` tests, plus an inference test like the one for `array_chunks`.

I reused the unstable feature `array_chunks` and tracking issue rust-lang#74985, but I can separate that if desired.

r? `@withoutboats`
cc `@lcnr`
tmandry added a commit to tmandry/rust that referenced this issue Sep 16, 2020

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
Add array_windows fn

This mimicks the functionality added by array_chunks, and implements a const-generic form of
`windows`. It makes egregious use of `unsafe`, but by necessity because the array must be
re-interpreted as a slice of arrays, and unlike array_chunks this cannot be done by casting the
original array once, since each time the index is advanced it needs to move one element, not
`N`.

I'm planning on adding more tests, but this should be good enough as a premise for the functionality.
Notably: should there be more functions overwritten for the iterator implementation/in general?

~~I've marked the issue as rust-lang#74985 as there is no corresponding exact issue for `array_windows`, but it's based of off `array_chunks`.~~

Edit: See Issue rust-lang#75027 created by @lcnr for tracking issue

~~Do not merge until I add more tests, please.~~

r? @lcnr
JohnTitor added a commit to JohnTitor/rust that referenced this issue Oct 26, 2020

Verified

This commit was created on GitHub.com and signed with GitHub’s verified signature. The key has expired.
…lbertodt

Add [T]::as_chunks(_mut)

Allows getting the slices directly, rather than just through an iterator as in `array_chunks(_mut)`.  The constructors for those iterators are then written in terms of these methods, so the iterator constructors no longer have any `unsafe` of their own.

Unstable, of course. rust-lang#74985
@joshlf
Copy link
Contributor

joshlf commented Feb 10, 2021

Is there any intention of providing the reverse versions of these functions?

@thomcc
Copy link
Member

thomcc commented Apr 25, 2021

Is there any intention of providing the reverse versions of these functions?

What would that look like?

@joshlf
Copy link
Contributor

joshlf commented Apr 26, 2021

Same API, but iterating backwards. I have a use case in which I need to convert either a prefix or a suffix of a slice to an array reference. This API is perfect for the prefix use case, and I'd love to be able to use it for the suffix use case as well.

@newpavlov
Copy link
Contributor

@joshlf
Both slice of arrays and custom iterator type would implement the DoubleEndedIterator, so your use case does not need a separate reverse method.

@joshlf
Copy link
Contributor

joshlf commented Apr 26, 2021

That's only true if N evenly divides the length of the slice (see array_chunks docs). Otherwise, the remaining elements are only accessible via the remainder method, and won't be iterated over in the DoubleEndedIterator implementation.

@newpavlov
Copy link
Contributor

Ah, I misunderstood your use case, you want [0, 1, 2, 3, 4, 5, 6] to be split into [4, 5, 6], [1, 2, 3], [0], not into [6], [3, 4, 5], [0, 1, 2]. It indeed requires a separate method.

@joshlf
Copy link
Contributor

joshlf commented Apr 26, 2021

Yep, exactly 😃

@thomcc
Copy link
Member

thomcc commented Apr 27, 2021

Yeah, array_rchunks is definitely worth having.

@sffc
Copy link

sffc commented May 12, 2021

Also useful would be the inverse operation: [[0, 1, 2], [3, 4, 5]] to [0, 1, 2, 3, 4, 5]. (from &[[T; 3]] to &[T])

@ChayimFriedman2
Copy link
Contributor

Do we need to wait until const_evaluatable_unchecked is stabilized so we can deny N = 0 at compile time to stabilize this feature? Perhaps we can just mention in the documentation that currently it panics but it may be uplifted to a compile time error in the future.

@fzyzcjy
Copy link

fzyzcjy commented Oct 7, 2021

Hi, is there any alternatives before this is stable? Thanks!

@CryZe
Copy link
Contributor

CryZe commented Oct 7, 2021

Hi, is there any alternatives before this is stable? Thanks!

Yeah bytemuck::cast_slice and bytemuck::cast_slice_mut can do mostly the same (with a Pod requirement though)

@fzyzcjy
Copy link

fzyzcjy commented Oct 7, 2021

Thanks!

@Kixunil
Copy link
Contributor

Kixunil commented Mar 26, 2025

But panicking on a const generic, in a way that's "well if you ever run the monomorphization you'll see it", is way less of an issue.

That is certainly true in the vacuum. My story gives some perspective on what happens if it becomes a common pattern. The thing is, in that example all relevant values were hard-coded. If they were wrong they'd also always panic. But the issue is we got used to those conversions being the standard so much that a wrong conversion looked like the right one.

To map it to this method, one way this could go wrong is if people indeed started using if N == 0 often they might get so used to it that one day they don't notice that the check is if n == 0 and n is untrusted, so setting it to 0 is a DoS exploit. If used in a nested generic function it'd not emit any warning. And maybe for most, if not all, the people there's a more appropriate solution than doing that check but because the check is easy they weren't force to a more creative/principled solution. Just like we've completely missed that get_array::<const LEN: usize>(offset: usize) -> Option<&[T; LEN]> in an extension trait is possible and a good idea.

@RalfJung
Copy link
Member

RalfJung commented Mar 27, 2025

I thought the current policy was to always do FCP on the stabilization PR, not the tracking issue?

In this case, I think const-stabilizing these functions will require const-stabilizing an intrinsic (exact_div), which requires t-lang fcp. Preparing the stabilization PR would uncover such points so we don't need to wait for a second round of FCP.

@traviscross
Copy link
Contributor

I thought the current policy was to always do FCP on the stabilization PR, not the tracking issue?

It was reconsidered in the last libs-api meeting and decided that if that were to be changed, other related changes would need to be made to facilitate it, such as moving the discussions off of the tracking issues (or some other solution) so that the conversations would not be split between the tracking issue and the stabilization PR. It was mentioned that libs-api had previously done the FCP on the stabilization PR and that the experience was considered to have been a failure, leading to the process of doing it on the tracking issue. So the reasons behind the previous experience of failure with the process need to be explored and the causes resolved.

The divergence in experience is quite interesting, as on the lang side we've found doing the stabilization FCP on the stabilization PR to be a significant process improvement.

In this case, I think const-stabilizing these functions will require const-stabilizing an intrinsic (exact_div), which requires t-lang fcp. Preparing the stabilization PR would uncover such points so we don't need to wait for a second round of FCP.

Agreed. Similarly, apropos of @scottmcm's question, doing it on the stabilization PR helps to make the exact scope of the stabilization more clear.

@traviscross
Copy link
Contributor

traviscross commented Mar 28, 2025

In this case, I think const-stabilizing these functions will require const-stabilizing an intrinsic (exact_div), which requires t-lang fcp.

On this matter, of course, whoever is handling the stabilization, please put up a PR to stabilize the intrinsics and lang-nominate that. We could make our FCP contingent on libs-api deciding to stabilize the functions here so the FCPs could run in parallel.

@RalfJung
Copy link
Member

@rust-lang/libs-api it would be really good to get clarity on what exact set of functions and types is being proposed for stabilization here, as currently it is unclear.

@scottmcm
Copy link
Member

I put up and FCP for exact_div in #139163 -- I think we might as well just const-stabilize it, because it's not doing anything scary in the first place, and it's not like we can't still delete the intrinsic later if it comes to that anyway.

@RalfJung
Copy link
Member

It was reconsidered in the last libs-api meeting and decided that if that were to be changed, other related changes would need to be made to facilitate it, such as moving the discussions off of the tracking issues (or some other solution) so that the conversations would not be split between the tracking issue and the stabilization PR. It was mentioned that libs-api had previously done the FCP on the stabilization PR and that the experience was considered to have been a failure, leading to the process of doing it on the tracking issue. So the reasons behind the previous experience of failure with the process need to be explored and the causes resolved.

The divergence in experience is quite interesting, as on the lang side we've found doing the stabilization FCP on the stabilization PR to be a significant process improvement.

Yeah that is somewhat surprising. In particular with const stabilization, it can be hard to predict what exactly needs to be stabilized for this to work, such as extra intrinsics or some use of #[rustc_allow_const_fn_unstable]. So even if the FCP happens on the tracking issue, I would recommend a requirement that the stabilization PR must already exist so there is a clear idea of what is being proposed for stabilization.

@Amanieu
Copy link
Member

Amanieu commented Mar 31, 2025

@rust-lang/libs-api it would be really good to get clarity on what exact set of functions and types is being proposed for stabilization here, as currently it is unclear.

It wasn't discussed in the detail in the meeting, we mostly focused on the N == 0 issue. I'll bring it up again in tomorrow's meeting.

@Amanieu
Copy link
Member

Amanieu commented Apr 1, 2025

@rfcbot cancel

@rfcbot
Copy link
Collaborator

rfcbot commented Apr 1, 2025

@Amanieu proposal cancelled.

@rfcbot rfcbot removed final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. disposition-merge This issue / PR is in PFCP or FCP with a disposition to merge it. labels Apr 1, 2025
@Amanieu
Copy link
Member

Amanieu commented Apr 1, 2025

In the @rust-lang/libs-api meeting today we discussed the specific methods that we are considering stabilizing. We decided to only stabilize the as_chunks methods which return slices and leave the methods returning an iterator as unstable.

Methods being stabilized are:

impl [T] {
    const fn as_chunks<const N: usize>(&self) -> (&[[T; N]], &[T]);
    const fn as_rchunks<const N: usize>(&self) -> (&[T], &[[T; N]]);
    const unsafe fn as_chunks_unchecked<const N: usize>(&self) -> &[[T; N]];
    const fn as_chunks_mut<const N: usize>(&mut self) -> (&mut [[T; N]], &mut [T]);
    const fn as_rchunks_mut<const N: usize>(&mut self) -> (&mut [T], &mut [[T; N]]);
    const unsafe fn as_chunks_unchecked_mut<const N: usize>(&mut self) -> &mut [[T; N]];
}

@rfcbot merge

@rfcbot
Copy link
Collaborator

rfcbot commented Apr 1, 2025

Team member @Amanieu has proposed to merge this. The next step is review by the rest of the tagged team members:

No concerns currently listed.

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

See this document for info about what commands tagged team members can give me.

@rfcbot rfcbot added proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. disposition-merge This issue / PR is in PFCP or FCP with a disposition to merge it. labels Apr 1, 2025
@Amanieu Amanieu removed the I-libs-api-nominated Nominated for discussion during a libs-api team meeting. label Apr 1, 2025
@BurntSushi
Copy link
Member

The docs for these functions currently state:

This check will most probably get changed to a compile time error before this method gets stabilized.

That should probably get fixed before stabilization or as part of stabilization.

@rfcbot rfcbot added final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. and removed proposed-final-comment-period Proposed to merge/close by relevant subteam, see T-<team> label. Will enter FCP once signed off. labels Apr 1, 2025
@rfcbot
Copy link
Collaborator

rfcbot commented Apr 1, 2025

🔔 This is now entering its final comment period, as per the review above. 🔔

@joboet
Copy link
Member

joboet commented Apr 1, 2025

The docs for these functions currently state:

This check will most probably get changed to a compile time error before this method gets stabilized.

That should probably get fixed before stabilization or as part of stabilization.

Couldn't we also remove the panic for these methods and return the entire slice as remainder if N is zero?

@cuviper
Copy link
Member

cuviper commented Apr 1, 2025

Couldn't we also remove the panic for these methods and return the entire slice as remainder if N is zero?

That's possible, but this statement in the docs would have to make an exception on the "strictly less" part:

Splits the slice into a slice of N-element arrays, starting at the beginning of the slice, and a remainder slice with length strictly less than N.

@scottmcm
Copy link
Member

scottmcm commented Apr 2, 2025

Couldn't we also remove the panic for these methods and return the entire slice as remainder if N is zero?

We could do lots of things, but the big question is whether the alternatives are more useful than they are surprising. Like #74985 (comment) discussed another option above.

In general, I think its good when we panic because something cannot meet the natural post-condition. It's always possible to expand a post-condition to make a panic unnecessary, but that has its own problems when the programmer ends up trying to figure out why something so weird is happening in an edge case rather than getting a really obvious failure from the «sorry, I cannot do that» panic.

Does someone calling .as_chunks() ever want the whole thing to be in the remainder? None of my uses for as_chunks that I can think of ever wanted that. And it breaks the pattern that held for everything else, where as you use a smaller and smaller chunk size the remainder gets shorter and shorter, or where as the input gets longer the remainder gets longer until you fill a chunk and the remainder goes back to zero. Said otherwise, I can't imagine anyone ever calling .as_chunks::<0>() to get that behaviour, and if it happens in generic code it feels more likely that the generic code will do something wrong with the result than be happy with that result -- for example, maybe they notice the input length is longer than the chunk size so of course they can look at the first chunk, but that's not true if it just all got put in the remainder. But we could fix that by returning not (&[], input) but (&[[]; usize::MAX], input)! So would that be better? In some ways I think it is, since now you always have foo.as_chunks::<N>().0.len() >= foo.as_chunks::<N+1>().0.len() again, which is a good property. But is it what people actually want? No, I still don't think it's better than a panic.

It's like how we could say that 0 / 0 is 0 in rust, and that would even be handy for some people sometimes, but we don't and it panics. Or how we could say that indexing has wrapping behaviour, which would be handy for some people sometimes, but we don't and it panics for too-large indices. Or we could have said that 0.ilog2() returns -1 as Self when overflow checking is off, which is in some sense a very natural result, but we didn't and it always panics. Or how originally .step_by(0) repeated the first item forever, but before it was stabilized that was changed, so now that just always panics.

@Kixunil
Copy link
Contributor

Kixunil commented Apr 2, 2025

I'd like to note that my request for explaining why take the less conservative approach and which real-world code needs to check N wasn't addressed yet. I don't find the responses so far satisfying because they do not include realistic code which needs the panic rather than compile error.

To put it differently, I don't have a problem with panics if some code actually needs them but I highly doubt there is any such production code. A single example would convince me that panic is better.

@joboet
Copy link
Member

joboet commented Apr 2, 2025

But we could fix that by returning not (&[], input) but (&[[]; usize::MAX], input)!

I didn't think about the length of the slice being the (mathematically undefined) dividend – I guess my mind got these methods mixed up with split_first_chunk... please ignore my previous comment.

@BurntSushi
Copy link
Member

I'd like to note that #74985 (comment) for explaining why take the less conservative approach and which real-world code needs to check N wasn't addressed yet.

I personally didn't respond because to me, examples aren't what convinces me to prefer panics. Post-mono errors are what convinces me to prefer panics. (Because I think post-mono errors should be avoided at almost all costs.)

@scottmcm
Copy link
Member

scottmcm commented Apr 2, 2025

I guess my mind got these methods mixed up with split_first_chunk...

I'll just second this point, actually, because I think it's a good one. I used to write .as_chunks().0[0] in examples because there wasn't a .first_chunk().unwrap() so it was by far the easiest way to get an array. But thankfully we have better ways than that horrible snippit now 🙂

the length of the slice being the (mathematically undefined) dividend

A really interesting (but slightly off-topic) thing this makes me think about: Okasaki chapter 9 is about numerical representations, and the equivalence between a way of writing a number and a container of that length. The simplest example is that a linked list is a unary representation of its own length. The book does a neat job of extending that to other things too, like what the container equivalent of a binary number would be -- and how the push/concat implementation on the data structure ends up with the same complexity as the increment/addition operator on a binary number.

So from that perspective, &[T] is a number, and first_chunk is checked_sub on that number, as_chunks is /+% on that number, and as_flattened is *-by-a-constant on it.

@joboet
Copy link
Member

joboet commented Apr 2, 2025

A really interesting (but slightly off-topic) thing this makes me think about: Okasaki chapter 9 is about numerical representations, and the equivalence between a way of writing a number and a container of that length. The simplest example is that a linked list is a unary representation of its own length. The book does a neat job of extending that to other things too, like what the container equivalent of a binary number would be -- and how the push/concat implementation on the data structure ends up with the same complexity as the increment/addition operator on a binary number.

So from that perspective, &[T] is a number, and first_chunk is checked_sub on that number, as_chunks is /+% on that number, and as_flattened is *-by-a-constant on it.

This is getting very off-topic, but this equivalence is what I found so incredibly fascinating about lambda calculus. It's taught me that data is defined by how you interact with it – and these interactions are basically nothing but folds.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-array Area: `[T; N]` A-const-generics Area: const generics (parameters and arguments) A-slice Area: `[T]` C-tracking-issue Category: An issue tracking the progress of sth. like the implementation of an RFC disposition-merge This issue / PR is in PFCP or FCP with a disposition to merge it. F-generic_const_exprs `#![feature(generic_const_exprs)]` final-comment-period In the final comment period and will be merged soon unless new substantive objections are raised. Libs-Tracked Libs issues that are tracked on the team's project board. T-libs-api Relevant to the library API team, which will review and decide on the PR/issue.
Projects
None yet
Development

No branches or pull requests