-
-
Notifications
You must be signed in to change notification settings - Fork 379
Description
Summary:
I'm not sure where to post this as it's a Stan wide thing. Currently int values are just basic C int types with values from +/-2,147,483,647. Recently @bob-carpenter had a model with 1M parameters and it had me thinking that it's more realistic that we start having users try larger data sets in the future. For reference, a 10 column matrix of doubles with rows of length max int would be about 172 GB (8 * 2,147,483,647 * 10). Single machines are capable now of having 1TB of memory. While I think we are a few years away from having to worry about this it's something we should think about a little.
I think there's a few options for us
- Have a
stan::indexalias that we use throughout all the repos. We can set this tolong int,unsigned int, or whatever we want. (Eigen::Indexislong intby default) - Add a
long intto the language and have all the code that accepts integers be templated or useautoto deduce the int type. This is more work but lets users keep using int types without worry any loss in speed or memory size from larger integral types.
I'm personally fine with both. The only thing I feel strongly about is that we should keep using a signed type as the default for any index type
Current Version:
v2.34.1