-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
mathematically defining transforms #36
Comments
In general I like this formulation, but I had some thoughts (sorry this is long):
|
I would love to make this explicit in a way that I can understand. My measure theory is weak past the basic definitions of sigma algebras and Lebesgue integration. The uniformity we want is for all I don't know how to say that when we're dealing with unconstrained spaces that lead to improper "densities".
Thanks. We should be as mathematically precise as possible without turning this into an impenetrable measure theory paper. In practice, we run into numerical problems even in simple positive-constraining transforms like exp() which quickly overflow to +infinity and underflow to zero.
Isn't it only bijective with the constrained space? The Jacobian is from the Cholesky factor with
What does super-diagonal mean?
I was trying to lay out the general approach that way. Do you think we should just skip the usual direction of transform and inverse when talking about changes of variables? We can do that, but we probably want a footnote or something to connect the dots, because every stats presentation I've ever seen does it the way I wrote it in the Stan reference manual. |
similar thing with alr too right? we don't actually take the Nth dimension in the jacobian. I'm not sure about how would one go about communicating that idea, in a general way |
The paper's getting a bit out of hand with everyone writing in their own style. I'm trying to work with Meenal now to strip out most of what's there and then build up a draft with coherent notation. As such, the first thing we need to do is define transforms. To me, the goal of the whole business is that when we sample on the unconstrained scale and transform, we get a uniform distribution on the constrained scale. So informally, we want something like this.
The reason this definition doesn't work is that$p_Y$ may be improper, so it doesn't really make sense to talk about $Y \sim p_Y(\cdot)$ . So instead, I think we want this:
We can then go on and define the special case of one-one changes of variables.
We can then move on to the example of simplexes, where we are defining a transform from$\mathbb{R}^N$ to $\Delta^N \subseteq \mathbb{R}^{N+1}$ by setting element $N + 1$ to $1$ minus the sum of the first $N$ elements. Another example is going from $\binom{M}{2} + M$ to $M \times M$ dimensions with the Cholesky factor of covariance matrix transform by multiplying by itself transposed.
The text was updated successfully, but these errors were encountered: