Conversation
|
My worry (which was also expressed in issues such as #968) is that numerical integration is generally challenging and a fallback might lead to silently incorrect results. Such a fallback would be wrong (or at least problematic) e.g. if the moments are not finite. So my general feeling is that numerical integration should maybe be restricted to a smaller subset of distributions, or perhaps only be available as a separate function. If we do want to use it more broadly, I think it would also be safer to error when the integration error estimate is too large, to reduce the probability of silently incorrect results. |
|
@devmotion Is there any numerical integration in this code? |
|
@devmotion, perhaps you're thinking of #1875? As @PaulSoderlind noted, there's no integration here. |
Codecov Report: All modified and coverable lines are covered by tests ✅
Additional details and impacted files:

```
@@            Coverage Diff             @@
##           master    #1874      +/-   ##
==========================================
- Coverage   85.99%   85.71%   -0.29%
==========================================
  Files         144      145       +1
  Lines        8666     8706      +40
==========================================
+ Hits         7452     7462      +10
- Misses       1214     1244      +30
```
|
|
Oh yes, indeed, it seems I commented on the wrong PR. |
|
CI failures on nightly are unrelated. |
src/truncated/lognormal.jl
Outdated
```julia
a = d.lower === nothing ? nothing : log(T(minimum(d)))
b = d.upper === nothing ? nothing : log(T(maximum(d)))
```
Is there a reason to use minimum/maximum instead of d.lower/d.upper?
I feel like maybe I had a reason but who knows what it was
Ah, I remember now: it was to handle truncation points outside of the support, e.g. truncated(LogNormal(); lower=-1). In that case, d.lower == -1 but minimum(d) == 0. I should add a test for that.
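To make the distinction concrete, here is a sketch assuming the current Distributions.jl `truncated` keyword API (the parameters are just the ones from the comment above):

```julia
using Distributions

# Truncation point below the support of LogNormal:
d = truncated(LogNormal(); lower=-1)

d.lower      # -1: the stored truncation point, outside the support
minimum(d)   # 0.0: the effective lower end of the support
```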
src/truncated/normal.jl
Outdated
```julia
function mgf(d::Truncated{<:Normal{<:Real},Continuous}, t::Real)
    T = promote_type(partype(d), typeof(t))
```
I guess

```julia
T = float(promote_type(partype(d), typeof(t)))
```

would be more correct.
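The `float` wrapper matters when all inputs are integer-typed, since the promoted type would otherwise be an integer type while the mgf value is a real number. A minimal illustration in plain Julia:

```julia
# promote_type of two integer types is still an integer type:
promote_type(Int, Int)             # Int64 (on 64-bit systems)

# float() maps it to the corresponding floating-point type:
float(promote_type(Int, Int))      # Float64

# and is a no-op when the promoted type is already floating point:
float(promote_type(Float32, Int))  # Float32
```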
src/truncated/lognormal.jl
Outdated
```julia
    return @horner(m1, m4, -4m3, 6m2, 0, -3) / v^2 - 3
end

median(d::Truncated{<:LogNormal}) = exp(median(_truncnorm(d)))
```
This suggests we should add a similar definition for quantile as well?
Oh good call, it turns out that this relation holds in general, not just for the median.
Actually, defining median/quantile is unnecessary as there's already a fallback definition for computing quantiles for truncated distributions that would be more efficient than the additional layer of indirection involved with constructing a truncated normal.
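The relation in question can be sketched as follows: since `exp` is strictly increasing, quantiles commute with it, which is why the `median` definition works and why the generic `quantile` fallback gives the same answer (the distribution parameters below are arbitrary example values):

```julia
using Distributions

# Quantiles map through monotone transformations: if Y = exp(X) with X a
# truncated normal, then quantile(Y, p) = exp(quantile(X, p)).
d  = truncated(LogNormal(0.3, 0.8), 0.5, 4.0)
tn = truncated(Normal(0.3, 0.8), log(0.5), log(4.0))

exp(quantile(tn, 0.25)) ≈ quantile(d, 0.25)  # true
```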
```julia
function var(d::Truncated{<:LogNormal})
    tn = _truncnorm(d)
    # Ensure the variance doesn't end up negative, which can occur due to numerical issues
    return max(mgf(tn, 2) - mgf(tn, 1)^2, 0)
end
```
Repeated evaluation of mgf involves repeated calculations of the same (intermediate) quantities. But my fear is that optimizing this further will lead to less readable code...
Yeah, I had the same thought and wasn't sure what to do about it so I just... didn't address it, haha
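If one did want to avoid the duplicated work, one option is a helper that evaluates the mgf at both points while sharing the normalizing constant. This is only a sketch based on the standard closed-form mgf of a normal truncated to [a, b]; the helper names are hypothetical and not part of this PR:

```julia
using Distributions

# Standard result: for Normal(μ, σ) truncated to [a, b], with α = (a - μ)/σ
# and β = (b - μ)/σ, the mgf at t is
#   exp(μt + σ²t²/2) * (Φ(β - σt) - Φ(α - σt)) / (Φ(β) - Φ(α)).
# Evaluate it at t = 1 and t = 2 while sharing the denominator:
function _mgf12(μ, σ, a, b)
    α, β = (a - μ) / σ, (b - μ) / σ
    Φ(z) = cdf(Normal(), z)
    denom = Φ(β) - Φ(α)
    m1 = exp(μ + σ^2 / 2) * (Φ(β - σ) - Φ(α - σ)) / denom
    m2 = exp(2μ + 2σ^2) * (Φ(β - 2σ) - Φ(α - 2σ)) / denom
    return m1, m2
end

# Variance of the corresponding truncated log normal in a single pass:
function var_trunclognormal(μ, σ, a, b)
    m1, m2 = _mgf12(μ, σ, a, b)
    return max(m2 - m1^2, 0)
end
```

Whether the saved cdf evaluations are worth the extra code is exactly the readability trade-off discussed above.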
Force-pushed 9d417c7 to 036a24d
|
Ugh, type inference issue on 1.3 and I can't test on 1.3 locally 😑 |
```julia
a = d.lower === nothing || d.lower <= 0 ? nothing : log(T(d.lower))
b = d.upper === nothing || isinf(d.upper) ? nothing : log(T(d.upper))
```
Tests on Julia 1.3 pass locally when changing this to

```julia
a = d.lower === nothing ? nothing : log(T(max(d.lower, 0)))
b = d.upper === nothing ? nothing : log(T(d.upper))
```
That isn't functionally equivalent though; IIRC, I was relying on a and/or b being nothing in those cases so that truncated would handle it a particular way.
But arguably this optimization (d.lower === nothing or d.upper === nothing) should already have happened, either by a user or internally, when constructing the d = truncated(LogNormal(...))? Maybe one shouldn't expect that unoptimized inputs lead to optimized algorithms. AFAICT the optimization of truncated(Normal(...), ...) is also only exploited in the case where this returns a Normal (a = b = nothing); the code for Truncated{<:Normal} does not seem to use the fact that a bound might be nothing. In the Normal case, arguably the LogNormal shouldn't be truncated in the first place.
I'm currently sick and can only barely comprehend that message but if you'd prefer to go with your suggested change then feel free to apply it, I trust your judgement
Did we have this conversation a few months ago? My brain similarly can't comprehend whether this is the same discussion: #1874 (comment)
I should probably just log off and go sleep
This PR adds `mgf` for truncated normal and uses that to implement `mean`, `var`, `skewness`, and `kurtosis` for truncated log normal based on this observation. `median` is also implemented for truncated log normal.

Fixes #709
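The observation being exploited: if X is a truncated normal and Y = exp(X) is the corresponding truncated log normal, then E[Y^k] = E[e^{kX}] = mgf(X, k), so the raw moments of Y come directly from the truncated-normal mgf. A Monte-Carlo sanity check, assuming the `mgf` method this PR adds is available (parameters are arbitrary example values):

```julia
using Distributions, Random, Statistics

Random.seed!(1)
tn = truncated(Normal(0.2, 0.5), -1.0, 1.0)  # arbitrary example parameters
y = exp.(rand(tn, 10^6))                     # samples from the truncated log normal

# E[Y^k] = E[exp(kX)] = mgf(tn, k), hence:
m = mgf(tn, 1)                 # mean of the truncated log normal
v = mgf(tn, 2) - mgf(tn, 1)^2  # its variance

isapprox(mean(y), m; rtol=0.01)  # true (up to Monte-Carlo error)
isapprox(var(y), v; rtol=0.05)   # true (up to Monte-Carlo error)
```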