
Add Ops for Gaussian Hypergeometric Function, Pochhammer Symbol, and Factorials #90

Merged
merged 2 commits into pymc-devs:main on Jan 5, 2023

Conversation

ColtAllen
Contributor

This PR is a continuation of #87 in a new fork and adds Ops for the Gaussian Hypergeometric Function, Pochhammer Symbol, and Factorials as hyp2f1, poch, and factorial, respectively.

hyp2f1 involves an infinite summation and uses a scipy implementation for this reason, but once scan is performant enough to assume this task, hyp2f1 can be rewritten in terms of poch and factorial in a future PR.
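For context, 2F1 is defined by the series 2F1(a, b; c; z) = Σ_n (a)_n (b)_n / (c)_n · z^n / n!, which is what would make a poch/factorial-based rewrite possible. A minimal sketch of that relationship using scipy (illustrative only; the name hyp2f1_truncated and the fixed 50-term cutoff are not part of this PR):

```python
# Illustrative sketch only: the truncated series that ties hyp2f1 to poch and
# factorial, checked against scipy's reference implementation.
import numpy as np
from scipy.special import factorial, hyp2f1, poch


def hyp2f1_truncated(a, b, c, z, n_terms=50):
    """Approximate 2F1(a, b; c; z) = sum_n (a)_n (b)_n / (c)_n * z**n / n!."""
    n = np.arange(n_terms)
    terms = poch(a, n) * poch(b, n) / poch(c, n) * z**n / factorial(n)
    return terms.sum()


# For |z| < 1 the truncated series matches scipy closely.
print(hyp2f1_truncated(1.5, 2.0, 3.5, 0.4))
print(hyp2f1(1.5, 2.0, 3.5, 0.4))
```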

The only failing test is test_grad for hyp2f1. It seems to be a data type issue (the derivative Op appears to be handed a symbolic ScalarVariable where its Python implementation expects a number):

TypeError: ('float() argument must be a string or a number, not 'ScalarVariable'
Apply node that caused the error: Elemwise{hyp2f1_der}(input 0, input 1, input 2, input 3, TensorConstant{(1, 1) of 0.0})
Toposort index: 0
Inputs types: [TensorType(float64, (2, 3)), TensorType(float64, (2, 3)), TensorType(float64, (2, 3)), TensorType(float64, (2, 3)), TensorType(float64, (1, 1))]
Inputs shapes: [(2, 3), (2, 3), (2, 3), (2, 3), (1, 1)]
Inputs strides: [(24, 8), (24, 8), (24, 8), (24, 8), (8, 8)]
Inputs values: ['not shown', 'not shown', 'not shown', 'not shown', array([[0.]])]
Outputs clients: [[Elemwise{Mul}[(0, 1)](random_projection, Elemwise{hyp2f1_der}.0)]]

Backtrace when the node is created (use PyTensor flag traceback__limit=N to make it longer):
  File "/mnt/c/Users/colta/portfolio/pytensor/tests/tensor/utils.py", line 590, in test_grad
    utt.verify_grad(
  File "/mnt/c/Users/colta/portfolio/pytensor/tests/unittest_tools.py", line 70, in verify_grad
    orig_verify_grad(op, pt, n_tests, rng, *args, **kwargs)
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 1844, in verify_grad
    symbolic_grad = grad(cost, tensor_pt, disconnected_inputs="ignore")
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 619, in grad
    rval: Sequence[Variable] = _populate_grad_dict(
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 1426, in _populate_grad_dict
    rval = [access_grad_cache(elem) for elem in wrt]
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 1426, in <listcomp>
    rval = [access_grad_cache(elem) for elem in wrt]
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 1381, in access_grad_cache
    term = access_term_cache(node)[idx]
  File "/mnt/c/Users/colta/portfolio/pytensor/pytensor/gradient.py", line 1209, in access_term_cache
    input_grads = node.op.L_op(inputs, node.outputs, new_output_grads)

HINT: Use the PyTensor flag `exception_verbosity=high` for a debug print-out and storage map footprint of this Apply node.',
'Test Elemwise{hyp2f1,no_inplace}::normal: Error occurred while computing the gradient on the following inputs:
[array([[764.16214925, 550.49823533, 542.19109217],
        [613.93532095, 341.6967123 , 284.38215306]]),
 array([[764.16214925, 550.49823533, 542.19109217],
        [613.93532095, 341.6967123 , 284.38215306]]),
 array([[764.16214925, 550.49823533, 542.19109217],
        [613.93532095, 341.6967123 , 284.38215306]]),
 array([[0.38208107, 0.27524912, 0.27109555],
        [0.30696766, 0.17084836, 0.14219108]])]')

@ricardoV94 ricardoV94 changed the base branch from downstream_1288 to main December 7, 2022 16:01
@ricardoV94
Member

@ColtAllen Do you mind squashing your commits into: 1) Add factorial and poch helpers and 2) Add Hyp2F1 and derivatives?

@ColtAllen
Contributor Author

@ColtAllen Do you mind squashing your commits into: 1) Add factorial and poch helpers and 2) Add Hyp2F1 and derivatives?

Commits are squashed, but I edited the wrong commit message; sorry about that.

@ricardoV94
Member

ricardoV94 commented Dec 7, 2022

You can edit the commit message with interactive rebase (back up your branch first if it's your first time)

https://docs.github.com/en/pull-requests/committing-changes-to-your-project/creating-and-editing-commits/changing-a-commit-message

@twiecki
Member

twiecki commented Dec 21, 2022

@ColtAllen Can you run pre-commit on this?

@ColtAllen
Contributor Author

Hey @twiecki,

After a conversation with @ricardoV94, we decided the next step is for the hyp2f1 derivatives to be written in Stan. He offered to do so since he's more familiar with that language. This could involve considerable changes to what has already been written, so it may be best to wait until afterward to run pre-commit.
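
As an aside (not part of this PR): the derivative of 2F1 with respect to z has the closed form (a·b/c) · 2F1(a+1, b+1; c+1; z); it is the derivatives with respect to a, b, and c that require extra series evaluations, which is presumably why the Stan implementation was brought up. A quick scipy sanity check of the z-derivative identity:

```python
# Sanity check (illustrative, not PR code): the closed-form z-derivative of 2F1,
# d/dz 2F1(a, b; c; z) = (a * b / c) * 2F1(a + 1, b + 1; c + 1; z),
# compared against a central finite difference.
from scipy.special import hyp2f1

a, b, c, z, eps = 1.5, 2.0, 3.5, 0.4, 1e-6

analytic = (a * b / c) * hyp2f1(a + 1, b + 1, c + 1, z)
numeric = (hyp2f1(a, b, c, z + eps) - hyp2f1(a, b, c, z - eps)) / (2 * eps)

print(analytic, numeric)  # the two values should agree to several decimal places
```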

@codecov-commenter

codecov-commenter commented Dec 30, 2022

Codecov Report

Merging #90 (10d4d94) into main (8051ffb) will increase coverage by 0.00%.
The diff coverage is 84.21%.

Additional details and impacted files

@@           Coverage Diff           @@
##             main      #90   +/-   ##
=======================================
  Coverage   79.95%   79.95%           
=======================================
  Files         170      170           
  Lines       44856    44950   +94     
  Branches     9498     9510   +12     
=======================================
+ Hits        35863    35942   +79     
- Misses       6780     6790   +10     
- Partials     2213     2218    +5     
| Impacted Files | Coverage Δ |
|---|---|
| pytensor/scalar/math.py | 85.00% <82.55%> (-0.30%) ⬇️ |
| pytensor/tensor/inplace.py | 100.00% <100.00%> (ø) |
| pytensor/tensor/math.py | 90.69% <100.00%> (+0.01%) ⬆️ |
| pytensor/tensor/special.py | 90.90% <100.00%> (+0.26%) ⬆️ |

@ricardoV94
Member

@OriolAbril does the error in RTD make any sense to you?

@OriolAbril
Member

It doesn't mean anything to me; I'd need to try to reproduce it locally to get a better idea :/. Has anything similar ever happened outside RTD?

@ricardoV94
Member

@ColtAllen I've finally come around to this one. Do you want to have a look?

Comment on lines +1529 to +1547
```python
def check_2f1_converges(a, b, c, z) -> bool:
    num_terms = 0
    is_polynomial = False

    def is_nonpositive_integer(x):
        return x <= 0 and x.is_integer()

    if is_nonpositive_integer(a) and abs(a) >= num_terms:
        is_polynomial = True
        num_terms = int(np.floor(abs(a)))
    if is_nonpositive_integer(b) and abs(b) >= num_terms:
        is_polynomial = True
        num_terms = int(np.floor(abs(b)))

    is_undefined = is_nonpositive_integer(c) and abs(c) <= num_terms

    return not is_undefined and (
        is_polynomial or np.abs(z) < 1 or (np.abs(z) == 1 and c > (a + b))
    )
```
Contributor Author

Just idle curiosity, but what is the significance of the num_terms variable?

Member

@ricardoV94 ricardoV94 Jan 3, 2023

I think when the series has a negative integer in the denominator, it will eventually hit a divide-by-zero and blow up... unless it has a negative integer in the numerator that reaches zero first, truncating the series before that happens.
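
A toy illustration of that truncation with scipy's poch (not library code; the specific values a = -3 and c = -5 are just for demonstration):

```python
# Toy illustration (not library code): with a = -3 the numerator Pochhammer
# (a)_n hits zero at n = 4, truncating the series at n = 3, before the
# denominator (c)_n with c = -5 reaches zero at n = 6.
from scipy.special import poch

a, c = -3.0, -5.0
for n in range(7):
    print(n, poch(a, n), poch(c, n))
# (a)_n: 1, -3, 6, -6, 0, 0, 0   -> the series is a degree-3 polynomial
# (c)_n: 1, -5, 20, -60, 120, -120, 0 -> its zero is never actually reached
```

In terms of the check above, num_terms would be 3 here, and since |c| = 5 > 3 the series is not flagged as undefined, i.e. it converges as a polynomial.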

@ColtAllen
Contributor Author

@ColtAllen I've finally come around to this one. Do you want to have a look?

Nice! We discussed using Stan, but it looks like you were able to get this running with numpy. Will there be a performance hit in doing so?

Also, thinking long-term, do you think this is good to go for the foreseeable future, or are there known bottlenecks worth revisiting later as additional backends are adopted and improvements are made elsewhere in pytensor? If so, these are worth documenting for future work.

@ricardoV94
Member

ricardoV94 commented Jan 3, 2023

I didn't think of using Stan directly, just adapting their implementation. Using numpy is certainly not the most efficient thing we can do; the broader issue is discussed here: #83

I want to take a stab at it in the following days, but for now I think numpy will suffice. Depending on the range of values it can actually converge pretty quickly (~10 iterations).

Actually I wanted to ask if you could try it in the model that motivated the issue and see how it fares.
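
For a rough sense of that convergence rate, here is a plain-Python sketch of the series using a simple term-ratio recurrence (illustrative only; hyp2f1_series, the tolerance, and the iteration cap are not the PR's actual implementation):

```python
# Rough sketch (not the PR's code) of how quickly the 2F1 series converges for
# modest |z|: each term is obtained from the previous one by a simple ratio.
def hyp2f1_series(a, b, c, z, tol=1e-12, max_iter=1000):
    total, term = 1.0, 1.0
    for n in range(max_iter):
        term *= (a + n) * (b + n) / (c + n) * z / (n + 1)
        total += term
        if abs(term) < tol * abs(total):
            return total, n + 1  # value, number of terms used
    return total, max_iter


value, n_terms = hyp2f1_series(1.5, 2.0, 3.5, 0.1)
print(value, n_terms)  # converges in roughly a dozen terms for z = 0.1
```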

@OriolAbril
Member

The rtd issue doesn't seem to happen in other PRs :/.

@michaelosthege
Member

rtd build should get fixed by a rebase @ricardoV94

@ricardoV94 ricardoV94 added the enhancement New feature or request label Jan 4, 2023
Member

@ricardoV94 ricardoV94 left a comment

Self-approval warning :)

@twiecki twiecki merged commit 9b2cb97 into pymc-devs:main Jan 5, 2023
@ColtAllen ColtAllen deleted the downstream_1288 branch January 10, 2023 04:37