Compute correct log_likelihood for models with partially observed variables

Masked observations are no longer associated with the observed RV, but to a separate free RV. 

```python
import numpy as np
import pymc as pm

with pm.Model() as m:
  x = pm.Normal('x', observed=[np.nan, 1, 2, 3])

print(m['x_observed'].tag.observations)  # TensorConstant{[1.0 2.0 3.0]}
```

As such I am not sure whether we are computing the correcting model `log_likelihood` for partially observed models, **assuming the imputed observations should appear in the final log_likelihood**:

Running all tests in `test_idata_conversion.py` with coverage, confirmed that these lines lines in `InferenceDataConverter.log_likelihood-vals_point` are not being triggered:

https://github.com/pymc-devs/pymc/blob/0c90e8204fcf4040edca62a490e2f5aee982f96a/pymc/backends/arviz.py#L248-L257

This came up in #5245



	if isinstance(var.owner.op, (AdvancedIncSubtensor, AdvancedIncSubtensor1)):
	try:
	obs_data = extract_obs_data(var.tag.observations)
	except TypeError:
	warnings.warn(f"Could not extract data from symbolic observation {var}")

	mask = obs_data.mask
	if np.ndim(mask) > np.ndim(log_like_val):
	mask = np.any(mask, axis=-1)
	log_like_val = np.where(mask, np.nan, log_like_val)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Compute correct log_likelihood for models with partially observed variables #5255

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Compute correct log_likelihood for models with partially observed variables #5255

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions