ref(aci): dual write workflow group action status #92522

Open

cathteng wants to merge 8 commits into master from cathy/aci/dual-write-workflow-group-action-status

Member

cathteng commented May 29, 2025

Must merge #92478 first

Dual write WorkflowActionGroupStatus alongside ActionGroupStatus. This needs a new function because each action can be associated with its own set of workflows, and each workflow's frequency has to be enforced independently for every workflow+action combo. Querying specific workflow+action status combinations directly is awkward, so I query all the statuses for the actions+group and iterate to find the valid ones.

I renamed the old function to filter_recently_fired_actions and the new function is called filter_recently_fired_workflow_actions :)
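
A minimal sketch of the per-pair frequency check described above, in plain Python rather than the Django ORM. The function name `pairs_allowed_to_fire` and the `last_fired` and `frequency_minutes` mappings are illustrative assumptions, not code from this PR.

```python
from datetime import datetime, timedelta, timezone


def pairs_allowed_to_fire(action_to_workflow_ids, last_fired, frequency_minutes, now):
    """Return the (workflow_id, action_id) pairs whose frequency window has elapsed.

    All argument shapes are hypothetical stand-ins for the real models.
    """
    allowed = set()
    for action_id, workflow_ids in action_to_workflow_ids.items():
        for workflow_id in workflow_ids:
            fired_at = last_fired.get((workflow_id, action_id))
            window = timedelta(minutes=frequency_minutes[workflow_id])
            # A pair with no recorded status has never fired, so it is allowed.
            if fired_at is None or now - fired_at >= window:
                allowed.add((workflow_id, action_id))
    return allowed


now = datetime(2025, 5, 30, 12, 0, tzinfo=timezone.utc)
# Action 10 is shared by workflow 1 (30-minute frequency) and workflow 2 (5-minute frequency).
action_to_workflow_ids = {10: {1, 2}}
frequency_minutes = {1: 30, 2: 5}
ten_minutes_ago = now - timedelta(minutes=10)
last_fired = {(1, 10): ten_minutes_ago, (2, 10): ten_minutes_ago}

allowed = pairs_allowed_to_fire(action_to_workflow_ids, last_fired, frequency_minutes, now)
```

In this example the shared action is still inside workflow 1's window but outside workflow 2's, so only the workflow 2 pair is allowed. That is the kind of case that tracking status per workflow+action is meant to distinguish.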

github-actions bot added the Scope: Backend label

codecov bot commented May 29, 2025

Codecov Report

All modified and coverable lines are covered by tests ✅

✅ All tests successful. No failed tests found.

Additional details and impacted files

@@            Coverage Diff             @@
##           master   #92522      +/-   ##
==========================================
+ Coverage   86.61%   87.89%   +1.28%     
==========================================
  Files       10236    10236              
  Lines      586994   587131     +137     
  Branches    22806    22806              
==========================================
+ Hits       508398   516084    +7686     
+ Misses      78166    70617    -7549     
  Partials      430      430

cathteng commented

src/sentry/workflow_engine/processors/action.py Outdated

Comment on lines 144 to 146

+                  all_statuses = WorkflowActionGroupStatus.objects.filter(
+                      group=group, action_id__in=action_to_workflows_ids.keys()
+                  )

Member Author

cathteng May 29, 2025

wasn't sure how to query for specific action+workflow combos without making a long query with a ton of Q()'s, so opted for iterating through a big query

Member

kcons May 30, 2025

any reason to not also do workflow_id__in just to ensure that results are as narrow as we can simply get them?

Then the filtering below can use [] instead of get too.

Member Author

cathteng May 30, 2025

good point, will update
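
The trade-off in this thread can be sketched without the ORM. Filtering on both `action_id__in` and `workflow_id__in` narrows the result, but it still matches the cross product of the two ID sets, so exact pairs are picked out in memory afterwards. `Status` and `narrow_then_match` are hypothetical names, and the list comprehension only imitates what the queryset filter would return.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Status:
    """Hypothetical stand-in for a status row."""
    workflow_id: int
    action_id: int


def narrow_then_match(rows, action_to_workflow_ids):
    action_ids = set(action_to_workflow_ids)
    workflow_ids = set().union(*action_to_workflow_ids.values())
    # Imitates .filter(action_id__in=action_ids, workflow_id__in=workflow_ids).
    candidates = [
        row for row in rows
        if row.action_id in action_ids and row.workflow_id in workflow_ids
    ]
    # Every candidate's action_id is a key of the mapping, so indexing with []
    # is safe here and no .get() fallback is needed.
    return [
        row for row in candidates
        if row.workflow_id in action_to_workflow_ids[row.action_id]
    ]


rows = [Status(1, 10), Status(2, 10), Status(2, 20), Status(3, 30)]
matched = narrow_then_match(rows, {10: {1}, 20: {2}})
```

`Status(2, 10)` passes both `__in` filters but is not a requested pair, which is why the in-memory pass is still needed after narrowing.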

cathteng commented

src/sentry/workflow_engine/processors/action.py

+                  workflow_action_statuses = WorkflowActionGroupStatus.objects.filter(
+                      id__in=status_ids, date_updated__lt=now
+                  )
+                  action_ids = {status.action_id for status in workflow_action_statuses}

Member Author

cathteng May 29, 2025

using a set will dedupe actions if multiple workflows would fire the same action. in a follow up we should know which workflow is firing it (out of all the workflows that could have done so here)
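
A small illustration of the point above, using plain tuples as hypothetical stand-ins for status rows: collecting only action IDs into a set dedupes the action but discards which workflows could have fired it, while keeping the pairs retains that information for a follow-up.

```python
# (workflow_id, action_id) pairs; workflows 1 and 2 both fire action 10.
fired = [(1, 10), (2, 10), (2, 20)]

# Deduped action IDs, as in the snippet under review: action 10 appears once.
action_ids = {action_id for _, action_id in fired}

# Keeping the pairs preserves the candidate workflows for each action.
fired_pairs = set(fired)
workflows_for_action_10 = {
    workflow_id for workflow_id, action_id in fired_pairs if action_id == 10
}
```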

cathteng marked this pull request as ready for review

May 29, 2025 22:29

cathteng requested a review from a team as a code owner

May 29, 2025 22:29

Base automatically changed from cathy/aci/workflow-action-group-status to master

May 30, 2025 16:07

cathteng requested a review from a team as a code owner

May 30, 2025 16:07

cathteng requested a review from kcons

May 30, 2025 17:04

cathteng added 7 commits

May 30, 2025 10:07

fa02abf  dual write workflow group action status
3e38f69  tests
63ec4a6  smol ref
de60178  smol ref again
b24816d  remove unused code
ffcbecb  more renaming
441484a  test sub functions

cathteng force-pushed the cathy/aci/dual-write-workflow-group-action-status branch from 9f0ffee to 441484a

May 30, 2025 17:07

kcons approved these changes

Member

kcons left a comment

Looks good, some thoughts tho.

src/sentry/workflow_engine/processors/action.py

                   create_workflow_fire_histories(filtered_actions, event_data)
                   return filtered_actions
+              def get_workflow_group_action_statuses(
+                  action_to_workflows_ids: dict[int, set[int]], group: Group
+              ) -> dict[int, list[WorkflowActionGroupStatus]]:

Member

kcons May 30, 2025

Docstring? Something like "returns them grouped by Action ID", as I'm not sure the key is obvious enough from the name or type.
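
One way the requested docstring could read, sketched on a plain-Python helper. `StatusRow` and `group_statuses_by_action_id` are hypothetical names, and the grouping logic is an assumption about what the real function does based on its return type.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class StatusRow:
    """Hypothetical stand-in for a WorkflowActionGroupStatus row."""
    workflow_id: int
    action_id: int


def group_statuses_by_action_id(statuses):
    """Return the given statuses grouped by Action ID.

    The result maps each action ID to the list of statuses recorded for that
    action, one per workflow. Actions with no status are absent from the result.
    """
    grouped = defaultdict(list)
    for status in statuses:
        grouped[status.action_id].append(status)
    return dict(grouped)


rows = [StatusRow(1, 10), StatusRow(2, 10), StatusRow(2, 20)]
grouped = group_statuses_by_action_id(rows)
```

Converting the `defaultdict` to a plain `dict` before returning means a lookup for an action with no statuses raises `KeyError` instead of silently inserting an empty list.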

src/sentry/workflow_engine/processors/action.py Outdated

@@ -127,11 +130,122 @@ def filter_recently_fired_workflow_actions(
                   actions_without_statuses_ids = {action.id for action in actions_without_statuses}
                   filtered_actions = actions.filter(id__in=actions_to_include | actions_without_statuses_ids)
+                  # dual write to WorkflowActionGroupStatus
+                  filter_recently_fired_workflow_actions(filtered_action_groups, event_data)

Member

kcons May 30, 2025

Maybe

# dual write to ... ignoring results for now until they are canonical
_ = filter_recently_fired_workflow_actions(...)

just to make our discarding of the return value explicit and obviously intentional.

src/sentry/workflow_engine/processors/action.py

                   return actions_with_statuses


               def update_workflow_action_group_statuses(

Member

kcons May 30, 2025

Should document what it returns.

src/sentry/workflow_engine/processors/action.py Outdated

+                  # TODO: write this in a single spot
+                  # create_workflow_fire_histories
+                  return Action.objects.filter(id__in=action_ids).distinct()

Member

kcons May 30, 2025

aren't these guaranteed to be distinct?

Member Author

cathteng May 30, 2025

true. thanks

src/sentry/workflow_engine/processors/action.py

@@ -94,7 +97,7 @@ def create_workflow_fire_histories(
               # TODO(cathy): only reinforce workflow frequency for certain issue types
-              def filter_recently_fired_workflow_actions(
+              def filter_recently_fired_actions(

Member

kcons May 30, 2025

By name, this just filters what we pass in, but it also updates statuses and history.
I think it needs a docstring along the lines of "Returns actions associated with the provided DataConditionGroups, excluding those that ... whatever. Also updates ...".

I do wonder if some of the book-keeping could be separated from the filtering to simplify that.

src/sentry/workflow_engine/processors/action.py

+                  )
+                  # TODO: need to know the exact workflow that fired the action
+                  return action_ids

Member

kcons May 30, 2025

So, action_ids are the actions not filtered out based on workflow config and previous status? Rather than constructing action_ids, I wonder if it'd be simpler to note action_ids filtered out, and return <full set of action ids> - filtered_action_ids, and leave the WAGS book-keeping as its own thing?

Member Author

cathteng May 30, 2025

i will look into whether this can be cleaned up a bit in a follow up, i'm also not a huge fan of how i've mixed updating the statuses with figuring out which actions to fire
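
A rough sketch of the split suggested in this thread: one pure function reports which actions are filtered out, the caller subtracts that from the full set, and the status book-keeping lives in its own function. The names and the single per-action window are simplifying assumptions; the real code tracks status per workflow+action.

```python
from datetime import datetime, timedelta, timezone


def recently_fired_action_ids(last_fired_by_action, window, now):
    """Pure filter: IDs of actions that fired less than `window` before `now`."""
    return {
        action_id
        for action_id, fired_at in last_fired_by_action.items()
        if now - fired_at < window
    }


def record_fired(last_fired_by_action, fired_action_ids, now):
    """Book-keeping only: return a new mapping with updated timestamps."""
    updated = dict(last_fired_by_action)
    for action_id in fired_action_ids:
        updated[action_id] = now
    return updated


now = datetime(2025, 5, 30, 12, 0, tzinfo=timezone.utc)
all_action_ids = {10, 20, 30}
last_fired = {10: now - timedelta(minutes=2), 20: now - timedelta(hours=2)}

filtered_out = recently_fired_action_ids(last_fired, timedelta(minutes=30), now)
to_fire = all_action_ids - filtered_out
new_last_fired = record_fired(last_fired, to_fire, now)
```

With the filter free of side effects, it can be tested on its own and the status update can be moved or batched without touching the firing decision.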

src/sentry/workflow_engine/processors/action.py

+                  )
+                  # TODO: write this in a single spot
+                  # create_workflow_fire_histories

Member

kcons May 30, 2025

kinda think this should be lifted out of here instead; noting which workflows fire isn't really a job for action filtering; the higher level stuff needs to know "we have our list of workflow actions to fire". If we bump this to a higher level, this TODO doesn't need to exist and I think the correctness of it all is clearer.
That said, not a change for this PR.


0f7e7a7  ref from reviews
