Anomaly: Added support for >1 target category columns #536

govarsha · 2024-01-23T13:50:53Z

ODSC-52180

Right now anomaly operator supports only one target category column. Added code to support more than 1.

Screenshots:

github-actions · 2024-01-23T14:23:41Z

📌 Cov diff with main:

📌 Overall coverage:

github-actions · 2024-01-23T23:55:08Z

📌 Cov diff with main:

📌 Overall coverage:

ahosler · 2024-01-24T09:31:32Z

ads/opctl/operator/lowcode/anomaly/model/anomaly_dataset.py

- for group, df in self.data.groupby(spec.target_category_columns[0])
+ # Merge target category columns
+
+ self.data["__Series__"] = utils._merge_category_columns(self.data, spec.target_category_columns)


It looks like the "Series" column gets added to self.data and then is referenced throughout the rest of the code.
Could you add to the docstring of this method "Merges all target category columns into a single column named "Series"" Or something equivalent.

I also see that the same code is run on line 180 of base_model.py. Do we need to do this twice?

I will add the docstring.
Here _load_data in anomaly_dataset doesn't load/preprocess validation data. We are loading the validation data in base_model and the merge_category_columns is done there on validation data.

ahosler · 2024-01-24T09:31:43Z

ads/opctl/operator/lowcode/anomaly/model/base_model.py

 if data.empty:
 return total_metrics, summary_metrics, None
-
+ data["__Series__"] = utils._merge_category_columns(data, self.spec.target_category_columns)


Is this redundant?

What is the intended behaviour when there is no target_category_column present?

The method _load_data in anomaly_dataset doesn't load/preprocess validation data. We are loading the validation data here and the merge_category_columns is done here for that.

And yeah I forgot to handle when no target_category_column is present for validation data. I will do that change

added support for multiple target category columns

73ebf11

oracle-contributor-agreement bot added the OCA Verified All contributors have signed the Oracle Contributor Agreement. label Jan 23, 2024

govarsha changed the title ~~Added support for multiple target category columns~~ Anomaly: Added support for multiple target category columns Jan 23, 2024

govarsha changed the title ~~Anomaly: Added support for multiple target category columns~~ Anomaly: Added support for >1 target category columns Jan 23, 2024

bug fix

5caf552

govarsha requested review from ahosler, codeloop, mrDzurb and prasankh January 23, 2024 23:45

ahosler merged commit 8c1f1a9 into feature/anomaly_detect Jan 25, 2024

ahosler reviewed Jan 25, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Anomaly: Added support for >1 target category columns #536

Anomaly: Added support for >1 target category columns #536

Uh oh!

govarsha commented Jan 23, 2024 •

edited

Loading

github-actions bot commented Jan 23, 2024

github-actions bot commented Jan 23, 2024

ahosler Jan 24, 2024

govarsha Jan 25, 2024

ahosler Jan 24, 2024

ahosler Jan 24, 2024

govarsha Jan 25, 2024

govarsha Jan 25, 2024

Labels

2 participants

Anomaly: Added support for >1 target category columns #536

Anomaly: Added support for >1 target category columns #536

Uh oh!

Conversation

govarsha commented Jan 23, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

github-actions bot commented Jan 23, 2024

github-actions bot commented Jan 23, 2024

ahosler Jan 24, 2024

Choose a reason for hiding this comment

govarsha Jan 25, 2024

Choose a reason for hiding this comment

ahosler Jan 24, 2024

Choose a reason for hiding this comment

ahosler Jan 24, 2024

Choose a reason for hiding this comment

govarsha Jan 25, 2024

Choose a reason for hiding this comment

govarsha Jan 25, 2024

Choose a reason for hiding this comment

Labels

2 participants

govarsha commented Jan 23, 2024 •

edited

Loading