adding extrinsic eval notebook #374

ephamhung-oss · 2025-10-20T20:15:28Z

No description provided.

Signed-off-by: Eric Pham-Hung <ephamhung@ephamhung-mlt.client.nvidia.com>

Signed-off-by: Eric Pham-Hung <ephamhung@nvidia.com>

nina-xu

Overall it looks good! Thanks so much for putting this together.

nina-xu · 2025-11-05T17:46:18Z

nemo/NeMo-Safe-Synthesizer/advanced/extrinsic_evaluation.ipynb

+   "id": "630e3e17",
+   "metadata": {},
+   "source": [
+    "# 🎛️ NeMo Safe Synthesizer 101: Extrinsic Evaluation\n",


nina-xu · 2025-11-05T17:47:12Z

nemo/NeMo-Safe-Synthesizer/advanced/extrinsic_evaluation.ipynb

+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# This script defines a scikit-learn pipeline for a classification task.\n",


For the extrinsic evaluation portion, there's a bit of code repetition. I suggest DRYing it up by moving the train + eval steps into a function and calling that function twice: train_and_evaluate_logistic_regression(df, test_df) and train_and_evaluate_logistic_regression(synthetic_df, test_df). This also makes it very clear to a user what we are doing here.
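A rough sketch of what such a helper could look like. This is not the notebook's actual code: the `target_col` parameter, the preprocessing steps, the binary target, and the returned dictionary are all assumptions made for illustration. Only the function name and the two-call pattern come from the suggestion above.

```python
import numpy as np
import pandas as pd
from sklearn.compose import ColumnTransformer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, classification_report, roc_auc_score
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import OneHotEncoder, StandardScaler


def train_and_evaluate_logistic_regression(train_df, test_df, target_col="target"):
    """Fit a fresh pipeline on train_df and score it on the held-out test_df.

    Assumes a binary target stored in `target_col` (a hypothetical column name).
    """
    X_train, y_train = train_df.drop(columns=[target_col]), train_df[target_col]
    X_test, y_test = test_df.drop(columns=[target_col]), test_df[target_col]

    # Assumed preprocessing: scale numeric columns, one-hot encode the rest.
    numeric_cols = X_train.select_dtypes(include="number").columns.tolist()
    categorical_cols = [c for c in X_train.columns if c not in numeric_cols]
    preprocessor = ColumnTransformer([
        ("num", StandardScaler(), numeric_cols),
        ("cat", OneHotEncoder(handle_unknown="ignore"), categorical_cols),
    ])

    # Build the pipeline inside the function so each call fits a new estimator.
    pipeline = Pipeline([
        ("preprocess", preprocessor),
        ("clf", LogisticRegression(max_iter=1000)),
    ])
    pipeline.fit(X_train, y_train)

    y_pred = pipeline.predict(X_test)
    y_proba = pipeline.predict_proba(X_test)[:, 1]  # probability of class 1
    return {
        "accuracy": accuracy_score(y_test, y_pred),
        "roc_auc": roc_auc_score(y_test, y_proba),
        "report": classification_report(y_test, y_pred),
    }
```

With something like this in place, the benchmark and the synthetic run reduce to `train_and_evaluate_logistic_regression(df, test_df)` and `train_and_evaluate_logistic_regression(synthetic_df, test_df)`, both scored against the same held-out real test set. Constructing the pipeline inside the function also avoids sharing one fitted pipeline object between the two runs.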

nina-xu · 2025-11-05T17:48:02Z

nemo/NeMo-Safe-Synthesizer/advanced/extrinsic_evaluation.ipynb

+    "from sklearn.metrics import classification_report, accuracy_score, roc_auc_score\n",
+    "\n",
+    "original_pipeline = full_pipeline \n",
+    "print(\"\\n--- Training Benchmark Model on Original Data (1000 rows) ---\")\n",


I don't think the 1000 here is accurate?
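One possible way to keep the message from drifting out of sync with the data is to derive the row count from the DataFrame instead of hard-coding it. A minimal sketch, where `training_banner` and the stand-in DataFrame are hypothetical and not part of the notebook:

```python
import pandas as pd


def training_banner(label, train_df):
    """Build the log header from the actual number of training rows."""
    return f"\n--- Training Benchmark Model on {label} Data ({len(train_df)} rows) ---"


# Stand-in data for illustration only; the notebook would pass its own training frame.
df = pd.DataFrame({"feature": [0.1, 0.2, 0.3], "target": [0, 1, 0]})
print(training_banner("Original", df))
```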

nina-xu · 2025-11-05T17:49:42Z

nemo/NeMo-Safe-Synthesizer/advanced/extrinsic_evaluation.ipynb

+    "| Accuracy            |                 0.9404 |      0.9278 |\n",
+    "| ROC AUC Score       |                 0.9782 |      0.9762 |\n",
+    "| Precision (Class 1) |                 0.9626 |      0.9423 |\n",
+    "| Recall (Class 1)    |                 0.9646 |      0.9714 |\n",


These are amazing results. Out of curiosity, what was the SQS?

adding extrinsic eval notebook

df31a32

Signed-off-by: Eric Pham-Hung <ephamhung@ephamhung-mlt.client.nvidia.com>

ephamhung-oss marked this pull request as ready for review October 20, 2025 20:20

ephamhung-oss added 4 commits November 4, 2025 16:48

updated traintest split and comments

81d8174

Signed-off-by: Eric Pham-Hung <ephamhung@nvidia.com>

Merge branch 'NVIDIA:main' into add-extrinsic-eval

9112a6d

moved to advanced

25cc972

Signed-off-by: Eric Pham-Hung <ephamhung@nvidia.com>

updating intro

37fcd12

nina-xu reviewed Nov 5, 2025
