@dimapihtar
Collaborator

Important

The Update branch button must only be pressed on very rare occasions.
An outdated branch is never blocking the merge of a PR.
Please reach out to the automation team before pressing that button.

What does this PR do?

Add a one line overview of what this PR aims to accomplish.

Collection: [Note which collection this PR will affect]

Changelog

  • Add specific line by line info of high level changes in this PR.

Usage

  • You can potentially add a usage example below
# Add a code snippet demonstrating how to use this 

GitHub Actions CI

The Jenkins CI system has been replaced by GitHub Actions self-hosted runners.

The GitHub Actions CI will run automatically when the "Run CICD" label is added to the PR.
To re-run CI, remove the label and add it again.
To run CI on an untrusted fork, a NeMo user with write access must first click "Approve and run".

Before your PR is "Ready for review"

Pre checks:

  • Make sure you have read and followed the Contributor guidelines
  • Did you write any new necessary tests?
  • Did you add or update any necessary documentation?
  • Does the PR affect components that are optional to install? (e.g., Numba, Pynini, Apex)
    • Reviewer: Does the PR have correct import guards for all optional libraries?
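
As an illustration of the import-guard check in the list above, here is a minimal sketch of one way to guard an optional dependency. The helper names `optional_import` and `require_backend` and the module name `some_optional_backend` are hypothetical and chosen only for this example; they are not taken from NeMo.

```python
import importlib
from types import ModuleType
from typing import Optional, Tuple


def optional_import(name: str) -> Tuple[Optional[ModuleType], bool]:
    """Try to import `name`; return (module, True) or (None, False)."""
    try:
        return importlib.import_module(name), True
    except ImportError:  # ModuleNotFoundError is a subclass of ImportError
        return None, False


# Guard the optional dependency once, at module import time.
# "some_optional_backend" is a hypothetical package name.
backend, HAVE_BACKEND = optional_import("some_optional_backend")


def require_backend() -> None:
    """Raise a clear error only when the optional feature is actually used."""
    if not HAVE_BACKEND:
        raise ImportError("This feature needs the optional package 'some_optional_backend'.")
```

With a guard like this, importing the module succeeds even when the optional package is absent, and the failure is deferred to the point where the optional feature is requested.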

PR Type:

  • New Feature
  • Bugfix
  • Documentation

If you haven't finished some of the above items, you can still open a "Draft" PR.

Who can review?

Anyone in the community is free to review the PR once the checks have passed.
The Contributor guidelines list specific people who can review PRs in various areas.

Additional Information

  • Related to # (issue)

Signed-off-by: dimapihtar <[email protected]>
@github-actions github-actions bot added the NLP label Jul 8, 2025
dimapihtar and others added 2 commits July 8, 2025 09:56
from nemo.collections.nlp.data.common.sequence_to_sequence_dataset import SequenceToSequenceDataset

try:
    from nemo.collections.nlp.data.common.sequence_to_sequence_dataset import SequenceToSequenceDataset

Check notice

Code scanning / CodeQL

Unused import Note

Import of 'SequenceToSequenceDataset' is not used.

Copilot Autofix

AI 6 months ago

To fix the problem, the unused import of SequenceToSequenceDataset should be removed. This involves deleting both the try block that imports SequenceToSequenceDataset and the except block that assigns it to ABC. This will clean up the code and remove unnecessary dependencies.

Suggested changeset 1
nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py

Autofix patch

Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py b/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
--- a/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
+++ b/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
@@ -28,8 +28,3 @@
 
-try:
-    from nemo.collections.nlp.data.common.sequence_to_sequence_dataset import SequenceToSequenceDataset
-except ModuleNotFoundError:
-    from abc import ABC
 
-    SequenceToSequenceDataset = ABC
 
EOF
Copilot is powered by AI and may make mistakes. Always verify output.
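
The CodeQL notice above flags an import whose name is never read. A minimal sketch of that kind of check, using Python's standard `ast` module, might look like the following. It is an illustration only and not CodeQL's actual analysis; the function name `unused_imports` and the `pkg.data` module path in the sample are assumptions made for this example.

```python
import ast
from typing import List


def unused_imports(source: str) -> List[str]:
    """Return imported names that are never read anywhere in `source`."""
    tree = ast.parse(source)

    # Collect every name bound by an import statement.
    imported = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            for alias in node.names:
                # `import a.b` binds the top-level name `a`.
                imported.add(alias.asname or alias.name.split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            for alias in node.names:
                imported.add(alias.asname or alias.name)

    # Collect every name that is read (loaded) somewhere in the module.
    used = {
        node.id
        for node in ast.walk(tree)
        if isinstance(node, ast.Name) and isinstance(node.ctx, ast.Load)
    }
    return sorted(imported - used)


# Same shape as the guarded import discussed in this PR.
SAMPLE = '''
try:
    from pkg.data import SequenceToSequenceDataset
except ModuleNotFoundError:
    from abc import ABC

    SequenceToSequenceDataset = ABC
'''

print(unused_imports(SAMPLE))  # ['SequenceToSequenceDataset']
```

Here `ABC` counts as used because it is read in the fallback assignment, while `SequenceToSequenceDataset` is only ever assigned, never read. The sketch ignores re-exports via `__all__` and names referenced only inside strings, so a real linter would be more careful.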
except ModuleNotFoundError:
    from abc import ABC

    SequenceToSequenceDataset = ABC

Check notice

Code scanning / CodeQL

Unused global variable Note

The global variable 'SequenceToSequenceDataset' is not used.

Copilot Autofix

AI 6 months ago

To fix the issue, we should remove the assignment to SequenceToSequenceDataset in the except block. Since the variable is unused and does not have any side effects, deleting the assignment will clean up the code without affecting functionality. If the variable is intended for future use or documentation purposes, we should rename it to indicate that it is unused (e.g., _unused_SequenceToSequenceDataset).

Suggested changeset 1
nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py

Autofix patch

Run the following command in your local git repository to apply this patch
cat << 'EOF' | git apply
diff --git a/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py b/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
--- a/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
+++ b/nemo/collections/nlp/models/language_modeling/megatron_retro_fine_tune_model.py
@@ -33,4 +33,2 @@
 
-    SequenceToSequenceDataset = ABC
-
 from nemo.collections.nlp.data.language_modeling.megatron.base_dataset_utils import (
EOF
Copilot is powered by AI and may make mistakes. Always verify output.
Signed-off-by: dimapihtar <[email protected]>
Signed-off-by: dimapihtar <[email protected]>
@github-actions
Contributor

github-actions bot commented Jul 9, 2025

[🤖]: Hi @dimapihtar 👋,

We wanted to let you know that a CICD pipeline for this PR just finished successfully.

So it might be time to merge this PR or get some approvals.

//cc @chtruong814 @ko3n1g @pablo-garay @thomasdhc

@dimapihtar dimapihtar requested a review from yaoyu-33 July 10, 2025 15:30
@dimapihtar dimapihtar marked this pull request as ready for review July 10, 2025 16:53
@dimapihtar dimapihtar merged commit 351206f into main Jul 10, 2025
248 checks passed
@dimapihtar dimapihtar deleted the dpykhtar/remove_nlp_coll branch July 10, 2025 16:53
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Jul 23, 2025
* remove rag collection

Signed-off-by: dimapihtar <[email protected]>

* remove data/common

Signed-off-by: dimapihtar <[email protected]>

* Apply isort and black reformatting

Signed-off-by: dimapihtar <[email protected]>

* fix importso

Signed-off-by: dimapihtar <[email protected]>

* fix imports

Signed-off-by: dimapihtar <[email protected]>

---------

Signed-off-by: dimapihtar <[email protected]>
Signed-off-by: dimapihtar <[email protected]>
Co-authored-by: dimapihtar <[email protected]>
Signed-off-by: Amir Hussein <[email protected]>
monica-sekoyan pushed a commit that referenced this pull request Aug 4, 2025
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
AmirHussein96 pushed a commit to AmirHussein96/NeMo that referenced this pull request Aug 5, 2025
nasretdinovr pushed a commit to nasretdinovr/NeMo that referenced this pull request Aug 8, 2025
guyueh1 pushed a commit to guyueh1/NeMo that referenced this pull request Aug 25, 2025
Signed-off-by: Guyue Huang <[email protected]>
3 participants