Fix POOL_SHORT_READS receiving reads in wrong roder resulting in faulty pooling #894

jfy133 · 2025-10-25T06:05:12Z

Closes #890

Essentially the input tuple for pooling was in the wrong format meaning only R1s were being pooled and out of order (i.e, what was meant to be a samples R2, was the second samples R1)
This was missed as the --coassembly_group parameter was missed out in the new config structures

TODO:

Run tests for all other configs to make sure nothing else changed
Regenerate snapshot for test_alternative now coassembly activated

PR checklist

github-actions · 2025-10-25T06:07:56Z

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

Posted for pipeline commit bb20e9a

+| ✅ 379 tests passed       |+
#| ❔   1 tests were ignored |#
!| ❗   6 tests had warnings |!

Details

❗ Test warnings:

pipeline_todos - TODO string in main.nf: Remove this line if you don't need a FASTA file [TODO: try and test using for --host_fasta and --host_genome]
pipeline_todos - TODO string in methods_description_template.yml: #Update the HTML below to your preferred methods description, e.g. add publication citation for this pipeline
pipeline_todos - TODO string in main.nf: Optionally add in-text citation tools to this list.
pipeline_todos - TODO string in main.nf: Optionally add bibliographic entries to this list.
pipeline_todos - TODO string in main.nf: Only uncomment below if logic in toolCitationText/toolBibliographyText has been filled!
pipeline_todos - TODO string in nextflow.config: Specify any additional parameters here

❔ Tests ignored:

files_unchanged - File ignored due to lint config: .github/PULL_REQUEST_TEMPLATE.md

✅ Tests passed:

files_exist - File found: .gitattributes
files_exist - File found: .gitignore
files_exist - File found: .nf-core.yml
files_exist - File found: .prettierignore
files_exist - File found: .prettierrc.yml
files_exist - File found: CHANGELOG.md
files_exist - File found: CITATIONS.md
files_exist - File found: CODE_OF_CONDUCT.md
files_exist - File found: LICENSE or LICENSE.md or LICENCE or LICENCE.md
files_exist - File found: nextflow_schema.json
files_exist - File found: nextflow.config
files_exist - File found: README.md
files_exist - File found: .github/.dockstore.yml
files_exist - File found: .github/CONTRIBUTING.md
files_exist - File found: .github/ISSUE_TEMPLATE/bug_report.yml
files_exist - File found: .github/ISSUE_TEMPLATE/config.yml
files_exist - File found: .github/ISSUE_TEMPLATE/feature_request.yml
files_exist - File found: .github/PULL_REQUEST_TEMPLATE.md
files_exist - File found: .github/workflows/branch.yml
files_exist - File found: .github/workflows/nf-test.yml
files_exist - File found: .github/actions/get-shards/action.yml
files_exist - File found: .github/actions/nf-test/action.yml
files_exist - File found: .github/workflows/linting_comment.yml
files_exist - File found: .github/workflows/linting.yml
files_exist - File found: assets/email_template.html
files_exist - File found: assets/email_template.txt
files_exist - File found: assets/sendmail_template.txt
files_exist - File found: assets/nf-core-mag_logo_light.png
files_exist - File found: conf/modules.config
files_exist - File found: conf/test.config
files_exist - File found: conf/test_full.config
files_exist - File found: docs/images/nf-core-mag_logo_light.png
files_exist - File found: docs/images/nf-core-mag_logo_dark.png
files_exist - File found: docs/output.md
files_exist - File found: docs/README.md
files_exist - File found: docs/README.md
files_exist - File found: docs/usage.md
files_exist - File found: nf-test.config
files_exist - File found: tests/default.nf.test
files_exist - File found: main.nf
files_exist - File found: assets/multiqc_config.yml
files_exist - File found: conf/base.config
files_exist - File found: conf/igenomes.config
files_exist - File found: conf/igenomes_ignored.config
files_exist - File found: .github/workflows/awstest.yml
files_exist - File found: .github/workflows/awsfulltest.yml
files_exist - File found: modules.json
files_exist - File found: ro-crate-metadata.json
files_exist - File not found check: .github/ISSUE_TEMPLATE/bug_report.md
files_exist - File not found check: .github/ISSUE_TEMPLATE/feature_request.md
files_exist - File not found check: .github/workflows/push_dockerhub.yml
files_exist - File not found check: .markdownlint.yml
files_exist - File not found check: .nf-core.yaml
files_exist - File not found check: .yamllint.yml
files_exist - File not found check: bin/markdown_to_html.r
files_exist - File not found check: conf/aws.config
files_exist - File not found check: docs/images/nf-core-mag_logo.png
files_exist - File not found check: lib/Checks.groovy
files_exist - File not found check: lib/Completion.groovy
files_exist - File not found check: lib/NfcoreTemplate.groovy
files_exist - File not found check: lib/Utils.groovy
files_exist - File not found check: lib/Workflow.groovy
files_exist - File not found check: lib/WorkflowMain.groovy
files_exist - File not found check: lib/WorkflowMag.groovy
files_exist - File not found check: parameters.settings.json
files_exist - File not found check: pipeline_template.yml
files_exist - File not found check: Singularity
files_exist - File not found check: lib/nfcore_external_java_deps.jar
files_exist - File not found check: .travis.yml
nextflow_config - Found nf-schema plugin
nextflow_config - Config variable found: manifest.name
nextflow_config - Config variable found: manifest.nextflowVersion
nextflow_config - Config variable found: manifest.description
nextflow_config - Config variable found: manifest.version
nextflow_config - Config variable found: manifest.homePage
nextflow_config - Config variable found: timeline.enabled
nextflow_config - Config variable found: trace.enabled
nextflow_config - Config variable found: report.enabled
nextflow_config - Config variable found: dag.enabled
nextflow_config - Config variable found: process.cpus
nextflow_config - Config variable found: process.memory
nextflow_config - Config variable found: process.time
nextflow_config - Config variable found: params.outdir
nextflow_config - Config variable found: params.input
nextflow_config - Config variable found: manifest.mainScript
nextflow_config - Config variable found: timeline.file
nextflow_config - Config variable found: trace.file
nextflow_config - Config variable found: report.file
nextflow_config - Config variable found: dag.file
nextflow_config - Config variable (correctly) not found: params.nf_required_version
nextflow_config - Config variable (correctly) not found: params.container
nextflow_config - Config variable (correctly) not found: params.singleEnd
nextflow_config - Config variable (correctly) not found: params.igenomesIgnore
nextflow_config - Config variable (correctly) not found: params.name
nextflow_config - Config variable (correctly) not found: params.enable_conda
nextflow_config - Config variable (correctly) not found: params.max_cpus
nextflow_config - Config variable (correctly) not found: params.max_memory
nextflow_config - Config variable (correctly) not found: params.max_time
nextflow_config - Config variable (correctly) not found: params.validationFailUnrecognisedParams
nextflow_config - Config variable (correctly) not found: params.validationLenientMode
nextflow_config - Config variable (correctly) not found: params.validationSchemaIgnoreParams
nextflow_config - Config variable (correctly) not found: params.validationShowHiddenParams
nextflow_config - Config variable (correctly) not found: validation.failUnrecognisedParams
nextflow_config - Config variable (correctly) not found: validation.failUnrecognisedHeaders
nextflow_config - Config timeline.enabled had correct value: true
nextflow_config - Config report.enabled had correct value: true
nextflow_config - Config trace.enabled had correct value: true
nextflow_config - Config dag.enabled had correct value: true
nextflow_config - Config manifest.name began with nf-core/
nextflow_config - Config variable manifest.homePage began with https://github.com/nf-core/
nextflow_config - Config dag.file ended with .html
nextflow_config - Config variable manifest.nextflowVersion started with >= or !>=
nextflow_config - Config manifest.version ends in dev: 5.3.0dev
nextflow_config - Config params.custom_config_version is set to master
nextflow_config - Config params.custom_config_base is set to https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Lines for loading custom profiles found
nextflow_config - nextflow.config contains configuration profile test
nextflow_config - Config default value correct: params.igenomes_base= s3://ngi-igenomes/igenomes/
nextflow_config - Config default value correct: params.custom_config_version= master
nextflow_config - Config default value correct: params.custom_config_base= https://raw.githubusercontent.com/nf-core/configs/master
nextflow_config - Config default value correct: params.publish_dir_mode= copy
nextflow_config - Config default value correct: params.max_multiqc_email_size= 25.MB
nextflow_config - Config default value correct: params.validate_params= true
nextflow_config - Config default value correct: params.pipelines_testdata_base_path= https://raw.githubusercontent.com/nf-core/test-datasets/
nextflow_config - Config default value correct: params.spades_fix_cpus= -1
nextflow_config - Config default value correct: params.spadeshybrid_fix_cpus= -1
nextflow_config - Config default value correct: params.metabat_rng_seed= 1
nextflow_config - Config default value correct: params.clip_tool= fastp
nextflow_config - Config default value correct: params.reads_minlength= 15
nextflow_config - Config default value correct: params.fastp_qualified_quality= 15
nextflow_config - Config default value correct: params.fastp_cut_mean_quality= 15
nextflow_config - Config default value correct: params.adapterremoval_minquality= 2
nextflow_config - Config default value correct: params.adapterremoval_adapter1= AGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNATCTCGTATGCCGTCTTCTGCTTG
nextflow_config - Config default value correct: params.adapterremoval_adapter2= AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGTAGATCTCGGTGGTCGCCGTATCATT
nextflow_config - Config default value correct: params.bbnorm_target= 100
nextflow_config - Config default value correct: params.bbnorm_min= 5
nextflow_config - Config default value correct: params.longreads_min_length= 1000
nextflow_config - Config default value correct: params.longreads_keep_percent= 90
nextflow_config - Config default value correct: params.longreads_length_weight= 10
nextflow_config - Config default value correct: params.longread_adaptertrimming_tool= porechop_abi
nextflow_config - Config default value correct: params.longread_filtering_tool= filtlong
nextflow_config - Config default value correct: params.gtdb_db= https://data.gtdb.aau.ecogenomic.org/releases/release226/226.0/auxillary_files/gtdbtk_package/full_package/gtdbtk_r226_data.tar.gz
nextflow_config - Config default value correct: params.gtdbtk_min_completeness= 50.0
nextflow_config - Config default value correct: params.gtdbtk_max_contamination= 10.0
nextflow_config - Config default value correct: params.gtdbtk_min_perc_aa= 10.0
nextflow_config - Config default value correct: params.gtdbtk_min_af= 0.65
nextflow_config - Config default value correct: params.gtdbtk_pplacer_cpus= 1
nextflow_config - Config default value correct: params.spades_downstreaminput= scaffolds
nextflow_config - Config default value correct: params.genomad_min_score= 0.7
nextflow_config - Config default value correct: params.genomad_splits= 1
nextflow_config - Config default value correct: params.binning_map_mode= group
nextflow_config - Config default value correct: params.bin_metabinner_scale= large
nextflow_config - Config default value correct: params.min_contig_size= 1500
nextflow_config - Config default value correct: params.min_length_unbinned_contigs= 1000000
nextflow_config - Config default value correct: params.max_unbinned_contigs= 100
nextflow_config - Config default value correct: params.bin_min_size= 0
nextflow_config - Config default value correct: params.bin_concoct_chunksize= 10000
nextflow_config - Config default value correct: params.bin_concoct_overlap= 0
nextflow_config - Config default value correct: params.bin_domain_classification_tool= tiara
nextflow_config - Config default value correct: params.tiara_min_length= 3000
nextflow_config - Config default value correct: params.busco_db_lineage= auto
nextflow_config - Config default value correct: params.checkm_download_url= https://zenodo.org/records/7401545/files/checkm_data_2015_01_16.tar.gz
nextflow_config - Config default value correct: params.checkm2_db_version= 14897628
nextflow_config - Config default value correct: params.refine_bins_dastool_threshold= 0.5
nextflow_config - Config default value correct: params.postbinning_input= raw_bins_only
nextflow_config - Config default value correct: params.gunc_database_type= progenomes
nextflow_config - Config default value correct: params.pydamage_accuracy= 0.5
nextflow_config - Config default value correct: params.freebayes_ploidy= 1
nextflow_config - Config default value correct: params.freebayes_min_basequality= 20
nextflow_config - Config default value correct: params.freebayes_minallelefreq= 0.33
nextflow_config - Config default value correct: params.bcftools_view_high_variant_quality= 30
nextflow_config - Config default value correct: params.bcftools_view_medium_variant_quality= 20
nextflow_config - Config default value correct: params.bcftools_view_minimal_allelesupport= 3
nf_test_content - 'tests/test_single_end.nf.test' contains outdir parameter
nf_test_content - 'tests/test_single_end.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_single_end.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/default.nf.test' contains outdir parameter
nf_test_content - 'tests/default.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/default.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_hybrid.nf.test' contains outdir parameter
nf_test_content - 'tests/test_hybrid.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_hybrid.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_minimal.nf.test' contains outdir parameter
nf_test_content - 'tests/test_minimal.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_minimal.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_assembly_input.nf.test' contains outdir parameter
nf_test_content - 'tests/test_assembly_input.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_longreadonly.nf.test' contains outdir parameter
nf_test_content - 'tests/test_longreadonly.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_longreadonly.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_longreadonly_alternatives.nf.test' contains outdir parameter
nf_test_content - 'tests/test_longreadonly_alternatives.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_longreadonly_alternatives.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/test_alternatives.nf.test' contains outdir parameter
nf_test_content - 'tests/test_alternatives.nf.test' snapshots a 'versions.yml' file
nf_test_content - 'tests/nextflow.config' contains modules_testdata_base_path
nf_test_content - 'tests/nextflow.config' contains pipelines_testdata_base_path
nf_test_content - 'nf-test.config' sets a testsDir
nf_test_content - 'nf-test.config' sets a workDir
nf_test_content - 'nf-test.config' sets a configFile
files_unchanged - .gitattributes matches the template
files_unchanged - .prettierrc.yml matches the template
files_unchanged - CODE_OF_CONDUCT.md matches the template
files_unchanged - LICENSE matches the template
files_unchanged - .github/.dockstore.yml matches the template
files_unchanged - .github/CONTRIBUTING.md matches the template
files_unchanged - .github/ISSUE_TEMPLATE/bug_report.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/config.yml matches the template
files_unchanged - .github/ISSUE_TEMPLATE/feature_request.yml matches the template
files_unchanged - .github/workflows/branch.yml matches the template
files_unchanged - .github/workflows/linting_comment.yml matches the template
files_unchanged - .github/workflows/linting.yml matches the template
files_unchanged - assets/email_template.html matches the template
files_unchanged - assets/email_template.txt matches the template
files_unchanged - assets/sendmail_template.txt matches the template
files_unchanged - assets/nf-core-mag_logo_light.png matches the template
files_unchanged - docs/images/nf-core-mag_logo_light.png matches the template
files_unchanged - docs/images/nf-core-mag_logo_dark.png matches the template
files_unchanged - docs/README.md matches the template
files_unchanged - .gitignore matches the template
files_unchanged - .prettierignore matches the template
actions_nf_test - '.github/workflows/nf-test.yml' is triggered on expected events
actions_nf_test - '.github/workflows/nf-test.yml' checks minimum NF version
actions_awstest - '.github/workflows/awstest.yml' is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml is triggered correctly
actions_awsfulltest - .github/workflows/awsfulltest.yml does not use -profile test
readme - README Nextflow minimum version badge matched config. Badge: 25.04.2, Config: 25.04.2
readme - README nf-core template version badge found.
readme - README Zenodo placeholder was replaced with DOI.
pipeline_if_empty_null - No ifEmpty(null) strings found
plugin_includes - No wrong validation plugin imports have been found
pipeline_name_conventions - Name adheres to nf-core convention
template_strings - Did not find any Jinja template strings (0 files)
schema_lint - Schema lint passed
schema_lint - Schema title + description lint passed
schema_lint - Input mimetype lint passed: 'text/csv'
schema_params - Schema matched params returned from nextflow config
system_exit - No System.exit calls found
actions_schema_validation - Workflow validation passed: linting.yml
actions_schema_validation - Workflow validation passed: download_pipeline.yml
actions_schema_validation - Workflow validation passed: template-version-comment.yml
actions_schema_validation - Workflow validation passed: linting_comment.yml
actions_schema_validation - Workflow validation passed: release-announcements.yml
actions_schema_validation - Workflow validation passed: fix_linting.yml
actions_schema_validation - Workflow validation passed: nf-test.yml
actions_schema_validation - Workflow validation passed: awsfulltest.yml
actions_schema_validation - Workflow validation passed: branch.yml
actions_schema_validation - Workflow validation passed: awstest.yml
actions_schema_validation - Workflow validation passed: clean-up.yml
merge_markers - No merge markers found in pipeline files
modules_json - Only installed modules found in modules.json
multiqc_config - assets/multiqc_config.yml found and not ignored.
multiqc_config - assets/multiqc_config.yml contains report_section_order
multiqc_config - assets/multiqc_config.yml contains export_plots
multiqc_config - assets/multiqc_config.yml contains report_comment
multiqc_config - assets/multiqc_config.yml follows the ordering scheme of the minimally required plugins.
multiqc_config - assets/multiqc_config.yml contains a matching 'report_comment'.
multiqc_config - assets/multiqc_config.yml contains 'export_plots: true'.
modules_structure - modules directory structure is correct 'modules/nf-core/TOOL/SUBTOOL'
local_component_structure - local subworkflows directory structure is correct 'subworkflows/local/TOOL/SUBTOOL'
base_config - conf/base.config found and not ignored.
base_config - BOWTIE2_HOST_REMOVAL_BUILD found in conf/base.config and Nextflow scripts.
base_config - BOWTIE2_HOST_REMOVAL_ALIGN found in conf/base.config and Nextflow scripts.
base_config - BOWTIE2_PHIX_REMOVAL_ALIGN found in conf/base.config and Nextflow scripts.
base_config - PORECHOP_PORECHOP found in conf/base.config and Nextflow scripts.
base_config - NANOLYSE found in conf/base.config and Nextflow scripts.
base_config - FILTLONG found in conf/base.config and Nextflow scripts.
base_config - CATPACK_BINS found in conf/base.config and Nextflow scripts.
base_config - CATPACK_CONTIGS found in conf/base.config and Nextflow scripts.
base_config - GTDBTK_CLASSIFYWF found in conf/base.config and Nextflow scripts.
base_config - MEGAHIT found in conf/base.config and Nextflow scripts.
base_config - METASPADES found in conf/base.config and Nextflow scripts.
base_config - METASPADESHYBRID found in conf/base.config and Nextflow scripts.
base_config - METAMDBG_ASM found in conf/base.config and Nextflow scripts.
base_config - FLYE found in conf/base.config and Nextflow scripts.
base_config - BOWTIE2_ASSEMBLY_ALIGN found in conf/base.config and Nextflow scripts.
base_config - METABAT2_METABAT2 found in conf/base.config and Nextflow scripts.
base_config - MAG_DEPTHS found in conf/base.config and Nextflow scripts.
base_config - MAG_DEPTHS_PLOT found in conf/base.config and Nextflow scripts.
base_config - BUSCO_BUSCO found in conf/base.config and Nextflow scripts.
base_config - MAXBIN2 found in conf/base.config and Nextflow scripts.
base_config - COMEBIN_RUNCOMEBIN found in conf/base.config and Nextflow scripts.
base_config - METABINNER_METABINNER found in conf/base.config and Nextflow scripts.
base_config - DASTOOL_DASTOOL found in conf/base.config and Nextflow scripts.
base_config - CHECKM_LINEAGEWF found in conf/base.config and Nextflow scripts.
base_config - CHECKM2_PREDICT found in conf/base.config and Nextflow scripts.
modules_config - conf/modules.config found and not ignored.
modules_config - FASTQC_RAW found in conf/modules.config and Nextflow scripts.
modules_config - FASTP found in conf/modules.config and Nextflow scripts.
modules_config - TRIMMOMATIC found in conf/modules.config and Nextflow scripts.
modules_config - ADAPTERREMOVAL_PE found in conf/modules.config and Nextflow scripts.
modules_config - ADAPTERREMOVAL_SE found in conf/modules.config and Nextflow scripts.
modules_config - BOWTIE2_PHIX_REMOVAL_ALIGN found in conf/modules.config and Nextflow scripts.
modules_config - BOWTIE2_HOST_REMOVAL_ALIGN found in conf/modules.config and Nextflow scripts.
modules_config - FASTQC_TRIMMED found in conf/modules.config and Nextflow scripts.
modules_config - BBMAP_BBNORM found in conf/modules.config and Nextflow scripts.
modules_config - PORECHOP_PORECHOP found in conf/modules.config and Nextflow scripts.
modules_config - PORECHOP_ABI found in conf/modules.config and Nextflow scripts.
modules_config - FILTLONG found in conf/modules.config and Nextflow scripts.
modules_config - NANOQ found in conf/modules.config and Nextflow scripts.
modules_config - NANOLYSE found in conf/modules.config and Nextflow scripts.
modules_config - CHOPPER found in conf/modules.config and Nextflow scripts.
modules_config - NANOPLOT_RAW found in conf/modules.config and Nextflow scripts.
modules_config - NANOPLOT_FILTERED found in conf/modules.config and Nextflow scripts.
modules_config - MINIMAP2_HOST_INDEX found in conf/modules.config and Nextflow scripts.
modules_config - MINIMAP2_HOST_ALIGN found in conf/modules.config and Nextflow scripts.
modules_config - MINIMAP2_ASSEMBLY_ALIGN found in conf/modules.config and Nextflow scripts.
modules_config - SAMTOOLS_HOSTREMOVED_UNMAPPED found in conf/modules.config and Nextflow scripts.
modules_config - SAMTOOLS_HOSTREMOVED_STATS found in conf/modules.config and Nextflow scripts.
modules_config - MEGAHIT found in conf/modules.config and Nextflow scripts.
modules_config - METASPADES found in conf/modules.config and Nextflow scripts.
modules_config - METASPADESHYBRID found in conf/modules.config and Nextflow scripts.
modules_config - FLYE found in conf/modules.config and Nextflow scripts.
modules_config - METAMDBG_ASM found in conf/modules.config and Nextflow scripts.
modules_config - QUAST found in conf/modules.config and Nextflow scripts.
modules_config - QUAST_BINS found in conf/modules.config and Nextflow scripts.
modules_config - GENOMAD_ENDTOEND found in conf/modules.config and Nextflow scripts.
modules_config - BOWTIE2_ASSEMBLY_ALIGN found in conf/modules.config and Nextflow scripts.
modules_config - MAG_DEPTHS_PLOT found in conf/modules.config and Nextflow scripts.
modules_config - BIN_SUMMARY found in conf/modules.config and Nextflow scripts.
modules_config - BUSCO_UNTAR found in conf/modules.config and Nextflow scripts.
modules_config - BUSCO_BUSCO found in conf/modules.config and Nextflow scripts.
modules_config - CHECKM_UNTAR found in conf/modules.config and Nextflow scripts.
modules_config - CHECKM_LINEAGEWF found in conf/modules.config and Nextflow scripts.
modules_config - CHECKM_QA found in conf/modules.config and Nextflow scripts.
modules_config - CONCAT_BUSCO_TSV found in conf/modules.config and Nextflow scripts.
modules_config - CHECKM2_DATABASEDOWNLOAD found in conf/modules.config and Nextflow scripts.
modules_config - CHECKM2_PREDICT found in conf/modules.config and Nextflow scripts.
modules_config - GUNC_DOWNLOADDB found in conf/modules.config and Nextflow scripts.
modules_config - GUNC_RUN found in conf/modules.config and Nextflow scripts.
modules_config - GUNC_MERGECHECKM found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_PREPARE found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_BINS found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_ADDNAMES_BINS found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_SUMMARISE_BINS found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_UNBINS found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_ADDNAMES_UNBINS found in conf/modules.config and Nextflow scripts.
modules_config - CATPACK_SUMMARISE_UNBINS found in conf/modules.config and Nextflow scripts.
modules_config - GTDBTK_CLASSIFYWF found in conf/modules.config and Nextflow scripts.
modules_config - GTDBTK_SUMMARY found in conf/modules.config and Nextflow scripts.
modules_config - PROKKA found in conf/modules.config and Nextflow scripts.
modules_config - PRODIGAL found in conf/modules.config and Nextflow scripts.
modules_config - FREEBAYES found in conf/modules.config and Nextflow scripts.
modules_config - BCFTOOLS_VIEW found in conf/modules.config and Nextflow scripts.
modules_config - BCFTOOLS_CONSENSUS found in conf/modules.config and Nextflow scripts.
modules_config - BCFTOOLS_INDEX found in conf/modules.config and Nextflow scripts.
modules_config - PYDAMAGE_ANALYZE found in conf/modules.config and Nextflow scripts.
modules_config - PYDAMAGE_FILTER found in conf/modules.config and Nextflow scripts.
modules_config - SAMTOOLS_FAIDX found in conf/modules.config and Nextflow scripts.
modules_config - METABAT2_JGISUMMARIZEBAMCONTIGDEPTHS_SHORTREAD found in conf/modules.config and Nextflow scripts.
modules_config - METABAT2_JGISUMMARIZEBAMCONTIGDEPTHS_LONGREAD found in conf/modules.config and Nextflow scripts.
modules_config - METABAT2_METABAT2 found in conf/modules.config and Nextflow scripts.
modules_config - MAXBIN2 found in conf/modules.config and Nextflow scripts.
modules_config - ADJUST_MAXBIN2_EXT found in conf/modules.config and Nextflow scripts.
modules_config - CONCOCT_CUTUPFASTA found in conf/modules.config and Nextflow scripts.
modules_config - CONCOCT_ found in conf/modules.config and Nextflow scripts.
modules_config - COMEBIN_RUNCOMEBIN found in conf/modules.config and Nextflow scripts.
modules_config - METABINNER_KMER found in conf/modules.config and Nextflow scripts.
modules_config - METABINNER_TOOSHORT found in conf/modules.config and Nextflow scripts.
modules_config - METABINNER_METABINNER found in conf/modules.config and Nextflow scripts.
modules_config - METABINNER_BINS found in conf/modules.config and Nextflow scripts.
modules_config - SEQKIT_STATS found in conf/modules.config and Nextflow scripts.
modules_config - SPLIT_FASTA found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_FASTATOCONTIG2BIN_METABAT2 found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_FASTATOCONTIG2BIN_MAXBIN2 found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_FASTATOCONTIG2BIN_CONCOCT found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_FASTATOCONTIG2BIN_COMEBIN found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_FASTATOCONTIG2BIN_TIARA found in conf/modules.config and Nextflow scripts.
modules_config - DASTOOL_DASTOOL found in conf/modules.config and Nextflow scripts.
modules_config - RENAME_POSTDASTOOL found in conf/modules.config and Nextflow scripts.
modules_config - TIARA_TIARA found in conf/modules.config and Nextflow scripts.
modules_config - TIARA_CLASSIFY found in conf/modules.config and Nextflow scripts.
modules_config - TIARA_SUMMARY found in conf/modules.config and Nextflow scripts.
modules_config - MMSEQS_DATABASES found in conf/modules.config and Nextflow scripts.
modules_config - METAEUK_EASYPREDICT found in conf/modules.config and Nextflow scripts.
modules_config - MULTIQC found in conf/modules.config and Nextflow scripts.
nfcore_yml - Repository type in .nf-core.yml is valid: pipeline
nfcore_yml - nf-core version in .nf-core.yml is set to the latest version: 3.4.1
rocrate_readme_sync - RO-Crate descriptions are in sync with README.md.

Run details

nf-core/tools version 3.4.1
Run at 2025-11-17 12:43:26

jfy133 · 2025-10-25T06:16:52Z

@nf-core-bot fix linting

…es-contain-unequal-number-of-reads

prototaxites · 2025-10-28T09:36:10Z

subworkflows/local/assembly/main.nf

+        // We have to merge reads together to match tuple structure of POOL_SHORT_READS/
+        // This MUST be in a interleaved structure (s1_r1, s1_r2, s2_r1, s2_r2, ...)
+        // So we merge the two list of R1 and R2s, and sort them to ensure correct order above
+        ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1 + reads2].flatten().sort()] }


Can we assume that the reads files here are standardly-named such that a sort() won't break the order?

Yes, I think we can assume that because all those files are renamed for ${prefix} at that point, imho. Or is it possible to skip the complete QC so that original files names come through? Not entirely sure...

I do think it's possible to basically completely skip QC...

How likely do you think it would be that people don't have a _R1 / _R2, _1 / _2, _F, _R in their FASTQ files?

I would be wary of assuming anything about file names unless we have strictly controlled it. One way to do that would be also to force a schema like the above in the samplesheet validation, so we stop early before errors.

Otherwise we have to be careful with channel order, etc.?

Typical Illumina output from the sequencing facilities & companies I know is <sample>_R1_<lane>.fastq.gz. Single-end read files might not have any of those pattern to identify direction (R1/1F/whatever).
I think that makes it already more complicated to catch? I am not an regex expert though.

@d4straub 's pattern the one of the most common patterns I've seen too... and other people append the adapter index sequence to the end after the lane ID too... so I really don't think this will be simply be solvable.

But for me that isnt needed, potentially we could add a comment (Warning) in the docs about the sorting issue with SPAdes & skipping all QC & file names, maybe to the co-assembly step (https://nf-co.re/mag/5.1.0/docs/usage/#the-group-column?),

I'm erring for this, but I want this to be a democracy.

@dialvarezs any thoughts?

Deleted my previous comment, as I wasn't sure it actually worked, but I think it does:

ch_a = Channel.of(["meta", ["a", "c", "b"], ["d", "b", "f"]]) ch_a.map { meta, f1, f2 -> def transposed_pairs = [f1, f2].transpose() println transposed_pairs def sorted_pairs = transposed_pairs.sort { it[0] } println sorted_pairs def interleaved = sorted_pairs.flatten() return [meta, interleaved] }.view() transposed: [[a, d], [c, b], [b, f]] sorted: [[a, d], [b, f], [c, b]] output: [meta, [a, d, b, f, c, b]]

So we can just sort on fasta1's name, avoiding issues with naming entirely.

I tried that code above and it seems fine to me. It also works with e.g.
ch_a = Channel.of(["meta", ["a_s1_R1_a", "c_s3_R1_c", "b_s2_R1_b"], ["d_s1_R2_d", "b_s3_R2_b", "f_s2_R2_f"]])
that is sorted to
[meta, [a_s1_R1_a, d_s1_R2_d, b_s2_R1_b, f_s2_R2_f, c_s3_R1_c, b_s3_R2_b]]

OK nice thanks for the cross-validation @d4straub ! When I am more functional I will try to implement it!

subworkflows/local/assembly/main.nf

…es-contain-unequal-number-of-reads

…-unequal-number-of-reads' of github.com:nf-core/mag into 890-metaspades-exit-status-21-paired-read-files-contain-unequal-number-of-reads

prototaxites · 2025-10-30T10:27:18Z

subworkflows/local/assembly/main.nf

+        // We have to merge reads together to match tuple structure of POOL_SHORT_READS/
+        // This MUST be in a interleaved structure (s1_r1, s1_r2, s2_r1, s2_r2, ...)
+        // So we merge the two list of R1 and R2s, and sort them to ensure correct order above
+        ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1 + reads2].flatten().sort()] }


Suggested change

ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1 + reads2].flatten().sort()] }

ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1, reads2].transpose().sort { it[0].getName() }.flatten()] }

…es-contain-unequal-number-of-reads

dialvarezs · 2025-11-16T05:31:35Z

This was almost ready TBH. I just implemented the suggestions from @prototaxites and @d4straub in the discussion and verified that everything works as expected. The only change I made was moving the sorting to the grouping step, so the files start sorted from there. Then, as .transpose() keeps the order, just flattening the result is enough to get the interleaved reads for pooling.

d4straub

I think that looks fine, but the .tranpose().sort { it[0] }.flatten() is a little less obvious now (which is probably fine). @prototaxites came up with that fix, so may he judge ;)

conf/test_alternatives.config

CHANGELOG.md

d4straub · 2025-11-17T06:54:56Z

subworkflows/local/assembly/main.nf

+        // We have to merge reads together to match tuple structure of POOL_SHORT_READS/
+        // This MUST be in a interleaved structure (s1_r1, s1_r2, s2_r1, s2_r2, ...)
+        // So we merge the two list of R1 and R2s, and sort them to ensure correct order above


but that sorting isnt done here now?

Sure, I will update that line.
In fact, the sorting step isn’t required for this to work, because since we preserve the tuple order, the r1 and r2 pairs will be in the same order. I added it anyway for the sake of result stability (to get snapshots working, I have been adding sorting steps basically everywhere).

Co-authored-by: Daniel Straub <[email protected]>

subworkflows/local/assembly/main.nf

jfy133 · 2025-11-17T12:38:09Z

I'm ok with this, but can't approve my own PR @d4straub could you give a ✔️ if you're happy, and ofc @prototaxites !

prototaxites

LGTM!

jfy133 · 2025-11-17T13:54:49Z

Let's go! @dialvarezs thanks for wrapping this up 🙏

I suggest we wait one more week to see if any more last minute bug fixes come up, otherwise let's get this out in a release!

dialvarezs · 2025-11-17T13:58:50Z

I suggest we wait one more week to see if any more last minute bug fixes come up, otherwise let's get this out in a release!

So we can continue with our flood of bi-weekly releases 😂

jfy133 · 2025-11-17T15:12:00Z

I suggest we wait one more week to see if any more last minute bug fixes come up, otherwise let's get this out in a release!

So we can continue with our flood of bi-weekly releases 😂

Exactly

jfy133 added 2 commits October 24, 2025 16:53

Make sure to actually test coassembly

bdf5ac3

Fix input to pooling step to combine reads correctly

f73c635

jfy133 requested review from d4straub, dialvarezs, muabnezor and prototaxites as code owners October 25, 2025 06:05

jfy133 marked this pull request as draft October 25, 2025 06:05

Document fix

dac9ccd

nf-core-bot and others added 3 commits October 25, 2025 06:17

[automated] Fix code linting

24a456b

Merge branch 'dev' into 890-metaspades-exit-status-21-paired-read-fil…

b033632

…es-contain-unequal-number-of-reads

Correct version in changelog

12f2b25

prototaxites reviewed Oct 28, 2025

View reviewed changes

d4straub reviewed Oct 28, 2025

View reviewed changes

subworkflows/local/assembly/main.nf Show resolved Hide resolved

jfy133 added 3 commits October 28, 2025 17:48

Merge branch 'dev' into 890-metaspades-exit-status-21-paired-read-fil…

0b327d6

…es-contain-unequal-number-of-reads

Update snapshot to represent pooled (i.e. coassembled) data

3cfcb59

Merge branch '890-metaspades-exit-status-21-paired-read-files-contain…

66b1301

…-unequal-number-of-reads' of github.com:nf-core/mag into 890-metaspades-exit-status-21-paired-read-files-contain-unequal-number-of-reads

jfy133 marked this pull request as ready for review October 29, 2025 06:20

prototaxites reviewed Oct 30, 2025

View reviewed changes

d4straub mentioned this pull request Nov 5, 2025

Version bump for v5.2.0 release [Puce Pangolin] #913

Merged

11 tasks

dialvarezs added 3 commits November 16, 2025 01:31

Merge branch 'dev' into 890-metaspades-exit-status-21-paired-read-fil…

13fb75b

…es-contain-unequal-number-of-reads

Update snapshot

4965ca7

Finish fixing of pooling reads

515f422

d4straub reviewed Nov 17, 2025

View reviewed changes

Update CHANGELOG.md

71af6b1

Co-authored-by: Daniel Straub <[email protected]>

jfy133 commented Nov 17, 2025

View reviewed changes

subworkflows/local/assembly/main.nf Show resolved Hide resolved

jfy133 requested review from d4straub and prototaxites November 17, 2025 12:37

Update comment

bb20e9a

prototaxites approved these changes Nov 17, 2025

View reviewed changes

jfy133 merged commit 052e43c into dev Nov 17, 2025
22 checks passed

jfy133 deleted the 890-metaspades-exit-status-21-paired-read-files-contain-unequal-number-of-reads branch November 17, 2025 13:54

This was referenced Nov 27, 2025

METASPADES exit status 21: paired read files contain unequal number of reads #890

Closed

Release: v5.3.0 Rainbow Rattlesnake #948

Merged

	ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1 + reads2].flatten().sort()] }
	ch_short_reads_grouped_for_pooling = ch_short_reads_grouped.map { meta, reads1, reads2 -> [meta, [reads1, reads2].transpose().sort { it[0].getName() }.flatten()] }

Fix POOL_SHORT_READS receiving reads in wrong roder resulting in faulty pooling #894

Fix POOL_SHORT_READS receiving reads in wrong roder resulting in faulty pooling #894

Uh oh!

Conversation

jfy133 commented Oct 25, 2025

PR checklist

Uh oh!

github-actions bot commented Oct 25, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

nf-core pipelines lint overall result: Passed ✅ ⚠️

❗ Test warnings:

❔ Tests ignored:

✅ Tests passed:

Run details

Uh oh!

jfy133 commented Oct 25, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

d4straub Oct 28, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dialvarezs commented Nov 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

d4straub left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

dialvarezs Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

jfy133 commented Nov 17, 2025

Uh oh!

prototaxites left a comment

Choose a reason for hiding this comment

Uh oh!

jfy133 commented Nov 17, 2025

Uh oh!

Uh oh!

dialvarezs commented Nov 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jfy133 commented Nov 17, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

github-actions bot commented Oct 25, 2025 •

edited

Loading

`nf-core pipelines lint` overall result: Passed ✅ ⚠️

d4straub Oct 28, 2025 •

edited

Loading

dialvarezs commented Nov 16, 2025 •

edited

Loading

dialvarezs Nov 17, 2025 •

edited

Loading

dialvarezs commented Nov 17, 2025 •

edited

Loading