Add sylph/profile module #7118

sofstam · 2024-11-28T14:48:50Z

PR checklist

Moves the local module sylph/profile to nf-core modules. Closes nf-core/seqinspector#65
Removes unnecessary pattern from sylph/sketch meta file.

modules/nf-core/sylph/profile/meta.yml

famosab

Suggested to be a bit more specific about the created csv :)

famosab · 2024-12-11T12:57:59Z

modules/nf-core/sylph/profile/tests/main.nf.test.snap

+                "versions.yml:md5,7b5a545483277cc0ff9189f8891e737f"
+            ],
+            [
+                false


Is it expected that the contains check always leads to the string not being contained?

The string should be contained. Looking into it.
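
A likely cause, assuming `readLines()` returns a list of lines here: `contains` on a list compares whole elements, so a substring such as "complete genome" only matches if an entire line equals it. A minimal plain-Groovy sketch of the difference (the sample lines are hypothetical placeholders, not real sylph output):

```groovy
// Hypothetical lines standing in for a profile TSV; not real sylph output.
def lines = [
    "Sample_file\tGenome_file\tContig_name",
    "reads.fastq.gz\tgenome.fasta\tEXAMPLE.1 Example virus, complete genome"
]

// List.contains compares whole elements, so a substring never matches a longer line.
assert !lines.contains("complete genome")

// Checking each line for the substring gives the intended result.
assert lines.any { it.contains("complete genome") }
```

Inside the test, the same idea would mean applying `any { ... }` to the result of `readLines()` instead of calling `contains` on the list directly.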

modules/nf-core/sylph/profile/tests/main.nf.test

famosab · 2024-12-11T12:59:27Z

modules/nf-core/sylph/profile/tests/main.nf.test

+                { assert process.success },
+                { assert snapshot(
+                    process.out.versions,
+                    process.out.profile_out.collect { file(it[1]).readLines().contains("complete genome") },


Maybe you can assert the file a bit better using https://github.com/lukfor/nft-csv

Do you mean to count the columns of the output file?

Anything that makes sense. Counting columns or asserting the column header both seem fine to me.

Updated this now.
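
For reference, the two checks discussed here (counting columns and asserting the header) can be sketched in plain Groovy without relying on the nft-csv API. The header string is the one asserted elsewhere in this thread; the data row is a made-up placeholder, and the variable names are illustrative only.

```groovy
// Header as asserted in the module test; the data row is a hypothetical placeholder.
def header = "Sample_file\tGenome_file\tTaxonomic_abundance\tSequence_abundance\tAdjusted_ANI\tEff_cov\tANI_5-95_percentile\tEff_lambda\tLambda_5-95_percentile\tMedian_cov\tMean_cov_geq1\tContainment_ind\tNaive_ANI\tkmers_reassigned\tContig_name"
def dataRow = (1..15).collect { "value${it}" }.join("\t")
def lines = [header, dataRow]

// Split the header on tabs and check the column count and names.
def columns = lines[0].split("\t") as List
assert columns.size() == 15
assert columns.first() == "Sample_file"
assert columns.last() == "Contig_name"

// Every row should have as many fields as the header.
assert lines.every { it.split("\t").size() == columns.size() }
```

Asserting the header and column count keeps the test stable even when abundance values differ slightly between runs.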

modules/nf-core/sylph/sketch/meta.yml

modules/nf-core/sylph/profile/main.nf

SPPearce · 2025-01-08T08:10:56Z

@sofstam, do you need a hand to get this finished off?

sofstam · 2025-01-08T10:52:50Z

@SPPearce Was on holidays, looking at the comments now :)

SPPearce · 2025-01-16T13:02:29Z

modules/nf-core/sylph/profile/tests/main.nf.test

+                { assert snapshot(process.out.versions).match("versions_single") },
+                { assert output_content.size() > 1 },  // Ensure there's at least a header and one data line
+                { assert output_content[0] == "Sample_file\tGenome_file\tTaxonomic_abundance\tSequence_abundance\tAdjusted_ANI\tEff_cov\tANI_5-95_percentile\tEff_lambda\tLambda_5-95_percentile\tMedian_cov\tMean_cov_geq1\tContainment_ind\tNaive_ANI\tkmers_reassigned\tContig_name" },
+                { assert snapshot(output_content.take(5).join("\n")).match("profile_out_content_single") }  // Snapshot first 5 lines


Can probably just do this. You don't need to name the snapshots if you only have one per test.

Suggested change

-                { assert snapshot(process.out.versions).match("versions_single") },
-                { assert output_content.size() > 1 },  // Ensure there's at least a header and one data line
-                { assert output_content[0] == "Sample_file\tGenome_file\tTaxonomic_abundance\tSequence_abundance\tAdjusted_ANI\tEff_cov\tANI_5-95_percentile\tEff_lambda\tLambda_5-95_percentile\tMedian_cov\tMean_cov_geq1\tContainment_ind\tNaive_ANI\tkmers_reassigned\tContig_name" },
-                { assert snapshot(output_content.take(5).join("\n")).match("profile_out_content_single") }  // Snapshot first 5 lines
+                { assert snapshot(
+                    process.out.versions,
+                    output_content.take(5).join("\n")
+                ).match() },
+                { assert output_content.size() > 1 },  // Ensure there's at least a header and one data line
+                { assert output_content[0] == "Sample_file\tGenome_file\tTaxonomic_abundance\tSequence_abundance\tAdjusted_ANI\tEff_cov\tANI_5-95_percentile\tEff_lambda\tLambda_5-95_percentile\tMedian_cov\tMean_cov_geq1\tContainment_ind\tNaive_ANI\tkmers_reassigned\tContig_name" }

Or you could probably do:

Suggested change

-                { assert snapshot(process.out.versions).match("versions_single") },
-                { assert output_content.size() > 1 },  // Ensure there's at least a header and one data line
-                { assert output_content[0] == "Sample_file\tGenome_file\tTaxonomic_abundance\tSequence_abundance\tAdjusted_ANI\tEff_cov\tANI_5-95_percentile\tEff_lambda\tLambda_5-95_percentile\tMedian_cov\tMean_cov_geq1\tContainment_ind\tNaive_ANI\tkmers_reassigned\tContig_name" },
-                { assert snapshot(output_content.take(5).join("\n")).match("profile_out_content_single") }  // Snapshot first 5 lines
+                { assert snapshot(
+                    process.out.versions,
+                    file(output_content).readLines()[0..4]
+                ).match() }
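
One difference between the two suggestions is how they behave when the output has fewer than five lines. The plain-Groovy sketch below uses hypothetical line lists to show that `take(5)` returns whatever is available, while the range index `[0..4]` fails on a shorter list.

```groovy
// Hypothetical line lists standing in for the contents of a profile file.
def shortFile = ["header", "row1", "row2"]
def longFile = ["header", "row1", "row2", "row3", "row4", "row5", "row6"]

// With at least five lines, both forms select the same elements.
assert longFile.take(5) == longFile[0..4]

// take(5) tolerates shorter input and simply returns every line.
assert shortFile.take(5) == shortFile

// A range that runs past the end of the list throws instead.
def rangeFailed = false
try {
    shortFile[0..4]
} catch (IndexOutOfBoundsException e) {
    rangeFailed = true
}
assert rangeFailed
```

Which behaviour is preferable depends on whether a short output file should fail the test loudly or just produce a smaller snapshot.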

SPPearce · 2025-01-16T20:19:52Z

modules/nf-core/sylph/profile/tests/main.nf.test

+            process {
+                """
+                input[0] = [ [ id:'test', single_end:true ], // meta map
+                            [ file(params.test_data['sarscov2']['illumina']['test_1_fastq_gz'], checkIfExists: true) ]


Can you swap these to the more recent file paths, please?
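
Assuming "more recent file paths" refers to the nf-core convention of building test-data paths from `params.modules_testdata_base_path` instead of looking them up in `params.test_data`, the path would be composed roughly as sketched below. The base URL and the relative path are assumptions for illustration; in a real module test the base path comes from the test configuration.

```groovy
// Assumed value for illustration; normally provided by the nf-core test config.
def params = [
    modules_testdata_base_path: 'https://raw.githubusercontent.com/nf-core/test-datasets/modules/data/'
]

// Newer style: base path plus a relative path, replacing the params.test_data lookup.
def reads = params.modules_testdata_base_path + 'genomics/sarscov2/illumina/fastq/test_1.fastq.gz'

assert reads.startsWith('https://')
assert reads.endsWith('genomics/sarscov2/illumina/fastq/test_1.fastq.gz')
```

In the test itself, the composed string would be passed to `file(..., checkIfExists: true)` in place of the `params.test_data[...]` expression.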

modules/nf-core/sylph/profile/tests/main.nf.test

SPPearce · 2025-01-16T20:27:14Z

One of my comments may have been related to some previous code; I can't see the comment on my phone.

modules/nf-core/sylph/profile/main.nf

mahesh-panchal · 2025-01-17T08:37:19Z

modules/nf-core/sylph/profile/meta.yml

+  - - pre_sketched_files:
+        type: file
+        description: Pre-sketched *.syldb/*.sylsp files. Raw single-end fastq/fasta are allowed and will be automatically sketched to .sylsp/.syldb.
+        pattern: "*.{syldb,sylsp}"


Update this pattern too.

mahesh-panchal · 2025-01-17T08:40:41Z

modules/nf-core/sylph/profile/meta.yml

+        description: |
+          List of input FastQ/FASTA files of size 1 and 2 for single-end and paired-end data,
+          respectively. They are automatically sketched to .sylsp/.syldb
+  - - pre_sketched_files:


I would suggest making this variable name more general, given that it could be FASTA. Something like database (since the manual uses the same term too).

mahesh-panchal · 2025-01-17T08:42:30Z

modules/nf-core/sylph/profile/tests/main.nf.test

+            assertAll(
+                { assert process.success },
+                { assert snapshot(process.out.versions).match("versions_single") },
+                { assert output_content.size() > 1 }


Is there not a specific file or contents to test for here?

sofstam and others added 11 commits November 28, 2024 15:45

Add sylph/profile module and remove unnecessary pattern from sylph/sk…

daf9e8d

…etch

Remove description

dfdd3fc

Update snap

a2f1344

Try again with linting

8544ee9

Merge branch 'master' into sylph_profile

1e576e0

Update snap

fa0e2c1

Fix linting for meta.yaml

e45886d

Update tests

2451fcd

Add contains in nf-test

c06f4fb

Add contains

c95a100

Merge branch 'master' into sylph_profile

9cda43a

mahesh-panchal reviewed Dec 5, 2024

famosab requested changes Dec 11, 2024

sofstam and others added 14 commits January 8, 2025 12:43

Merge branch 'master' into sylph_profile

85188bb

Update modules/nf-core/sylph/profile/tests/main.nf.test

6509880

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>

Update modules/nf-core/sylph/profile/main.nf

ecc9e18

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>

Update modules/nf-core/sylph/profile/main.nf

0d5cd9b

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>

Update modules/nf-core/sylph/profile/main.nf

2e1e013

Co-authored-by: Famke Bäuerle <45968370+famosab@users.noreply.github.com>

Update description

114c010

Remove single_end from the output

b752de4

Merge branch 'master' into sylph_profile

9a86542

Update meta.yml file to include info if fastq/fasta files are provided

174b6d5

Add tests

e5da167

Merge branch 'master' into sylph_profile

1cbbc12

Fix test

f7bc9cd

Fix tests

bb4bb66

Merge branch 'master' into sylph_profile

041ef71

sofstam requested a review from mahesh-panchal January 16, 2025 15:30

sofstam requested a review from famosab January 16, 2025 15:30

SPPearce reviewed Jan 16, 2025

mahesh-panchal reviewed Jan 17, 2025

sofstam and others added 3 commits January 17, 2025 10:06

Update modules/nf-core/sylph/profile/tests/main.nf.test

4d96cf3

Co-authored-by: Simon Pearce <24893913+SPPearce@users.noreply.github.com>

Update modules/nf-core/sylph/profile/main.nf

8139064

Co-authored-by: Mahesh Binzer-Panchal <mahesh.binzer-panchal@nbis.se>

Merge branch 'master' into sylph_profile

e924d82
