Skip to content

Update logs

SHANG Jiayu edited this page Dec 26, 2024 · 15 revisions

Dec 26, 2024

Important

Now, the end-to-end task allow to skip the PhaMer(virus identification). If users already have the viral contigs as their inputs, they can run end-to-end task using --skip Y to skip the virus identification However, please noted that the default parameters is --skip N

We also added a log output that tells the user that PhaMer detected no viruses and stopped the following pipelines in the end-to-end task in --skip N condition.

Dec 9, 2024

Version 2.1.9

Important

Add a new column for the PhaGCN so that bacteriophages can be easily identified now. Set a more flexible mode for diamond alignments

Dec 6th, 2024

Version 2.1.8

  • Revised some typos in the name of the outputs and Wiki
  • Allows more sensitive search during protein alignments
    • This may affect the results.
    • On the benchmark test set, this version improved the recall of the overall performance and the precision will not be affected.

Nov 27th, 2024

Version 2.1.7

Important

Adjust the default parameters for the phylogenetic task - marker alignment coverage --mcov: 50 - marker alignment identity --mpident: 25

  • Add message information when the calling program (diamond, blast+, fasttree, etc.) fails.

Nov 21th, 2024

Version 2.1.6

  • Revised some typos in the --help descriptions
  • Host nodes are added to the cherry's network for better visualization

Nov 11th, 2024

Version 2.1.5

Updates:

  • Fixed a bug that --task tree will incorrectly combine DNA and Protein sequences in one file
  • Fixed some typos in the scripts

Nov 6th, 2024

Version 2.1.4

Updates:

  • All the os.system() are replaced by subprocess.run(), providing standard error and return a non-zero exit code in case one of the calls fails.
  • Providing a short parameters for --dbdir (-d) and --outpth (-o)

Nov 4th, 2024

Version 2.1.3

Updates:

  • PhaVIP will provide an additional output phavip_prediction.csv. A detailed explanation of this file can be found via PhaVIP outputs
  • The protein annotation file gene_annotation.tsv will provide the alignment identity and coverage information as new columns.
  • Fixed a potential issue when running phylogenetic tree (FastTree) in the tree task. Users should re-download the PhaBOX v2 database if they would like to run the tree task. Download link
  • Fixed a typo in the PhaTYP program.

Oct 30th, 2024

Version 2.1.2

Updates:

  • End_to_end mode will only make predictions on the predicted viruses. For low-confidence viruses and non-viruses with a flag lower than the viral score threshold, we provide a file named uncertain_sequences_for_contamination_task.fa. We suggest the user run the contamination task to check the quality of their sequences first.
  • The contamination task will provide four more fasta files for users: low_quality_virus.fa, medium_quality_virus.fa, high_quality_virus.fa and a cropped proviruses.fa. Users can use them to re-run other tasks.

Oct 29th, 2024

Version 2.1.1

Updates:

  • Integrate the PhaVIP into phabox2, and provide more detailed protein annotations. In this version, phavip will be automatically called when running end_to_end, phamer, phatyp, phagcn, and cherry. The outputs are named gene_annotation.tsv in the xxx_supplementary folder.
  • End_to_end/PhaMer will provide an additional file named uncertain_sequences_for_contamination_task.fa and the outputs in the phamer_prediction.tsv will suggest the user run the contamination task to check the quality of these sequences (probability proviruses or novel viruses.
  • CHERRY will provide a full lineage for the host in either the NCBI or GTDB version. However, because CHERRY is based on sequences from NCBI for prediction, some of them cannot find the corresponding lineage in GTDB.
  • CHERRY will assign a score of 0 for the unpredicted host now rather than 'nan'.
  • Fixed an issue where PhaTYP might not output anything when there was no alignment result for the input sequence.
  • Fixed a possible problem when the length of the input sequence is equal to the filtering threshold.
  • Revised some typos in the help documents.

Oct 12th, 2024

Version 2.1

Updates:

  • A vOTU task is added for vOTU grouping.
  • A tree task is added for phylogenetic tree construction.
  • Please check the Options for detailed information.