Repliscan
Please work through the documentation and add your comments on the bottom of this page, or email comments to support@cyverse.org. Thank you.
Repliscan is simple and effective to use on organisms with different genome sizes. Even with analysis window sizes as small as 1 kilobase, reliable profiles can be generated with as little as 2.4x coverage.
Repliscan is described in the following publication:
Zynda, G., Song, J., Concia, L. et al. Repliscan: a tool for classifying replication timing regions. BMC Bioinformatics 18, 362 (2017). https://doi.org/10.1186/s12859-017-1774-x
Published07 August 2017
Mandatory arguments
Parameters
scope: Replication scope [chromosome, genome]
percentile cutoff: Remove the upper and lower % of the data when using percentile as remove
log: Apply log transform to sequenceability ratio
aggregate: Replicate agregation method [sum, median, mean, min, max]
value: Explicit replication threshold value when using threshold as value
classifier: Segmentation classification method [binary, proportion]
Plot: Plot Statistics of input coverage histograms and for cubic and derivative interpolation for each chromosome
threshold: Replication threshold method [value, auto, percent]
percent: Remove the lowest % of signal as noise when using threshold as percent
remove: Outlying data to remove [none, sqrtGamma, lognGamma, norm, whiskers, percentile]
Haar smoothing level: Haar smoothing level [1,2,3,4,5]
Analysis bin size in base pairs: 1000
Test Run
All files are located in the Community Data directory of the CyVerse Discovery Environment at the following path:
Community Data > iplantcollaborative > example_data >repliscan
(/iplant/home/shared/iplantcollaborative/example_data/repliscan)
Mandatory arguments:
Use input_ds.txt and folder for files listed on list of bams
These alignment files: bams are from maize samples, have been downsampled and do not include biological replicate information hence, this does not represent the full dataset on our initial analyses and provided for testing purposes.Parameters used for test data:
-Haar smoothing level: 3
-Analysis bin size in base pairs: 3000
-aggregate: sum
-threshold: auto
-Scope: chromosome
-value: 1
-percent: 2
-percentile cutuff: 0.04-1.0
-classifier: proportion
-Remove: percentile
--plot: yes
-Reference genome in Fasta format
Parameters:
Leave all the values as they are for default settings.
Output
In the case of test run, the output folder contains bedgraph files and gff3 files