iCLIP.clusters.get_crosslink_fdr_by_randomisation¶
-
iCLIP.clusters.get_crosslink_fdr_by_randomisation(gtf_iterator, bam, randomisations=100, nspread=15, pool=None)¶ This function will carry out the assessment of crosslink site significance using the method outlined in Wang Z et al.
Breifly the empirical distribution of hieights is calculated accross the transcript after calculating the height of any one base by summing reads sites within 15 bases.
FDR is then assessed by comparison to randomized profiles.
Results will be returned in the order of the gtf_iterator unless paralellization is used, in which case not particular order is guarenteed.
param gtf_iterator: An iterator that returns listss of CGAT.GTF.Entry type bam: pysam.AlignmentFile param pool: If a worker pool is provided work will be parallelised accross the pool rtype: pd.Series with a MultiIndex first level contig, second level base