iREAD (intron REtention Analysis and Detector)is a tool to detect intron retention(IR) events from RNA-seq datasets. Independent introns, referring to those introns that do not overlap with any exons of any splice isoforms, are used for detecting IRs. iREAD takes two input files: (1)a BAM file representing transcritome, and a bed-like text file containing independent intron coordinates, and output intron retention events based on a set of criteria that filter for reliable IR events. These criteria involves the number of reads/fragments in intronic regions, FPKM, junction reads, read distribution patterns within an intron.
a). Samtools(version:1.2)
b). Bedops(version:2.4.20), available at: https://bedops.readthedocs.io/en/latest/
c). The PERL module Parallel::ForkManager needs to be installed to support multi-core computing.
d). python module: argparse. If not installed, run 'pip install argparse' from shell to install.
Using iREAD is very simple. Only one command needs to be issued form command line, and you would be able to identify IR events from RNA-seq data. For illustration purpose, we have included a test BAM file and a intron coordinate text file along with the iREAD package for testing the package. After you unzip the source package, you should see two folders inside: one is data containing test data, the other is meta containing text files of intron coordinates for mouse (Ensembl ver75) and human(Ensembl ver77), respectively.
To run the iREAD for IR detection, assuming that you are in the folder of iREAD, just issue the following command from shell:
iread.py data/mouse_test.bam meta/intron_mouse_3875.bed -o tmp_output -t 62000000
Notes: Regarding the command above, -t specifies the totally number of mapped reads, which is needed to be provided for calculating FPKM. For this test data, the total number of mapped reads is 62000000 (reads mapped to the whole genome). The BAM file was aligned using STAR. After you run the above command, you will see screen output as below:
To get help, use:
iread.py -h