CHOMP 🐊

GitHub version GitHub license codecov Github Issues Pending Pull-Requests

CRISPR Tool

CHOMP will search for all ’n’ window sized guide RNAs (gRNAs) sequences containing an NGG at the tail end. Default window size is 23.

ex.) ATGTAGCTAGCTAGCTAGTAGGG.

It will report how many occurrences of this sequence are present in the target sequence (off-target sites), along with number of matching bases for each subject/query hit. You can use that to determine which gRNAs you may want to use as CRISPR target(s).

Run CHOMP

perl chomp.pl -seq usr/test.fasta -out gRNAs

Arguments

-seq                Sequence file to search gRNAs [required]
-genome             Genome sequence file(s) to BLAST search (search instead of -seq)
-down               Down sequence to append
-up                 Up sequence to append
-window             Window size for gRNA oligo (default = 23)
-ss                 Secondary structure prediction
-out                Out file name [required]
-outdir             Out directory name
-help               Shows this message

Output

Writes 2 files under the default directory gRNAs:

’.fasta’

Fasta file of each gRNA sequence found.

>gRNA_0:0
ATGTAGCTAGCTAGCTAGTAGGG
>gRNA_1:23
AAAAAATTTTCTCTATCTAACGG
>gRNA_2:24
AAAAATTTTCTCTATCTAACGGG
>gRNA_3:115
TGTGATCACGTACTATTATGCGG
>gRNA_4:149
AAAAATCCCATCGATCTAGCAGG
>gRNA_5:154
TCCCATCGATCTAGCAGGCCCGG
.
.
.
>gRNA_16:99
ATAGTACGTGATCACAGTCATGG

Suffix digit after ‘:’ denotes nucleotide position in sequence where gRNA was found. Ex.) gRNA_16:99 , gRNA was found starting at nucleotide position 99 in -seq sequence.

’.txt’

Report with each gRNA sequence’s details.

CHOMP will report how many occurrences of this sequence are present in the target sequence (off-target sites), along with the number of base pair matches (identities) for each. You can use this to determine which gRNA sequence is best to use for target.

NameSequenceStrandPalindromeSubjectStartOccurrencesIdentities
gRNA_13TCGTCATGCATGCTCGCTCCGGGreverseNotest17318
gRNA_12TTCGTCATGCATGCTCGCTCCGGreverseNotest17418
gRNA_3TGTGATCACGTACTATTATGCGGplusNotest11628
gRNA_16ATAGTACGTGATCACAGTCATGGreverseNotest10928
gRNA_8AAAAAAAATTTTCCCTATCGGGGplusNotest19519
gRNA_7GAAAAAAAATTTTCCCTATCGGGplusNotest19419
gRNA_6CGAAAAAAAATTTTCCCTATCGGplusNotest19319
gRNA_9AAAAAAATTTTCCCTATCGGGGGplusNotest19619
gRNA_4AAAAATCCCATCGATCTAGCAGGplusNotest150611,9,7
gRNA_0ATGTAGCTAGCTAGCTAGTAGGGplusNotest1414,12,10
gRNA_2AAAAATTTTCTCTATCTAACGGGplusNotest25315,10,8
gRNA_15TCCGGGCCTGCTAGATCGATGGGreverseNotest156615,9,7
gRNA_14CTCCGGGCCTGCTAGATCGATGGreverseNotest157615,9,7
gRNA_5TCCCATCGATCTAGCAGGCCCGGplusNotest155615,9,7
gRNA_1AAAAAATTTTCTCTATCTAACGGplusNotest24316,10,8
gRNA_11TATAGCATGGGCCCCCGATAGGGreverseNotest207123
gRNA_10CTATAGCATGGGCCCCCGATAGGreverseNotest208123

Table is sorted in increasing order using the top identity for each gRNA sequence, and then sorted by number of occurrences, in current subject.