Abstract
Discovering motifs and repeats in data sequences is of great importance in biology and a large number of efficient tools for their finding have been developed. As the number of results found can be very large, our goal is to provide a tool that, on a mathematical basis, can precisely find all motifs and repeats, filter them according to input arguments and output the results in a convenient way. RepeatsPlus is a program that provides statistical filtering according to input sequence length and number of repeat occurrences, motif mask filtering and filtering related to ambiguous letters in input sequence and a large number of other options. RepeatPlus is implemented in Python and C[Formula: see text]. It is freely available for public use. The user manual and examples of usage are also available.
Collapse