With the massive influx of genomic data, exploiting it for phylogenetic analyses is becoming ever more time-consuming and labor-intensive. Mining genetic databases of various data types for homologous sequences, carefully selecting orthologs, constructing multiple alignments, and inferring phylogenies: each step requires specific methods and software, fine-tuning of their parameters, and often reformatting of input and output data. We present a pipeline that integrates conventional and some original methods of bioinformatics and phylogenetic analysis to allow seamless data flow between the individual stages of the entire procedure, from mining resources containing annotated genomes or unannotated proteomes (EST data) to building multi-gene trees using a variety of approaches. The pipeline provides the means to parameterize individual methods, transfer data, and output results at each step of phylogenomic analysis without the need for ad hoc scripting.
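The idea of individually parameterized stages with data passed from one to the next, and results available at every step, can be illustrated with a minimal Python sketch. It is not the pipeline's actual implementation: the names `Stage`, `run_pipeline`, `filter_by_length`, and `pad_to_equal_length` are hypothetical, and the two toy stages are only placeholders for real ortholog selection and multiple alignment.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Dict, List


@dataclass
class Stage:
    """One pipeline step: a name, a function, and that step's own parameters."""
    name: str
    run: Callable[..., Any]
    params: Dict[str, Any] = field(default_factory=dict)


def run_pipeline(stages: List[Stage], data: Any) -> Dict[str, Any]:
    """Feed each stage's output into the next and keep every intermediate result."""
    results: Dict[str, Any] = {}
    for stage in stages:
        data = stage.run(data, **stage.params)
        results[stage.name] = data
    return results


# Toy stand-ins for real stages (hypothetical, for illustration only).
def filter_by_length(seqs: Dict[str, str], min_len: int = 0) -> Dict[str, str]:
    """Placeholder for sequence selection: drop sequences shorter than min_len."""
    return {name: s for name, s in seqs.items() if len(s) >= min_len}


def pad_to_equal_length(seqs: Dict[str, str], gap: str = "-") -> Dict[str, str]:
    """Placeholder for alignment: right-pad sequences with gaps to equal length."""
    width = max((len(s) for s in seqs.values()), default=0)
    return {name: s.ljust(width, gap) for name, s in seqs.items()}


stages = [
    Stage("select", filter_by_length, {"min_len": 4}),
    Stage("align", pad_to_equal_length, {"gap": "-"}),
]
results = run_pipeline(stages, {"a": "MKTAY", "b": "MK", "c": "MKTA"})
```

Because each stage carries its own parameter dictionary and `run_pipeline` stores every intermediate output under the stage name, a step can be re-parameterized or inspected without writing glue code between tools, which is the property the abstract emphasizes.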