This is an archived static version of the original phylobabble.org discussion site.

Reconstructing Phylogenetic Tree / missing data

holzdan

Hey peeps !

im very new to phylogenetics and currently trying to read myself a bit into the thematics of tree construction and getting started with all the different tools/software.

i would like to reconstruct a tree of 138 cyanobacterial strains defined by 16S , 16S-ITS , rpoc1 , RnaseP , rbclx , PC-IGS , PSA-IGS. The authors originally constructed the tree using maximum likelihood and the F84 model

I would like to implement another 4 strains where i got all loci sequences except the 16S-ITS. Now im not sure how to deal with missing data , should i construct a tree with all loci where i got missing sequences for only 4 strains , or should i use just the loci where i got total coverage ? is there a way to find the “best” method and will the choice of algorithm or substition model be influenced by this ?

cheers