Aligning Profile Vectors to Sequences in Protein Family Analysis

Pairwise profile alignment Usman Roshan BNFO 601

Protein families • PFAM: http://pfam.sanger.ac.uk/ • Family alignments can be used to search for new members in a database

Profile-sequence alignment • Given a family alignment, how can we align it to a sequence? • First, we compute a profile of the alignment. • We then align the profile to the sequence using standard dynamic programming. • However, we need to describe how to align a profile vector to a nucleotide or residue.

Profile • A profile can be described by a set of vectors of nucleotide/residue frequencies. • For each position i of the alignment, we we compute the normalized frequency of nucleotides A, C, G, and T

Aligning a profile vector to a nucleotide • ClustalW/MUSCLE • Let f be the profile vector • Score(f,j)= • where S(i,j) is substitution scoring matrix

Aligning a profile vector to a nucleotide • PSI-BLAST • Score(f,i)=log(Qi/Pi) • Pi is the background probability of nucleotide i • qij is a matrix of match/mismatch probabilities • Define gi as • and Qi as

Aligning Profile Vectors to Sequences in Protein Family Analysis

Aligning Profile Vectors to Sequences in Protein Family Analysis

Presentation Transcript

Pairwise Sequence Alignment

Pairwise Sequence Alignment

Pairwise sequence Alignment

Pairwise Alignment

Pairwise Sequence Alignment

Pairwise Sequence Alignment

Pairwise sequence Alignment

Pairwise sequence alignment

Pairwise alignment

Pairwise Sequence Alignment

Pairwise Sequence Alignment

Pairwise sequence alignment

Pairwise Sequence Alignment

Pairwise sequence Alignment

Pairwise profile alignment

Pairwise Sequence Alignment

Pairwise alignment

Pairwise alignment

Pairwise Sequence Alignment

Pairwise sequence alignment

Pairwise alignment

Pairwise sequence alignment