<< Chapter < Page Chapter >> Page >
This project is designed to give the student experience using the bioinformatics tools that have been taught in previous modules in an independent project. The student will use the project outline to develop a bioinformatics problem or question according to their interests, and determine the appropriate tools to answer the question.

The bioinformatics project is an opportunity to use the tools taught in previous modules to research an area of personal interest. The student will choose one biological sequence with some presumed functional or structural importance (it can be putative, as long as there is some evidence, preferably published evidence, that supports the putative significance of the sequence). The sequence can be nucleotide or protein, but it must be at least 140 residues in length. The sequence can represent only one domain of a protein or gene, or a regulatory region, as long as it meets the above requirements for significance and length and the student has sufficiently justified why it is appropriate to extract just one region from the entire protein or gene for the problem or question chosen. There will be five sections to the project:

  • Define the problem or question.
  • Materials and Methods.
  • Multiple sequence alignment figure.
  • Phylogenetic tree.
  • Discussion.
Examples of the types of problems that are appropriate for this project include:
  • Detection of distantly related (divergent) sequences.
  • Detection of sequence homologs in various species.
  • Detection of homologous motifs in proteins of varied function.
  • Generation of a functional domain profile from a set of sequences.

Section 1. define the problem or question.

This section will define the sequence chosen as the starting or master sequence, from which the student will generate other sequences according to the question to be answered. Discuss the significance of the sequence and justify its selection as the topic for this project. Be sure it meets all the above-specified requirements. One well-written paragraph should be able to encompass this section.

Section 2. materials and methods.

Write a complete Materials and Methods section as for a journal article. Some journal articles may be selected and supplied by the instructor to serve as examples. Possible subsection titles could include:

  • Databases
  • Database Search Tools
  • Multiple Alignment Methods
  • Generation of Phylogenetic trees
Most methods subsections can be sufficiently expressed with one paragraph.

Section 3. multiple sequence alignment.

The student should choose a question to address that will generate at least 8 related sequences (this can be including the master sequence). Perform a multiple sequence alignment and generate a multiple sequence alignment figure that pertains to the question or problem of study. For example, a problem that deals with the detection of homologous motifs in proteins of varied function should be aligned to best illustrate similarity in the chosen motif or domain, which may not necessarily illustrate the best global alignment. Once aligned, sequences may require annotation to highlight specific regions or functional domains. The student may generate the multiple sequence alignment figure by a variety of methods, but the suggested approach is to use the Biology Workbench. Alignments can be performed under the PROTEIN TOOLS or NUCLEIC TOOLS menus. Under the alignment tools menu, options include editing, viewing, displaying (MVIEW) and drawing figures of (TEXSHADE, BOXSHADE) alignments.Note the warning under BOXSHADE that when the ruler is chosen for alignment numbering, boxshade can get stuck and never finish. The ruler is a nice addition to the figure, so try changing the font size or page orientation when this happens, as the Workbench recommends.

Section 4. phylogenetic tree.

Generate a phylogenetic tree that illustrates the relationships of the sequences in the multiple alignment. The tree can be rooted or unrooted, but should demonstrate the sequence relationships clearly. Phylogenetic trees can be drawn using DRAWGRAM or DRAWTREE under the alignment tools menu. All figures in this project should be entitled, labeled and captioned (see journal articles for examples).

Section 5. discussion.

Discuss the information discovered from the sequences generated and how this information addresses the problem or question of study. Be sure that the following questions are answered, if appropriate to the problem:

  • How were functional domains determined or assigned?
  • How were profiles generated or chosen, if used?
  • Did the functional domains line up correctly in the alignments? If not, why not?
  • Did you identify partial matches, or fringe homologs?

This project challenges the student to use the bioinformatics tools introduced in previous modules for independent study. The student gains experience in the proper design of a bioinformatics experiment, and in the proper presentation of bioinformatics methods, figures and results.

Get Jobilize Job Search Mobile App in your pocket Now!

Get it on Google Play Download on the App Store Now




Source:  OpenStax, Bios 533 bioinformatics. OpenStax CNX. Sep 24, 2008 Download for free at http://cnx.org/content/col10152/1.16
Google Play and the Google Play logo are trademarks of Google Inc.

Notification Switch

Would you like to follow the 'Bios 533 bioinformatics' conversation and receive update notifications?

Ask