University of Dundee

Sequencing technology helps reveal what plant genomes really encode

27 Jul 2012

Scientists from the College of Life Sciences at the University of Dundee and the James Hutton Institute have teamed up with researchers in the USA to use a new technique to sequence the genes of the plant Arabidopsis. This approach, which allows researchers to see exactly where a plant's genes end, could be applied to crops in the hope of boosting efforts to breed new varieties.

The Biotechnology and Biological Sciences Research Council (BBSRC)-funded team are the first in the world to try to understand how plant genes are organised by directly sequencing a molecule called RNA rather than DNA. Their findings are published  in the journal Nature Structural & Molecular Biology.

 

The research involved collaboration between a team of biologists led by Dr Gordon Simpson of the Division of Plant Sciences at the University of Dundee, computational scientists led by Prof Geoff Barton at the Colle ge of Life Sciences and a technology group from Helicos Biosciences in the USA.

 

Dr Simpson explains, "Many genes work by coding for proteins, but the path from DNA to protein goes via an intermediate molecule called RNA. By sequencing the RNA you can see exactly which bits of the genome make proteins and what genes are turned on in different cells and at particular times.

 

"Until now, people have sequenced RNA by first converting it back into DNA. They chop it up, add on special molecules and then because there is not enough, they copy the bits again and again, before finally sequencing. The trouble is that all these steps introduce bias and error. What's special about what we have done is we have avoided all these steps and sequenced the RNA directly".

 

This technique allows scientists to see exactly where genes end with unprecedented certainty. This is important for two reasons. Firstly it helps us find individual genes within genomes and so work out what they do. Second, it tells us something about how cells are behaving. Cells can choose where a gene should end depending on what the gene should be doing. For example cutting off genes at different points affects when a plant flowers. It has recently become apparent that these choices occur widely in biology: for example, there are global changes in where genes end in cancer tissue.

 

Dr Simpson continues, "Using this technique we can unequivocally score where genes end, count how active genes are and say from which strand of the DNA double helix the RNA is copied. Surprisingly we found that genes that overlap each other account for a large fraction of gene activity. We also found new ends to thousands of genes and found genes that were completely new to us".

 

The huge amount of data and the novelty of the procedure required new approaches from computational scientists in Dundee. Sasha Sherstnev, who did much of the analysis, comes from a background in particle physics and previously worked at CERN on the search for the Higgs Boson. He brought experience necessary to meet the demands accompanying the rise of large data sets in biology that are being driven by changes in sequencing technology.

 

As well as enabling scientists to understand what genomes actually encode and how active genes are, direct RNA sequencing could be especially useful for situations where only a few cells are available, for example when working with patient samples.

 

The Dundee team now plan to use their expertise to understand what other genomes encode and how that changes in disease. In this way, they can help deliver greater accuracy in rational crop improvement.

 

Professor Douglas Kell, Chief Executive, BBSRC, said, "This is a great example of how mastering new techniques and embracing new ways of working can deliver valuable insights into biology. It will be interesting to see where this team directs their expertise next."