Corporate Banner
Satellite Banner
Mass Spectrometry
Scientific Community
Become a Member | Sign in
Home>News>This Article

Human Proteome Project Uncovers 193 Proteins Not Known to Exist

Published: Thursday, May 29, 2014
Last Updated: Thursday, May 29, 2014
Bookmark and Share
The team of international researchers reports the identification of 193 novel proteins that came from regions of the genome not predicted to code for proteins.

Striving for the protein equivalent of the Human Genome Project, an international team of researchers has created an initial catalog of the human “proteome,”or all of the proteins in the human body. In total, using 30 different human tissues, the team identified proteins encoded by 17,294 genes, which is about 84 percent of all of the genes in the human genome predicted to encode proteins.

In a summary of the effort, to be published May 29 in the journal Nature, the team also reports the identification of 193 novel proteins that came from regions of the genome not predicted to code for proteins, suggesting that the human genome is more complex than previously thought. The cataloging project, led by researchers at The Johns Hopkins University and the Institute of Bioinformatics in Bangalore, India, should prove an important resource for biological research and medical diagnostics, according to the team’s leaders.

“You can think of the human body as a huge library where each protein is a book,” says Akhilesh Pandey, M.D., Ph.D., a professor at the McKusick-Nathans Institute of Genetic Medicine and of biological chemistry, pathology and oncology at The Johns Hopkins University and the founder and director of the Institute of Bioinformatics. “The difficulty is that we don’t have a comprehensive catalog that gives us the titles of the available books and where to find them. We think we now have a good first draft of that comprehensive catalog.”

While genes determine many of the characteristics of an organism, they do so by providing instructions for making proteins, the building blocks and workhorses of cells, and therefore of tissues and organs. For this reason, many investigators consider a catalog of human proteins — and their location within the body — to be even more instructive and useful than the catalog of genes in the human genome.

Studying proteins is far more technically challenging than studying genes, Pandey notes, because the structures and functions of proteins are complex and diverse. And a mere list of existing proteins would not be very helpful without accompanying information about where in the body those proteins are found. Therefore, most protein studies to date have focused on individual tissues, often in the context of specific diseases, he adds.

To achieve a more comprehensive survey of the proteome, the research team began by taking samples of 30 tissues, extracting their proteins and using enzymes like chemical scissors to cut them into smaller pieces, called peptides. They then ran the peptides through a series of instruments designed to deduce their identity and measure their relative abundance.

“By generating a comprehensive human protein dataset, we have made it easier for other researchers to identify the proteins in their experiments,” says Pandey. “We believe our data will become the gold standard in the field, especially because they were all generated using uniform methods and analysis, and state-of-the-art machines.”

Among the proteins whose data patterns have been characterized for the first time are many that were never predicted to exist. (Within the genome, in addition to the DNA sequences that encode proteins, there are stretches of DNA whose sequences do not follow a conventional protein-coding gene pattern and have therefore been labeled “noncoding.”) The team’s most unexpected finding was that 193 of the proteins they identified could be traced back to these supposedly noncoding regions of DNA.

“This was the most exciting part of this study, finding further complexities in the genome,” says Pandey. “The fact that 193 of the proteins came from DNA sequences predicted to be noncoding means that we don’t fully understand how cells read DNA, because clearly those sequences do code for proteins.”

Pandey believes that the human proteome is so extensive and complex that researchers’ catalog of it will never be fully complete, but this work provides a solid foundation that others can reliably build upon.

Authors of the report include Min-Sik Kim and others from the Johns Hopkins University School of Medicine, Harsha Gowda and others from the Institute of Bioinformatics, and others from several other institutions.

Further Information
Access to this exclusive content is for Technology Networks Premium members only.

Join Technology Networks Premium for free access to:

  • Exclusive articles
  • Presentations from international conferences
  • Over 2,600+ scientific posters on ePosters
  • More than 3,800+ scientific videos on LabTube
  • 35 community eNewsletters

Sign In

Forgotten your details? Click Here
If you are not a member you can join here

*Please note: By logging into you agree to accept the use of cookies. To find out more about the cookies we use and how to delete them, see our privacy policy.

Scientific News
Newly Identified Biochemical Pathway Could Be Target for Insulin Control
Researchers at Duke Medicine and the University of Alberta are reporting the identification of a new biochemical pathway to control insulin secretion from islet beta cells in the pancreas, establishing a potential target for insulin control.
Dirty,Crusty Meals Fit for (Long-Dormant) Microbes
Researchers apply the latest analytical techniques to further our understanding of desert biocrusts.
CSI -- On The Metabolite's Trail
Bioinformaticians at the University of Jena make the most efficient search engine for molecular structures available online.
First Complete Structural Study Of A Pegylated Protein
Significant data obtained at NUI Galway reports first crystal structure of a protein modified with a single PEG chain.
Potential Ovarian Cancer Biomarker Isolated
Researchers from North Carolina State University utilized a highly sensitive mass spectrometry analysis to identify and measure difficult-to-detect N-glycan biomarkers associated with ovarian cancers in stages I – IV.
Waters’ Centers of Innovation Program Honors Ohio State University Laboratory
Wysocki-led facility recognized for proteomics, disease research and mass spectrometry excellence.
Protein Aggregation After Heat Shock Is An Organized, Reversible Response
New study finds protein aggregation after heat exposure is a reversible cellular process, not unrecoverable damage from misfolding.
World’s Largest Protein Interaction Map Created
A multinational team of scientists sifted through cells of vastly different organisms – from amoebae to worms to mice to humans – to reveal how proteins fit together to build different cells and bodies.
Evidence of Ancient Life Discovered Deep Below the Seafloor
Ancient rocks harbored microbial life deep below the seafloor, reports a team of scientists from the Woods Hole Oceanographic Institution (WHOI), Virginia Tech, and the University of Bremen.
Structural Discoveries Could Aid in Better Drug Design
Scientists have uncovered the structural details of how some proteins interact to turn two different signals into a single integrated output.
Scroll Up
Scroll Down
Skyscraper Banner

Skyscraper Banner
Go to LabTube
Go to eposters
Access to the latest scientific news
Exclusive articles
Upload and share your posters on ePosters
Latest presentations and webinars
View a library of 1,800+ scientific and medical posters
2,600+ scientific and medical posters
A library of 2,500+ scientific videos on LabTube
3,800+ scientific videos