This volume contains about 40 papers covering many of the latest developments in the fast-growing field of bioinformatics. The contributions span a wide range of topics, including computational genomics and genetics, protein function and computational proteomics, the transcriptome, structural bioinformatics, microarray data analysis, motif identification, biological pathways and systems, and biomedical applications. There are also abstracts from the keynote addresses and invited talks.
The papers cover not only theoretical aspects of bioinformatics but also delve into the application of new methods, with input from the computational, engineering, and biological disciplines. This multidisciplinary approach gives these proceedings a unique perspective on the field.
https://doi.org/10.1142/9781860947575_fmatter
COMMITTEES.
REFEREES.
PREFACE.
CONTENTS.
https://doi.org/10.1142/9781860947575_0001
No abstract received.
https://doi.org/10.1142/9781860947575_0002
For the past decade, there has been increasing interest in ontologies in the biomedical community. As interest has peaked, so has the confusion. The confusion stems from the multiple knowledge-representation languages used to encode ontologies (e.g., frame-based systems, Semantic Web standards such as RDF(S) and OWL, and languages created specifically by the bioinformatics community, such as OBO), where each language has distinct strengths and weaknesses. Biomedical scientists use ontologies for multiple purposes, from annotation of experimental data, to natural-language processing, to data integration, to construction of decision-support systems. Each of these purposes imposes different requirements concerning which entities ontologies should encode and how those entities should be encoded. Although the biomedical informatics community remains excited about ontologies, exactly what an ontology is and how it should be represented within a computer are points about which, despite considerable questioning, we can find little uniformity of opinion. The confusion will persist until we understand that different developers have very different requirements for ontologies, and that those developers will therefore make very different assumptions about how ontologies should be created and structured. We will review those assumptions and the corresponding implications for ontology construction.
Our National Center for Biomedical Ontology (http://bioontology.org) is one of the seven national centers for biomedical computing formed under the NIH Roadmap. The Center takes a broad perspective on what ontologies are and how they should be developed and put to use. Our goal, simply put, is to help to eliminate much of the current confusion. The Center recognizes the importance of ontologies for use in a wide range of biomedical applications, and is developing new technology to make all relevant ontologies widely accessible, searchable, alignable, and useable within software systems. Ultimately, the Center will support the publication of biomedical ontologies online, much as we publish scientific knowledge in print media. The advent of biomedical knowledge that is widely available in machine-processable form will alter the way that we think about science and perform scientific experiments. The biomedical community soon will enter an era in which scientific knowledge will become more accessible, more useable, and more precise, and in which new methods will be needed to support a radically different kind of scientific publishing.
https://doi.org/10.1142/9781860947575_0003
No abstract received.
https://doi.org/10.1142/9781860947575_0004
No abstract received.
https://doi.org/10.1142/9781860947575_0005
No abstract received.
https://doi.org/10.1142/9781860947575_0006
No abstract received.
https://doi.org/10.1142/9781860947575_0007
Recent advances in biological imaging technologies have enabled the observation of living cells at high resolution over extended periods of time and are impacting biological research in areas as different as high-throughput image-based drug screening, cellular therapies, cell and developmental biology, and gene expression studies. Deciphering the complex machinery of cell function and dysfunction indeed necessitates large-scale multidimensional image-based assays to cover the wide range of highly variable and intricate properties of biological systems. However, understanding the wealth of data generated by multidimensional microscopy depends critically on decoding the visual information contained therein and on the availability of tools to do so. Innovative automatic techniques to extract quantitative data from image sequences are therefore of major interest. I will present methods we have recently developed to perform the computational analysis of image sequences from multidimensional microscopy, with particular emphasis on tracking and motion analysis for 3D+t image sequences using active contours and multiple particle tracking.
https://doi.org/10.1142/9781860947575_0008
No abstract received.
https://doi.org/10.1142/9781860947575_0009
No abstract received.
https://doi.org/10.1142/9781860947575_0010
Despite recent developments in protein structure prediction, an accurate new fold prediction algorithm remains elusive. One of the challenges facing current techniques is the size and complexity of the space containing possible structures for a query sequence. Traditionally, fragment assembly approaches to new fold prediction have explored this space with stochastic optimization techniques. Here we examine deterministic algorithms for optimizing scoring functions in protein structure prediction. Two previously unused techniques are applied to the problem: the Greedy algorithm and the Hill-climbing algorithm. The main difference between the two is that the latter implements a technique to overcome local minima. Experiments on a diverse set of 276 proteins show that the Hill-climbing algorithm consistently outperforms existing approaches based on Simulated Annealing optimization (a traditional stochastic technique) in optimizing the root mean squared deviation (RMSD) between native and working structures.
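To make the distinction concrete, here is a minimal, generic sketch of greedy descent versus hill-climbing with an escape rule. The move set, scoring function, and escape rule (a bounded uphill walk over unvisited states) are illustrative assumptions on a toy landscape, not the authors' fragment-assembly implementation.

```python
def greedy(x0, score, neighbors, max_iters=1000):
    """Pure greedy descent: accept only improving moves, stop at a local minimum."""
    x, best = x0, score(x0)
    for _ in range(max_iters):
        s, n = min((score(n), n) for n in neighbors(x))
        if s >= best:                 # no improving neighbor: local minimum reached
            break
        x, best = n, s
    return x, best

def hill_climb(x0, score, neighbors, max_iters=1000, patience=25):
    """Greedy descent plus a simple escape rule: when no neighbor improves,
    take the least-bad unvisited neighbor (a bounded uphill walk)."""
    x, s = x0, score(x0)
    best_x, best_s = x, s
    visited = {x}
    uphill = 0
    for _ in range(max_iters):
        candidates = [(score(n), n) for n in neighbors(x) if n not in visited]
        if not candidates:
            break
        ns, nx = min(candidates)
        uphill = 0 if ns < best_s else uphill + 1   # reset only when a new best is found
        if uphill > patience:
            break
        x, s = nx, ns
        visited.add(x)
        if s < best_s:
            best_x, best_s = x, s
    return best_x, best_s

# Toy usage: minimize a rugged one-dimensional "energy" over integer states.
score = lambda x: (x - 37) ** 2 + 40 * (x % 7 == 0)   # quadratic bowl with periodic bumps
neighbors = lambda x: [x - 1, x + 1]
print(greedy(0, score, neighbors))       # halts at the first local minimum
print(hill_climb(0, score, neighbors))   # walks through it and reaches x = 37
```

In this toy run, greedy descent stops at the first local minimum it meets, while the hill-climbing variant steps through it and reaches the global minimum, mirroring the behavioral difference the abstract describes.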
https://doi.org/10.1142/9781860947575_0011
Motivation: A key class of membrane proteins contains one or more transmembrane (TM) helices traversing the membrane lipid bilayer. Various properties, such as the length, arrangement, and topology (orientation) of TM helices, are closely related to a protein's functions. Although a range of methods have been developed to predict TM helices and their topologies, no single method consistently outperforms the others. In addition, topology prediction has much lower accuracy than helix prediction and thus requires continued improvement.
Results: We develop a method based on support vector machines (SVM) in a hierarchical framework that predicts TM helices first, followed by their topology. By partitioning the prediction problem into two steps, specific input features can be selected and integrated in each step. We also propose a novel scoring function for topology models based on the membrane protein folding process. When benchmarked against other methods, our approach achieves the highest scores, at 86% in helix prediction (Q2) and 91% in topology prediction (TOPO) on the high-resolution data set, an improvement of 6% and 14% in the respective categories over the second best method. Furthermore, we demonstrate the ability of our method to discriminate between membrane and non-membrane proteins with greater than 99% accuracy. When tested on a small set of newly solved membrane protein structures, our method overcomes some of the difficulties in predicting TM helices by incorporating multiple biological input features.
https://doi.org/10.1142/9781860947575_0012
Protein structure prediction is one of the most important and difficult problems in computational molecular biology. Protein threading represents one of the most promising techniques for this problem. One of the critical steps in protein threading, called fold recognition, is to choose the best-fit template for the query protein with the structure to be predicted. The standard method for template selection is to rank candidates according to the z-score of the sequence-template alignment. However, the z-score calculation is time-consuming, which greatly hinders structure prediction at a genome scale. In this paper, we present a machine learning approach that treats the fold recognition problem as a regression task and uses a least-squares boosting algorithm (LS_Boost) to solve it efficiently. We test our method on Lindahl's benchmark and compare it with other methods. According to our experimental results we can draw the conclusions that: (1) Machine learning techniques offer an effective way to solve the fold recognition problem. (2) Formulating protein fold recognition as a regression rather than a classification problem leads to a more effective outcome. (3) Importantly, the LS_Boost algorithm does not require the calculation of the z-score as an input, and therefore can obtain significant computational savings over standard approaches. (4) The LS_Boost algorithm obtains superior accuracy, with less computation for both training and testing, than alternative machine learning approaches such as SVMs and neural networks, which also need not calculate the z-score. Finally, by using the LS_Boost algorithm, one can identify important features in the fold recognition protocol, something that cannot be done using a straightforward SVM approach.
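The abstract does not spell out the LS_Boost details, but stagewise least-squares boosting in general fits a weak regressor to the current residuals at each round and adds a shrunken copy of it to the ensemble. The sketch below illustrates that generic loop with decision stumps on synthetic features; the features, weak learner, and hyperparameters are assumptions for illustration only, not the paper's implementation.

```python
import numpy as np

def fit_stump(X, r):
    """Fit the best single-feature threshold stump to residuals r (squared error)."""
    best = None
    for j in range(X.shape[1]):
        for t in np.unique(X[:, j]):
            left = X[:, j] <= t
            if left.all() or not left.any():
                continue
            pred = np.where(left, r[left].mean(), r[~left].mean())
            err = float(((r - pred) ** 2).sum())
            if best is None or err < best[0]:
                best = (err, j, t, r[left].mean(), r[~left].mean())
    _, j, t, lv, rv = best
    return lambda Z, j=j, t=t, lv=lv, rv=rv: np.where(Z[:, j] <= t, lv, rv)

def ls_boost(X, y, n_rounds=50, shrinkage=0.1):
    """Stagewise least-squares boosting: each round fits a weak regressor to the
    current residuals and adds a shrunken copy of it to the ensemble."""
    pred = np.full(len(y), y.mean())
    learners = [lambda Z, c=y.mean(): np.full(len(Z), c)]
    for _ in range(n_rounds):
        h = fit_stump(X, y - pred)                 # fit the residuals
        pred = pred + shrinkage * h(X)
        learners.append(lambda Z, h=h: shrinkage * h(Z))
    return lambda Z: sum(f(Z) for f in learners)

# Toy usage: two synthetic "alignment features" predicting a quality target.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 2))
y = 2.0 * X[:, 0] - 1.0 * X[:, 1] + 0.2 * rng.normal(size=200)
model = ls_boost(X, y)
print(round(float(np.corrcoef(model(X), y)[0, 1]), 3))   # training correlation of ensemble output
```

In the fold-recognition setting described above, the ensemble's real-valued output would be used to rank candidate templates for a query, in place of a z-score.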
https://doi.org/10.1142/9781860947575_0013
Success in backbone resonance sequential assignment is fundamental to protein three-dimensional structure determination via NMR spectroscopy. Such a sequential assignment can roughly be partitioned into three separate steps: grouping resonance peaks in multiple spectra into spin systems, chaining the resultant spin systems into strings, and assigning strings of spin systems to non-overlapping consecutive amino acid residues in the target protein. Dealing with these three steps separately has been the approach of many existing assignment programs; it works well on protein NMR data of close to ideal quality, but only moderately or even poorly on most real protein datasets, where noise and data degeneracy occur frequently. We propose in this work to partition the sequential assignment not into physical steps but only virtual steps, and to use their outputs to cross-validate each other. The novelty lies in the fact that ambiguities in the grouping step are resolved by finding highly confident strings in the chaining step, and ambiguities in the chaining step are resolved by examining the mappings of strings in the assignment step. In this way, all ambiguities in the sequential assignment are resolved globally and optimally. The resultant assignment program, called GASA, was compared to several recent similar developments: RIBRA, MARS, PACES, and a random graph approach. The performance comparisons demonstrate that GASA may be more promising for practical use.
https://doi.org/10.1142/9781860947575_0014
Traditional algorithms for the structure determination of native proteins by solution nuclear magnetic resonance (NMR) spectroscopy require a large number of experimental restraints. These algorithms formulate the structure determination problem as the computation of a structure or a set of similar structures that best fit the restraints. However, for both laboratory-denatured and natively-disordered proteins, the number of restraints measured by current NMR techniques is well below that required by traditional algorithms. Furthermore, there presumably exists a heterogeneous set of structures in either the denatured or disordered state. We present a data-driven algorithm capable of computing a set of structures (ensemble) directly from sparse experimental restraints. For both denatured and disordered proteins, we formulate the structure determination problem as the computation of an ensemble of structures from the restraints. In this formulation, each experimental restraint is a distribution. Compared with previous algorithms, our algorithm can extract more structural information from the experimental data. In our algorithm, all the backbone conformations consistent with the data are computed by solving a series of low-degree polynomial equations (yielding exact solutions in closed form) and by systematic search with pruning. The algorithm has been successfully applied to determine the structural ensembles of two denatured proteins, acyl-coenzyme A binding protein (ACBP) and eglin C, using real experimental NMR data.
https://doi.org/10.1142/9781860947575_0015
Root mean square deviation (RMSD) is often used to measure the difference between structures. We show mathematically that, for multiple structure alignment, the minimum RMSD (weighted at aligned positions or unweighted) for all pairs is the same as the RMSD to the average of the structures. Thus, using RMSD implies that the average is a consensus structure. We use this property to validate and improve algorithms for multiple structure alignment. In particular, we establish the properties of the average structure, and show that an iterative algorithm proposed by Sutcliffe and co-authors can find it efficiently — each iteration takes linear time and the number of iterations is small. We explore the residuals after alignment and assign weights to positions to identify aligned cores of structures. Observing this property also calls into question whether global RMSD is the right way to compare multiple protein structures, and guides the search for more local techniques.
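The equivalence stated above rests on a standard algebraic identity (notation ours, with the superposition step held fixed): for m superposed structures with n aligned positions each,

$$\sum_{1 \le a < b \le m} \sum_{i=1}^{n} \left\| x_i^{(a)} - x_i^{(b)} \right\|^2 \;=\; m \sum_{a=1}^{m} \sum_{i=1}^{n} \left\| x_i^{(a)} - \bar{x}_i \right\|^2, \qquad \bar{x}_i = \frac{1}{m} \sum_{a=1}^{m} x_i^{(a)}.$$

Minimizing the sum of squared pairwise deviations (and hence the all-pairs RMSD) is therefore the same as minimizing squared deviations from the per-position average, which is why the average acts as an implicit consensus structure.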
https://doi.org/10.1142/9781860947575_0016
This paper presents a novel methodology to analyze low-resolution (e.g., 6Å to 10Å) protein density maps, such as those obtained through electron cryomicroscopy. At such resolutions, it is often not possible to recognize the backbone chain of the protein, but it is possible to identify individual structural elements (e.g., α-helices and β-sheets). The methodology proposed in this paper performs gradient analysis to recognize volumes in the density map and to classify them. In particular, the focus is on the reliable identification of α-helices. The methodology has been implemented in a tool, called Helix Tracer, and successfully tested with simulated structures, modeled from the Protein Data Bank at 10Å resolution. The results of the study have been compared with the only other known tool with similar capabilities (Helixhunter), showing significant improvements in recognition and precision.
https://doi.org/10.1142/9781860947575_0017
Computational search of genomes for RNA secondary structure is an important approach to the annotation of non-coding RNAs. The bottleneck of the search is sequence-structure alignment, which is often computationally intensive. A plausible solution is to devise effective filters that can efficiently remove segments unlikely to contain the desired structure patterns in the genome and to apply the search only to the remaining portions. Since filters can be substructures of the RNA to be searched, the strategy for selecting which substructures to use as filters is critical to the overall search speed-up. This issue becomes more involved when the structure contains pseudoknots; approaches that can filter pseudoknots are not yet available. In this paper, a new effective filtration scheme is introduced to filter RNA pseudoknots. Based upon the authors' earlier work on a tree-decomposable graph model for RNA pseudoknots, the new scheme can automatically derive a set of filters with the overall optimal filtration ratio. Search experiments on both synthetic and biological genomes showed that, with this filtration approach, RNA structure search can be sped up 11 to 60 fold while maintaining the same search sensitivity and specificity as without filtration. In some cases, the filtration even improves the already high specificity.
https://doi.org/10.1142/9781860947575_0018
Thermodynamic RNA secondary structure prediction is an important recipe for the latest generation of functional non-coding RNA finding tools. However, the predicted energy is not strong enough by itself to distinguish a single functional non-coding RNA from other RNA. Here, we analyze how well an RNA molecule folds into a particular structural class with a restricted folding algorithm called Thermodynamic Matcher (TDM). We compare this energy value to that of randomized sequences. We construct and apply TDMs for the non-coding RNA families RNA I and hammerhead ribozyme type III and our results show that using TDMs rather than universal minimum free energy folding allows for highly significant predictions.
https://doi.org/10.1142/9781860947575_0019
Replication of time series in microarray experiments is costly. To analyze time series data with no replicates, many model-specific approaches have been proposed. However, they fail to identify the genes whose expression patterns do not fit the pre-defined models. Besides, modeling the temporal expression patterns is difficult when the dynamics of gene expression in the experiment is poorly understood. We propose a method called PEM (Partial Energy ratio for Microarray) for the analysis of time course cDNA microarray data. In the PEM method, we assume gene expression varies smoothly in the temporal domain. This assumption is comparatively weak, and hence the method is general enough to identify genes expressed in unexpected patterns. To identify the differentially expressed genes, a new statistic is developed by comparing the energies of two convoluted profiles. We further improve the statistic for microarray analysis by introducing the concept of partial energy. The PEM statistic is incorporated into the permutation-based SAM framework for significance analysis. We evaluated the PEM method with an artificial dataset and two published time course cDNA microarray datasets on yeast. The experimental results show the robustness and generality of the PEM method. It outperforms the previous versions of SAM and the spline-based EDGE approach in identifying genes of interest that are differentially expressed in diverse manners.
https://doi.org/10.1142/9781860947575_0020
In most real-life gene expression data sets, there are multiple ordinal sample classes, each categorized as either the normal or the diseased type. Traditional feature or attribute selection methods treat the multiple classes equally without paying attention to up/down regulation across the normal and diseased types, while specific gene selection methods consider the differential expression across normal and diseased but ignore the existence of multiple classes. In this paper, to improve biomarker discovery, we propose to make the best use of both aspects: the differential expression (which can be viewed as domain knowledge of gene expression data) and the multiple classes (which can be viewed as a data set characteristic). We simultaneously take both aspects into account by employing rank-one generalized matrix approximations (GMA). Our results show that considering both aspects not only improves the accuracy of classifying the samples, but also provides a visualization method to effectively analyze the gene expression data on both genes and samples. Based on the GMA mechanism, we further propose an algorithm for obtaining a compact biomarker by reducing redundancy.
https://doi.org/10.1142/9781860947575_0021
A current major focus in genomics is the large-scale collection of genotype data in populations in order to detect variations in the population. The variation data are sought in order to address fundamental and applied questions in genetics that concern the haplotypes in the population. Since almost all the collected data is in the form of genotypes, but the downstream genetics questions concern haplotypes, the standard approach to this issue has been to try to first infer haplotypes from the genotypes, and then answer the downstream questions using the inferred haplotypes. That two-stage approach has potential deficiencies, giving rise to the general question of how well one can answer the downstream questions using genotype data without first inferring haplotypes, and also giving rise to the goal of computing the range of downstream answers that would be obtained over the range of possible inferred haplotype solutions. This paper provides some tools for the study of those issues, and some partial answers. We present algorithms to solve downstream questions concerning the minimum amount of recombination needed to derive given genotypic data, without first fixing a choice of haplotypes. We apply these algorithms to the goal of finding recombination hotspots, obtaining as good results as a published method that first infers haplotypes; and to the case of estimating the minimum amount of recombination needed to derive the true haplotypes underlying the genotypic data, obtaining weaker results compared to first inferring haplotypes using the program PHASE. Hence our tools allow an initial study of the two-stage versus one-stage issue, in the context of specific downstream questions, but our experiments certainly do not fully resolve the issue.
https://doi.org/10.1142/9781860947575_0022
Given two signed multi-chromosomal genomes Π and Γ with the same gene set, the problem of sorting by translocations (SBT) is to find a shortest sequence of translocations transforming Π into Γ, where the length of the sequence is called the translocation distance between Π and Γ. In 1996, Hannenhalli gave the first formula for the translocation distance, which led to an O(n³) algorithm for SBT. In 2005, Anne Bergeron et al. revisited this problem and gave an elementary proof of the formula for the translocation distance, which leads to a new O(n³) algorithm for SBT. In this paper, we show how to extend Bergeron's algorithm for SBT to include deletions, which allows us to compare genomes containing different genes. We present an asymptotically optimal algorithm for transforming Π into Γ by translocations and deletions, providing a feasible sequence of length at most OPT+2, where OPT is the minimum number of translocations and deletions transforming Π into Γ. Furthermore, this analysis can be used to approximate the minimum number of translocations and insertions transforming one genome into another.
https://doi.org/10.1142/9781860947575_0023
The abundance of repeat elements in the maize genome complicates its assembly. Retrotransposons alone are estimated to constitute at least 50% of the genome. In this paper, we introduce a problem called retroscaffolding, a new variant of the well-known scaffolding problem of ordering and orienting a set of assembled contigs in a genome assembly project. The key feature of this new formulation is that it takes advantage of the structural characteristics and abundance of a particular type of retrotransposon, the Long Terminal Repeat (LTR) retrotransposon. This approach is not meant to supplant but rather to complement other scaffolding approaches. The advantages of retroscaffolding are twofold: (i) it allows detection of regions containing LTR retrotransposons within the unfinished portions of a genome and can therefore guide the finishing process, and (ii) it provides a mechanism to lower sequencing coverage without impacting the quality of the final assembled genic portions. Sequencing and finishing costs dominate the expenditures in whole-genome projects, and it is often desirable, in the interest of saving cost, to reduce the effort spent on repetitive regions of a genome. The retroscaffolding technique provides a viable mechanism to this effect. Results of preliminary studies on maize genomic data validate the utility of our approach. We also report on the ongoing development of an algorithmic framework to perform retroscaffolding.
https://doi.org/10.1142/9781860947575_0024
Existing HIV-1 genotyping systems require a computationally expensive phase of multiple sequence alignment, and the alignments must be of sufficiently high quality for accurate genotyping. This is particularly a challenge when the number of strains is large. Here we propose a whole genome composition distance (WGCD) to measure the evolutionary closeness between two HIV-1 whole genomic RNA sequences, and use that measure to develop an HIV-1 genotyping system. Such a WGCD-based genotyping system avoids multiple sequence alignment and does not require any prior knowledge about evolutionary rates. Experimental results showed that the system is able to correctly identify the known subtypes, sub-subtypes, and individual circulating recombinant forms.
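The abstract does not give the WGCD formula itself. As a rough illustration of how an alignment-free, whole-genome composition measure can work, the sketch below compares normalized k-mer frequency vectors with a cosine-style distance and assigns a query to the nearest reference. The k-mer length, the distance, and the reference sequences are all assumptions for this example, not the paper's definition.

```python
from collections import Counter
from itertools import product
from math import sqrt

def kmer_freq(seq, k):
    """Normalized k-mer frequency vector over a fixed RNA alphabet."""
    seq = seq.upper().replace("T", "U")
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = max(sum(counts.values()), 1)
    return [counts["".join(p)] / total for p in product("ACGU", repeat=k)]

def composition_distance(seq_a, seq_b, k=4):
    """Illustrative alignment-free distance: 1 minus cosine similarity of k-mer profiles."""
    fa, fb = kmer_freq(seq_a, k), kmer_freq(seq_b, k)
    dot = sum(a * b for a, b in zip(fa, fb))
    na, nb = sqrt(sum(a * a for a in fa)), sqrt(sum(b * b for b in fb))
    return 1.0 - dot / (na * nb) if na and nb else 1.0

# Toy usage: assign a query genome to the closest reference, no alignment needed.
references = {                                   # hypothetical reference sequences
    "subtype_A": "AUGGCUAACGGAU" * 200,
    "subtype_B": "AUGCCGUUAGCCA" * 200,
}
query = "AUGGCUAACGGAU" * 180 + "AUGCCGUUAGCCA" * 20
print(min(references, key=lambda name: composition_distance(query, references[name])))
```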
https://doi.org/10.1142/9781860947575_0025
Assuming no interference, a multi-locus genetic likelihood is implemented based on a mathematical model of the recombination process in meiosis that accounts for events up to double crossovers in the genetic interval for any specified order of genetic markers. The mathematical model is realized with a straightforward algorithm that implements the likelihood computation process. The time complexity of the straightforward algorithm is exponential in the number of genetic markers, and implementation of the model for more than 7 genetic markers is not feasible, motivating the need for a novel algorithm. A recursive linking algorithm is proposed that decomposes the pool of genetic markers into segments and renders the model implementable for a large number of genetic markers. The recursive algorithm is shown to reduce the order of time complexity from exponential to linear. The improvement in time complexity is shown theoretically by a worst-case analysis of the algorithm and supported by run time results using data on linkage group I of the fungal genome of Neurospora crassa.
https://doi.org/10.1142/9781860947575_0026
The production of large quantities of diploid genotype data has created a need for computational methods for large-scale inference of haplotypes from genotypes. One promising approach to the problem has been to infer possible phylogenies explaining the observed genotypes in terms of putative descendants of some common ancestral haplotype. The first attempts at this problem proceeded on the restrictive assumption that observed sequences could be explained by a perfect phylogeny, in which each variant locus is presumed to have mutated exactly once over the sampled population’s history. Recently, the perfect phylogeny model was relaxed and the problem of reconstructing an imperfect phylogeny (IPPH) from genotype data was considered. A polynomial time algorithm was developed for the case when a single site is allowed to mutate twice, but the general problem remained open. In this work, we solve the general IPPH problem and show for the first time that it is possible to infer optimal q-near-perfect phylogenies from diploid genotype data in polynomial time for any constant q, where q is the number of “extra” mutations required in the phylogeny beyond what would be present in a perfect phylogeny. This work has application to the haplotype phasing problem as well as to various related problems in phylogenetic inference, analysis of sequence variability in populations, and association study design. Empirical studies on human data of known phase show this method to be competitive with the leading phasing methods and provide strong support for the value of continued research into algorithms for general phylogeny construction from diploid data.
https://doi.org/10.1142/9781860947575_0027
Haplotype inference by pure parsimony (HIPP) is known to be NP-hard. Despite this, many algorithms successfully solve HIPP instances on simulated and real data. In this paper, we explore the connection between algebraic rank and the HIPP problem, to help identify easy and hard instances of the problem. The rank of the input matrix is known to be a lower bound on the size of an optimal HIPP solution. We show that this bound is almost surely tight for data generated by randomly pairing p haplotypes derived from a perfect phylogeny when the number of distinct population members is more than (for some positive ∊). Moreover, with only a constant multiple more population members, and a common mutation, we can almost surely recover an optimal set of haplotypes in polynomial time. We examine the algebraic effect of allowing recombination, and bound the effect recombination has on rank. In the process, we prove a stronger version of the standard haplotype lower bound. We also give a complete classification of the rank of a haplotype matrix derived from a galled tree. This classification identifies a set of problem instances with recombination for which the rank lower bound is also tight for the HIPP problem.
https://doi.org/10.1142/9781860947575_0028
Oligo-based expression microarrays from Affymetrix typically contain thousands of redundant probe sets that match different regions of the same gene. We used linear regression and correlation to survey redundant probe set behavior across nearly 500 quality-screened experiments from the Arabidopsis ATH1 array manufactured by Affymetrix. We found that expression values from redundant probe set pairs were often poorly correlated. Pre-filtering expression results using MAS5.0 “present-absent” calls increased the overall percentage of well-correlated probe sets. However, poor correlation was still observed for a substantial number of probe set pairs. Visual inspection of non-correlated probe sets’ target genes suggests that some may be inappropriately merged gene models and represent independently expressed, but neighboring loci. Others may reflect differential regulation of alternative 3-prime end processing. Results are on-line at http://www.transvar.org/exp_cor/analysis.
https://doi.org/10.1142/9781860947575_0029
Structure motifs are amino acid packing patterns that occur frequently within a set of protein structures. We define a labeled graph representation of protein structure in which vertices correspond to amino acid residues and edges connect pairs of residues and are labeled by (1) the Euclidean distance between the Cα atoms of the two residues and (2) a boolean indicating whether the two residues are in physical/chemical contact. Using this representation, a structure motif corresponds to a labeled clique that occurs frequently among the graphs representing the protein structures. The pairwise distance constraints on each edge in a clique serve to limit the variation in geometry among different occurrences of a structure motif. We present an efficient constrained subgraph mining algorithm to discover structure motifs in this setting. Compared with contact graph representations, the number of spurious structure motifs is greatly reduced.
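A minimal sketch of the labeled-graph construction described above, assuming Cα coordinates are available: every residue pair gets an edge labeled with a discretized Cα-Cα distance and a boolean contact flag. The distance bin width and the use of a simple distance cutoff as the contact criterion are illustrative assumptions; the paper's contact definition may differ.

```python
import math
from itertools import combinations

def build_residue_graph(ca_coords, contact_cutoff=8.0, distance_bin=1.0):
    """Vertices are residue indices; every residue pair gets an edge labeled with
    (discretized Calpha-Calpha distance, contact flag)."""
    def dist(p, q):
        return math.sqrt(sum((a - b) ** 2 for a, b in zip(p, q)))
    vertices = list(range(len(ca_coords)))
    edges = {}
    for i, j in combinations(vertices, 2):
        d = dist(ca_coords[i], ca_coords[j])
        edges[(i, j)] = (round(d / distance_bin) * distance_bin,   # binned distance label
                         d <= contact_cutoff)                      # crude contact criterion
    return vertices, edges

# Toy usage with made-up coordinates (in angstroms).
ca = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 1.0, 0.0), (20.0, 5.0, 3.0)]
vertices, edges = build_residue_graph(ca)
print(edges[(0, 1)])   # close pair: small distance, in contact
print(edges[(0, 3)])   # distant pair: large distance, not in contact
```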
Using this algorithm, structure motifs were located for several SCOP families including the Eukaryotic Serine Proteases, Nuclear Binding Domains, Papain-like Cysteine Proteases, and FAD/NAD-linked Reductases. For each family, we typically obtain a handful of motifs within seconds of processing time. The occurrences of these motifs throughout the PDB were strongly associated with the original SCOP family, as measured using a hyper-geometric distribution. The motifs were found to cover functionally important sites like the catalytic triad for Serine Proteases and co-factor binding sites for Nuclear Binding Domains. The fact that many motifs are highly family-specific can be used to classify new proteins or to provide functional annotation in Structural Genomics Projects.
https://doi.org/10.1142/9781860947575_0030
The discovery of motifs in DNA sequences remains a fundamental and challenging problem in computational molecular biology and regulatory genomics, although a large number of computational methods have been proposed in the past decade. Among these methods, the Gibbs sampling strategy has shown great promise and is routinely used for finding regulatory motif elements in the promoter regions of co-expressed genes. In this paper, we present an enhancement to the Gibbs sampling method when the expression data of the concerned genes is given. A sequence weighting scheme is proposed by explicitly taking gene expression variation into account in Gibbs sampling. That is, every putative motif element is assigned a weight proportional to the fold change in the expression level of its downstream gene under a single experimental condition, and a position specific scoring matrix (PSSM) is estimated from these weighted putative motif elements. Such an estimated PSSM might represent a more accurate motif model since motif elements with dramatic fold changes in gene expression are more likely to represent true motifs. This weighted Gibbs sampling method has been implemented and successfully tested on both simulated and biological sequence data. Our experimental results demonstrate that the use of sequence weighting has a profound impact on the performance of a Gibbs motif sampling algorithm.
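A sketch of the weighting idea in isolation: each putative motif occurrence contributes to the position-specific scoring matrix in proportion to the expression fold change of its downstream gene. The pseudocount, the normalization, and the uniform background are assumptions made for this illustration, not the paper's exact estimator or its Gibbs sampling loop.

```python
import numpy as np

BASES = "ACGT"

def weighted_pssm(sites, fold_changes, pseudocount=0.5):
    """Estimate a PSSM from putative motif sites, weighting each site by the
    expression fold change of its downstream gene."""
    w = np.asarray(fold_changes, dtype=float)
    w = w / w.sum()                                   # normalize the weights
    length = len(sites[0])
    counts = np.full((length, 4), pseudocount)
    for site, wi in zip(sites, w):
        for pos, base in enumerate(site.upper()):
            counts[pos, BASES.index(base)] += wi
    probs = counts / counts.sum(axis=1, keepdims=True)
    return np.log2(probs / 0.25)                      # log-odds against a uniform background

def score(pssm, site):
    """Log-odds score of a candidate site under the PSSM."""
    return sum(pssm[pos, BASES.index(b)] for pos, b in enumerate(site.upper()))

# Toy usage: sites with large fold changes dominate the motif model.
sites = ["TGACTCA", "TGACGCA", "AGACTCA", "CCCCCCC"]
fold_changes = [8.0, 6.0, 4.0, 1.1]                   # hypothetical expression fold changes
pssm = weighted_pssm(sites, fold_changes)
print(round(score(pssm, "TGACTCA"), 2), round(score(pssm, "CCCCCCC"), 2))
```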
https://doi.org/10.1142/9781860947575_0031
Predicting novel cleavage sites for HIV-1 protease in non-viral proteins is a difficult task because of the scarcity of previous cleavage data on proteins in a native state. We introduce a three-level hierarchical classifier which combines information from experimentally verified short oligopeptides with secondary structure and solvent accessibility information from prediction servers to predict potential cleavage sites in non-viral proteins. The best second-level classifier of the hierarchy, which uses secondary structure information, is one based on logistic regression. By using this level of classification, the false positive ratio was reduced by more than half compared to the first-level classifier using only the oligopeptide cleavage information. The method can also be applied to other protease specificity problems, to combine information from oligopeptides and structure from native proteins.
https://doi.org/10.1142/9781860947575_0032
Motif discovery is a crucial part of regulatory network identification and is therefore widely studied in the literature. Motif discovery programs search for statistically significant, well-conserved and over-represented patterns in given promoter sequences. When gene expression data is available, there are three main paradigms for motif discovery: cluster-first, regression, and joint probabilistic. The success of motif discovery depends highly on the homogeneity of input sequences, regardless of the paradigm employed. In this work, we propose a methodology for obtaining homogeneous subsets from input sequences for increased motif discovery performance. It is a unification of the cluster-first and regression paradigms based on iterative cluster re-assignment. The experimental results show the effectiveness of the methodology.
https://doi.org/10.1142/9781860947575_0033
Biological processes are always carried out through large numbers of genes (and their products), and these activities are often organized into different cellular pathways: sets of genes that cooperate to carry out specific biological functions. Owing to the development of microarray technology and its ability to simultaneously measure the expression of thousands of genes, effective algorithms to reveal biologically significant pathways are possible. However, open problems such as the large amount of noise in microarrays and the fact that most biological processes are overlapping and active only under partial conditions pose great challenges to researchers. In this paper, we propose a novel approach to identify overlapping pathways by extracting partial expression profiles from coherent cliques of clusters scattered across different conditions. We first decompose gene expression data into highly overlapping segments and partition genes into clusters in each segment; then we organize all the resulting clusters into a cluster graph and search for coherent cliques of clusters; finally, we extract expression profiles from the coherent cliques and shape biological pathways as genes consistent with these profiles. We compare our algorithm with several recent models, and the experimental results justify the advantages of our approach: it robustly identifies overlapping pathways in arbitrary sets of conditions and consequently discovers more biologically significant pathways in terms of enrichment of gene functions.
https://doi.org/10.1142/9781860947575_0034
Cellular pathways are composed of multiple reactions and interactions mediated by genes. Many of these reactions are common to multiple pathways, and each reaction might be potentially mediated by multiple genes in the same genome. Existing pathway reconstruction procedures assign a gene to all pathways in which it might catalyze a reaction, leading to a many-to-many mapping of genes to pathways. However, it is unlikely that all genes that are capable of mediating a certain reaction are involved in all the pathways that contain it. Rather, it is more likely that each gene is optimized to function in specific pathway(s). Hence, existing procedures for pathway construction produce assignments that are ambiguous. Here we present a probabilistic algorithm for the assignment of genes to pathways that addresses this problem and reduces this ambiguity. Our algorithm uses expression data, database annotations and similarity data to infer the most likely assignments, and estimate the affinity of each gene with the known cellular pathways. We apply the algorithm to metabolic pathways in Yeast and compare the results to assignments that were experimentally verified.
https://doi.org/10.1142/9781860947575_0035
The genetic analysis of spatial patterns of gene expression relies on the direct visualization of the presence or absence of gene products (mRNA or protein) at a given developmental stage (time) of a developing animal. The raw data produced by these experiments include images of Drosophila embryos showing a particular gene expression pattern revealed by a gene-specific probe. The identification of genes showing spatial and temporal overlaps in their expression patterns is fundamentally important for formulating and testing gene interaction hypotheses. Comparison of expression patterns is most biologically meaningful when images from a similar time point (developmental stage range) are compared. In this paper, we propose a computational system for automatic developmental stage classification by image analysis. This classification system uses image textural properties at a sub-block level across developmental stages as distinguishing features. Gabor filters are applied to extract features from image sub-blocks. Robust implementations of Linear Discriminant Analysis (LDA) are employed to extract the most discriminant features for the classification. Experiments on a collection of 2705 expression pattern images from early stages show that the proposed system significantly outperforms previously reported results in terms of classification accuracy, demonstrating its promise for reducing the time biologists spend assigning embryo stage ranges.
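As a rough illustration of the feature-extraction step, the sketch below builds a real-valued Gabor kernel, filters an image at a few orientations, and summarizes each sub-block by the mean and standard deviation of the response; LDA would then be applied to these feature vectors. The kernel parameters, block size, orientations, and synthetic image are all assumptions, not the system's actual settings.

```python
import numpy as np
from scipy.signal import convolve2d

def gabor_kernel(size=15, wavelength=6.0, theta=0.0, sigma=3.0, gamma=0.5):
    """Real-valued Gabor kernel: an oriented sinusoid under a Gaussian envelope."""
    half = size // 2
    y, x = np.mgrid[-half:half + 1, -half:half + 1]
    xr = x * np.cos(theta) + y * np.sin(theta)
    yr = -x * np.sin(theta) + y * np.cos(theta)
    envelope = np.exp(-(xr ** 2 + (gamma * yr) ** 2) / (2 * sigma ** 2))
    return envelope * np.cos(2 * np.pi * xr / wavelength)

def block_texture_features(image, block=32, thetas=(0, np.pi / 4, np.pi / 2, 3 * np.pi / 4)):
    """Mean and standard deviation of each Gabor response, per image sub-block."""
    features = []
    for theta in thetas:
        response = convolve2d(image, gabor_kernel(theta=theta), mode="same", boundary="symm")
        for i in range(0, image.shape[0] - block + 1, block):
            for j in range(0, image.shape[1] - block + 1, block):
                patch = response[i:i + block, j:j + block]
                features.extend([patch.mean(), patch.std()])
    return np.array(features)

# Toy usage: a synthetic striped "image" plus noise, 128 x 128 pixels.
rng = np.random.default_rng(1)
image = np.sin(np.arange(128)[:, None] / 3.0) + 0.1 * rng.normal(size=(128, 128))
features = block_texture_features(image)
print(features.shape)   # one texture feature vector per image; LDA is trained on such vectors
```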
https://doi.org/10.1142/9781860947575_0036
Recent research efforts have made available genome-wide, high-throughput protein-protein interaction (PPI) maps for several model organisms. This has enabled the systematic analysis of PPI networks, which has become one of the primary challenges for the systems biology community. In this study, we attempt to better understand the topological structure of PPI networks by comparing them against man-made communication networks, and more specifically, the Internet.
Our comparative study is based on a comprehensive set of graph metrics. Our results exhibit an interesting dichotomy. On the one hand, both networks share several macroscopic properties such as scale-free and small-world properties. On the other hand, the two networks exhibit significant topological differences, such as the cliquishness of the highest-degree nodes. We attribute these differences to the distinct design principles and constraints that both networks are assumed to satisfy. We speculate that evolutionary constraints favoring survivability and diversification underlie the building process of PPI networks, whereas the leading force shaping the Internet topology is a decentralized optimization process geared towards efficient node communication.
https://doi.org/10.1142/9781860947575_0037
Determining the function of proteins is a problem with immense practical impact on the identification of inhibition targets and the causes of side effects. Unfortunately, experimental determination of protein function is expensive and time consuming. For this reason, algorithms for computational function prediction have been developed to focus and accelerate this effort. These algorithms are comparison techniques which identify matches of geometric and chemical similarity between motifs, representing known functional sites, and substructures of functionally uncharacterized proteins (targets). Matches of statistically significant geometric and chemical similarity can identify targets with active sites cognate to the matching motif. Unfortunately, statistically significant matches can include false positive matches to functionally unrelated proteins. We target this problem by presenting Cavity Aware Match Augmentation (CAMA), a technique which uses C-spheres to represent active clefts which must remain vacant for ligand binding. CAMA rejects matches to targets without similar binding volumes. On 18 sample motifs, we observed that introducing C-spheres eliminated 80% of false positive matches and maintained 87% of true positive matches found with identical motifs lacking C-spheres. Analyzing a range of C-sphere positions and sizes, we observed that some high-impact C-spheres eliminate more false positive matches than others. High-impact C-spheres can be detected with a geometric analysis we call Cavity Scaling, permitting us to refine our initial cavity-aware motifs to contain only high-impact C-spheres. In the absence of expert knowledge, Cavity Scaling can guide the design of cavity-aware motifs to eliminate many false positive matches.
https://doi.org/10.1142/9781860947575_0038
Prediction of the subcellular localization of proteins is important for genome annotation, protein function prediction, and drug discovery. We present a prediction method for Gram-negative bacteria that uses ten one-versus-one support vector machine (SVM) classifiers, where compartment-specific biological features are selected as input to each SVM classifier. The final prediction of localization sites is determined by integrating the results from the ten binary classifiers using a combination of majority votes and a probabilistic method. The overall accuracy reaches 91.4%, which is 1.6% better than the state-of-the-art system, in a ten-fold cross-validation evaluation on a benchmark data set. We demonstrate that feature selection guided by biological knowledge and insights in one-versus-one SVM classifiers can lead to a significant improvement in prediction performance. Our model also achieves a highly accurate 92.8% overall accuracy for proteins with dual localizations.
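For the five Gram-negative localization sites there are C(5,2) = 10 one-versus-one classifiers, and the simplest way to combine them is a majority vote over their pairwise decisions. The sketch below shows just that combination step, with stub classifiers standing in for the feature-specific SVMs; the probabilistic tie-breaking component mentioned above is omitted.

```python
from collections import Counter
from itertools import combinations

SITES = ["cytoplasm", "inner membrane", "periplasm", "outer membrane", "extracellular"]

def majority_vote(pairwise_classifiers, protein):
    """Combine one-versus-one classifiers by majority vote over their pairwise decisions."""
    votes = Counter(clf(protein) for clf in pairwise_classifiers.values())
    label, _count = votes.most_common(1)[0]
    return label, votes

# Stub pairwise classifiers standing in for the feature-specific SVMs:
# each votes for `true_site` whenever it is one of its two candidate sites.
def make_stub(a, b, true_site="periplasm"):
    return lambda _protein: true_site if true_site in (a, b) else a

classifiers = {(a, b): make_stub(a, b) for a, b in combinations(SITES, 2)}
label, votes = majority_vote(classifiers, protein="MKKTAIAIAVALAGFATVAQA")  # sequence ignored by stubs
print(label, dict(votes))
```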
https://doi.org/10.1142/9781860947575_0039
MHC (Major Histocompatibility Complex) proteins are heterodimeric integral membrane proteins. MHC molecules are divided into two subclasses, class I and class II, which differ from each other in the size of their binding pockets. Predicting the affinity of these peptides is important for vaccine design. It is also vital for understanding the roles of the immune system in various diseases. Due to the variability of the locations of the class II peptide binding cores, predicting the affinity of these peptides is difficult. In this paper, we propose a new method for predicting the affinity of MHC class II binding peptides based on their sequences. Our method classifies peptides as binding or non-binding. The prediction is based on a three-step algorithm. In the first step, we identify informative n-grams based on their frequencies in both classes. In the next step, the alphabet size is reduced. In the last step, the class of a given sequence is predicted using the informative n-grams. We have tested our method on the MHC Bench IV-b data set [13] and compared it with various other methods in the literature.
https://doi.org/10.1142/9781860947575_0040
The ratio of the number of nonsynonymous substitutions per site (Ka) over the number of synonymous substitutions per site (Ks) has often been used to detect positive selection. Investigators now commonly generate Ka/Ks ratio profiles in a sliding window to look for peaks and valleys in order to identify regions under positive selection. Here we show that the interpretation of peaks in the Ka/Ks profile as evidence for positive selection can be misleading. Genic regions with Ka/Ks > 1 in the MRG gene family, previously claimed to be under positive selection, are associated with a high frequency of polar amino acids with a high mutability. This association between an increased Ka and a high proportion of polar amino acids appears general and not limited to the MRG gene family or the sliding-window approach. For example, the sites detected to be under positive selection in the HIV1 protein-coding genes with a high posterior probability turn out to be mostly occupied by polar amino acids. These findings caution against invoking positive selection from Ka/Ks ratios and highlight the need for considering biochemical properties of the protein domains showing high Ka/Ks ratios. In short, a high Ka/Ks ratio may arise from the intrinsic properties of amino acids instead of from extrinsic positive selection.
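For reference, a sliding-window Ka/Ks profile of the kind discussed above can be computed as below, assuming per-codon substitution and site counts have already been obtained by a standard counting method (e.g., Nei-Gojobori); those counts, the window length, and the step are illustrative assumptions.

```python
def kaks_profile(ka_counts, ks_counts, ka_sites, ks_sites, window=30, step=3):
    """Sliding-window Ka/Ks: per-codon nonsynonymous/synonymous substitution and
    site counts are summed over each window before taking the ratio."""
    profile = []
    n = len(ka_counts)
    for start in range(0, n - window + 1, step):
        sl = slice(start, start + window)
        ka = sum(ka_counts[sl]) / max(sum(ka_sites[sl]), 1e-9)
        ks = sum(ks_counts[sl]) / max(sum(ks_sites[sl]), 1e-9)
        profile.append((start, ka / ks if ks > 0 else float("inf")))
    return profile

# Toy usage with made-up per-codon counts for a 60-codon alignment.
ka_counts = [0, 1, 0, 2] * 15      # nonsynonymous substitutions per codon
ks_counts = [1, 0, 1, 0] * 15      # synonymous substitutions per codon
ka_sites  = [2.5] * 60             # nonsynonymous sites per codon
ks_sites  = [0.5] * 60             # synonymous sites per codon
for start, ratio in kaks_profile(ka_counts, ks_counts, ka_sites, ks_sites):
    print(start, round(ratio, 2))
```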
https://doi.org/10.1142/9781860947575_0041
Accurate prediction of protein function and interactions from diverse genomic data is a key problem in systems biology. Heterogeneous data integration remains a challenge, particularly due to noisy data sources, diversity of coverage, and functional biases. It is thus important to understand the behavior and robustness of data integration methods in the context of various biological functions. We focus on the ability of Bayesian networks to predict functional relationships between proteins under a variety of conditions. This study considers the effect of network structure and compares expert estimated conditional probabilities with those learned using a generative method (expectation maximization) and a discriminative method (extended logistic regression). We consider the contributions of individual data sources and interpret these results both globally and in the context of specific biological processes. We find that it is critical to consider variation across biological functions; even when global performance is strong, some categories are consistently predicted well, and others are difficult to analyze. All learned models outperform the equivalent expert estimated models, although this effect diminishes as the amount of available data decreases. These learning techniques are not specific to Bayesian networks, and thus our conclusions should generalize to other methods for data integration. Overall, Bayesian learning provides a consistent benefit in data integration, but its performance and the impact of heterogeneous data sources must be interpreted from the perspective of individual functional categories.
https://doi.org/10.1142/9781860947575_0042
In protein identification through MS/MS spectra, it is critical to accurately predict the theoretical spectrum from a peptide sequence, which depends heavily on a quantitative understanding of the fragmentation process. To date, widely used database searching methods have adopted a simple statistical model to predict the theoretical spectrum, yielding for some peptides a spectrum that deviates significantly from the experimental spectrum and therefore preventing automated positive identification. Here, in order to derive an improved prediction model, we propose a novel method to automatically learn the factors influencing fragmentation from a training set of MS/MS spectra. In this method, the determination of factors is converted into an optimization problem that minimizes an objective function measuring the distance between the experimental spectrum and the theoretical one. An iterative algorithm is then proposed to minimize the non-linear objective function. We implemented the methods and tested them on experimental data. The examination of 1451 spectra is in good agreement with established knowledge about peptide fragmentation, such as the tendency of cleavage towards the middle of the peptide and proline's preference for N-terminal cleavage. Moreover, on a testing set containing 1425 spectra, comparison between predicted and experimental spectra yields a median correlation of 0.759, showing the method's ability to predict a "realistic" spectrum. The results in this paper contribute to accurate protein identification through both database searching and de novo methods.
https://doi.org/10.1142/9781860947575_0043
Tandem mass spectrometry (MS/MS) has become a standard way of identifying peptides and proteins. A scoring function plays an important role in MS/MS data analysis. De novo sequencing is the computational step of deriving a peptide sequence from an MS/MS spectrum, normally by constructing the peptide that maximizes the scoring function. A number of polynomial-time algorithms have been developed based on scoring functions that consider only either the N-terminal or the C-terminal fragment ions of the peptide. It has remained unknown whether the problem stays polynomial-time solvable when internal fragment ions are considered. In this paper, we prove that internal fragment ions make the de novo sequencing problem NP-complete. We also propose a regression model based scoring method to incorporate correlations between the fragment ions. Our scoring function is combined with the PEAKS de novo sequencing algorithm and tested on ion trap data. The experimental results show that the regression model based scoring method can remarkably improve de novo sequencing accuracy.
https://doi.org/10.1142/9781860947575_0044
Recent studies of gene expression in cancerous tumors have revealed that cancers presenting indistinguishable symptoms in the clinic can represent substantially different entities at the molecular level. The ability to distinguish between these different cancers makes possible more accurate prognoses and more finely targeted therapeutics. Making full use of this knowledge, however, requires characterizing commonly occurring cancer sub-types and the specific molecular abnormalities that produce them. Computational approaches to this problem to date have been hindered by the fact that tumors are highly heterogeneous masses typically containing cells at multiple stages of progression from healthy to aggressively malignant. We present a computational approach for taking advantage of tumor heterogeneity when characterizing tumor progression pathways by inferring those pathways from single-cell assays. Our approach uses phylogenetic algorithms to infer likely evolutionary sequences producing cell populations in single tumors, which are in turn used to create a profile of commonly used pathways across the patient population. This approach is combined with expectation maximization to infer unknown parameters used in the phylogeny construction. We demonstrate the approach on a set of fluorescent in situ hybridization (FISH) data measuring cell-by-cell gene and chromosome copy numbers in a large sample of breast cancers. The results validate the proposed computational methods by showing consistency with several previous findings on these cancers. They also provide novel insights into the mechanisms of tumor progression in these patients.
https://doi.org/10.1142/9781860947575_0045
In vitro studies of epithelial cell morphogenesis have demonstrated the influence of environment composition and orientation in the development of multicellular epithelial structures such as tubules and cysts. We have constructed a low resolution, discrete event simulation model and report on its use to explore how experimentally observed morphogenetic phenomena under four growth conditions might be generated and controlled. We identified simulation attributes that may have in vitro counterparts. We studied how changes in the logic governing simulated epithelial cell behavior might cause abnormal growth. Simulation results support the importance of a polarized response to the environment to the generation of a normal epithelial phenotype and show how disruptions of tight mechanistic control lead to aberrant growth characteristics.
https://doi.org/10.1142/9781860947575_0046
Many biological databases contain a large number of variables, among which events of interest may be very infrequent. Using a single data mining method to analyze such databases may not find adequate predictors. The HIV Drug Resistance Database at Stanford University stores sequential HIV-1 genotype-test results on patients taking antiretroviral drugs. We have analyzed the infrequent event of gene mutation changes by combining three data mining methods. We first use association rule analysis to scan through the database and identify potentially interesting mutation patterns with relatively high frequency. Next, we use logistic regression and classification trees to further investigate these patterns by analyzing the relationship between treatment history and mutation changes. Although the AUC measures of the overall prediction are not very high, our approach can effectively identify strong predictors of mutation change and thus focus the analytic efforts of researchers on verifying these results.
https://doi.org/10.1142/9781860947575_0047
In ovarian cancer treatment, the chemotherapy drug cisplatin often induces drug resistance after prolonged use, causing cancer relapse and the eventual death of patients. Cisplatin-induced drug resistance is known to involve a complex set of cellular changes, but its molecular mechanism(s) remain unclear. In this study, we designed a systems biology approach to examine global protein-level and network-level changes by comparing proteomics profiles between cisplatin-resistant and cisplatin-sensitive cell lines. First, we used an experimental proteomics method based on a label-free Liquid Chromatography / Mass Spectrometry (LC/MS) platform to obtain a list of 119 proteins that are differentially expressed in the samples. Second, we expanded these proteins into a cisplatin-resistance activated sub-network, which consists of 1230 proteins in 1111 protein interactions. An examination of network topology features reveals that the activated responses in the network are closely coupled. Third, we examined sub-network proteins using Gene Ontology categories. We found significant enrichment of proton-transporting ATPase and ATP synthase complexes in addition to protein binding proteins. Fourth, we examined sub-network protein interaction function categories using two-dimensional visualization matrixes. We found that significant cellular physiological responses arise from endogenous, abiotic, and stress-related signals, which correlates well with the known facts that internalized cisplatin causes DNA damage and induces cell stress. Fifth and finally, we developed a new visual representation structure for the display of activated sub-networks, using functional categories as network nodes and their crosstalk as network edges. This type of sub-network further shows that while cell communication and cell growth are generally important to tumor mechanisms, molecular regulation of cell differentiation and development caused by responses to genome-wide stress seems to be more relevant to the acquisition of drug resistance.
https://doi.org/10.1142/9781860947575_bmatter
AUTHOR INDEX.