High-throughput sequencing and functional genomics technologies have given us the human genome sequence as well as those of other experimentally, medically, and agriculturally important species, thus enabling large-scale genotyping and gene expression profiling of human populations. Databases containing large numbers of sequences, polymorphisms, structures, metabolic pathways, and gene expression profiles of normal and diseased tissues are rapidly being generated for human and model organisms. Bioinformatics is therefore gaining importance in the annotation of genomic sequences; the understanding of the interplay among and between genes and proteins; the analysis of the genetic variability of species; the identification of pharmacological targets; and the inference of evolutionary origins, mechanisms, and relationships. This proceedings volume contains an up-to-date exchange of knowledge, ideas, and solutions to conceptual and practical issues of bioinformatics by researchers, professionals, and industry practitioners at the 6th Asia-Pacific Bioinformatics Conference held in Kyoto, Japan, in January 2008.
Sample Chapter(s)
Chapter 1: Recent Progress in Phylogenetic Combinatorics (185 KB)
https://doi.org/10.1142/9781848161092_fmatter
PREFACE
APBC 2008 ORGANIZATION
CONTENTS
https://doi.org/10.1142/9781848161092_0001
No abstract received.
https://doi.org/10.1142/9781848161092_0002
KEGG (http://www.genome.jp/kegg/) is a suite of databases that integrates genomic, chemical, and systemic functional aspects of biological systems. KEGG provides a reference knowledge base for linking genomes to life through the process of PATHWAY mapping, which maps, for example, the genomic or transcriptomic content of genes onto KEGG reference pathways to infer systemic behaviors of the cell or the organism. In addition, KEGG provides a reference knowledge base for linking genomes to the environment, such as for the analysis of drug-target relationships, through the process of BRITE mapping. KEGG BRITE is an ontology database representing functional hierarchies of various biological objects, including molecules, cells, organisms, diseases, and drugs, as well as relationships among them. The KEGG resource is being expanded to meet the needs of practical applications. KEGG PATHWAY now contains 26 pathway maps for human diseases in four subcategories: neurodegenerative disorders, infectious diseases, metabolic disorders, and cancers. Although such maps will continue to be added, they will never be sufficient to represent our knowledge of the molecular mechanisms of diseases, because in many cases that knowledge is too fragmentary to represent as pathways. KEGG DISEASE is a new addition to the KEGG suite that accumulates molecular-level knowledge on diseases, represented as lists of genes, drugs, biomarkers, etc. KEGG DRUG now covers all approved drugs in the U.S. and Japan. KEGG DRUG is a structure-based database: each entry is a unique chemical structure that is linked to standard generic names and associated with efficacy and target information as well as drug classifications. Target information is presented in the context of KEGG pathways, and drug classifications are part of KEGG BRITE. The generic names are linked to trade names and subsequently to outside resources of package-insert information whenever available.
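The PATHWAY mapping idea described above amounts to projecting a gene set onto reference pathway gene sets. A minimal sketch in Python, using made-up pathway contents rather than real KEGG records:

```python
# Toy illustration of PATHWAY mapping: project a gene list onto
# reference pathways by set intersection. The pathway contents below
# are hypothetical stand-ins, not real KEGG data.
reference_pathways = {
    "hsa00010 Glycolysis": {"HK1", "PFKM", "PKM", "GAPDH"},
    "hsa04210 Apoptosis": {"CASP3", "CASP9", "TP53", "BAX"},
}

def map_genes(genes, pathways):
    """Rank pathways by how many of the input genes they contain."""
    hits = {name: sorted(members & genes) for name, members in pathways.items()}
    return sorted(((name, h) for name, h in hits.items() if h),
                  key=lambda kv: -len(kv[1]))

sample = {"HK1", "GAPDH", "TP53"}
for name, hit_genes in map_genes(sample, reference_pathways):
    print(name, hit_genes)
```

A real analysis would draw the pathway membership sets from the KEGG databases themselves; the ranking-by-overlap step is the same.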
This reflects our effort to make KEGG more useful to the general public.
https://doi.org/10.1142/9781848161092_0003
To assess the feasibility of extracting protein interactions from text, we recently organized the BioCreative II challenge (http://biocreative.sourceforge.net) in collaboration with the MINT and IntAct databases. The competition was divided into four sub-tasks: a) ranking of publications by their relevance to the experimental determination of protein interactions, b) detection of protein interaction partners in text, c) detection of key sentences describing protein interactions, and d) detection of the experimental technique used to determine the interactions. Twenty teams participated in the competition, using full text and the information on interactions, sentences, and experimental vocabularies provided by the associated databases. The results were quite promising and clearly pointed to the main challenges, setting the path for future research. Furthermore, BioCreative has channeled the collaboration of several teams toward the creation of the first text-mining meta-server (the complete set of BioCreative papers is to be published in a special issue of Genome Biology).
Regarding the extraction of information on protein interactions from genomic information, over the years my group and others have contributed to the development of a set of methods based on the concept of concerted evolution between interacting protein families. In a new effort, we have recently developed a completely new approach that uses the full power of co-evolution to integrate information from complete collections of protein families.
https://doi.org/10.1142/9781848161092_0004
We introduce a general framework for string kernels. This framework can produce various types of kernels, including a number of existing kernels, for use with support vector machines (SVMs). Within this framework, we can select informative subsequences to reduce the dimensionality of the feature space, model mutations in biological sequences, and combine the contributions of subsequences in a weighted fashion to obtain the target kernel. For practical computation, we develop a novel tree structure, coupled with a traversal algorithm, to speed up the computation. Experimental results on a benchmark SCOP data set show that the kernels produced by our framework outperform existing spectrum kernels in both efficiency and ROC50 scores.
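For context, the plain k-spectrum kernel that such frameworks generalize and are compared against can be written in a few lines; this is a generic sketch, not the authors' implementation:

```python
from collections import Counter

def spectrum(seq, k):
    """k-spectrum feature map: counts of every length-k substring."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def spectrum_kernel(x, y, k):
    """Inner product of the two k-spectrum feature vectors."""
    sx, sy = spectrum(x, k), spectrum(y, k)
    return sum(count * sy[s] for s, count in sx.items())

# Shared 3-mers of the two strings are ATT, TTA, TAC -> kernel value 3
print(spectrum_kernel("GATTACA", "ATTAC", 3))
```

Frameworks like the one above refine this baseline by restricting the feature space to informative subsequences, allowing mutations, and weighting each subsequence's contribution.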
https://doi.org/10.1142/9781848161092_0005
The intra-nuclear organisation of proteins is based on possibly transient interactions with morphologically defined compartments like the nucleolus. The fluidity of trafficking challenges the development of models that accurately identify compartment membership for novel proteins. A growing inventory of nucleolar proteins is here used to train a support vector machine to recognise sequence features that allow the automatic assignment of compartment membership. We explore a range of sequence kernels and find that, while some success is achieved with a profile-based local alignment kernel, the problem is ill-suited to a standard compartment-classification approach.
https://doi.org/10.1142/9781848161092_0006
In the past decade, many automated methods for predicting the subcellular localization of proteins have been proposed, utilizing a wide range of principles and learning approaches. Based on an experimental evaluation of different methods and their theoretical properties, we propose combining a well-balanced set of existing approaches into new, ensemble-based prediction methods. The experimental evaluation shows that our ensembles improve substantially over the underlying base methods.
https://doi.org/10.1142/9781848161092_0007
In this paper we propose new methods of chemical structure classification based on the integration of graph database mining from data mining and graph kernel functions from machine learning. In our method, we first identify a set of general graph patterns in chemical structure data. These patterns are then used to augment a graph kernel function that calculates the pairwise similarity between molecules. The resulting similarity matrix is used to classify chemical compounds via kernel machines such as the support vector machine (SVM). Our results indicate that the pattern-based approach to graph similarity yields performance comparable to, and sometimes exceeding, that of existing state-of-the-art approaches. In addition, the identification of highly discriminative patterns for activity classification provides evidence that our methods can make generalizations about a compound's function given its chemical structure. While we evaluated our methods on molecular structures, they are designed to operate on general graph data and hence could easily be applied to other domains in bioinformatics.
https://doi.org/10.1142/9781848161092_0008
The inverse protein folding problem is that of designing an amino acid sequence which has a prescribed native protein fold. This problem arises in drug design, where a particular structure is necessary to ensure proper protein-protein interactions. The input to the inverse protein folding problem is a shape, and the goal is to design a protein sequence with a unique native fold that closely approximates the input shape. Gupta et al. [1] introduced a design in the 2D HP model of Dill that can be used to approximate any given (2D) shape. They conjectured that the protein sequences of their design are stable, but proved stability only for an infinite class of very basic structures. The HP model divides amino acids into two groups, hydrophobic (H) and polar (P), and considers only hydrophobic interactions between neighboring H amino acids in the energy formula. Another significant force acting during protein folding is the disulfide (SS) bridge formed between two cysteine amino acids. In this paper, we enrich the HP model by adding cysteines as a third group of amino acids. A cysteine monomer acts as an H amino acid, but in addition two neighboring cysteines can form a bridge to further reduce the energy of the fold. We call our model the HPC model. We consider a subclass of the linear structures designed in Gupta et al. [1] which is rich enough to approximate (although more coarsely) any given structure. We refine these structures for the HPC model by setting approximately half of the H amino acids to cysteines. We conjecture that these structures are stable under the HPC model and prove it under an additional assumption that non-cysteine H amino acids act as cysteines, i.e., they tend to form their own bridges to reduce the energy. In the proof we make efficient use of a computational tool, 2DHPSolver, which significantly speeds up the progress in the technical part of the proof.
This is a preliminary work, and we believe that the same techniques can be used to prove this result without the artificial assumption about non-cysteine H monomers.
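The energy function of such a lattice model is easy to state concretely. The sketch below scores a 2D fold under a simplified reading of the HPC model described above; it deliberately ignores the one-bridge-per-cysteine matching constraint, so it is illustrative only:

```python
def fold_energy(seq, coords):
    """Energy of a 2D lattice fold in a simplified HPC model.

    seq    -- string over {'H', 'P', 'C'}; C behaves as H plus bridging
    coords -- list of (x, y) lattice points, one per monomer

    Each lattice contact between two non-consecutive hydrophobic
    monomers (H or C) contributes -1; a C-C contact contributes an
    extra -1 for the bridge. (The real model allows at most one
    bridge per cysteine; that matching constraint is ignored here.)
    """
    energy = 0
    for i in range(len(seq)):
        for j in range(i + 2, len(seq)):  # skip chain neighbours
            (xi, yi), (xj, yj) = coords[i], coords[j]
            if abs(xi - xj) + abs(yi - yj) == 1:  # lattice contact
                if seq[i] in "HC" and seq[j] in "HC":
                    energy -= 1
                    if seq[i] == "C" and seq[j] == "C":
                        energy -= 1
    return energy

# A 2x2 square fold of the 4-mer "HCCH": the only non-chain contact
# is between the two terminal H monomers, so the energy is -1.
print(fold_energy("HCCH", [(0, 0), (1, 0), (1, 1), (0, 1)]))
```

Tools such as 2DHPSolver reason over all folds of a sequence; the function above only evaluates one given fold.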
https://doi.org/10.1142/9781848161092_0009
Graph-theoretic properties of proteins can be used to distinguish correctly folded proteins from well-designed decoy sets. The 3D structures of proteins are represented as graphs, using two different representations: Delaunay tessellations and contact map graphs. Graph-theoretic properties of both graph types yielded high classification accuracy for protein discrimination. Fisher, linear, quadratic, neural network, and support vector classifiers were used to classify the protein structures; the best classifier accuracy was over 98%. The results show that characteristic graph-theoretic properties can be used in the detection of native folds.
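A contact map graph of the kind used here can be built directly from residue coordinates with a distance cutoff. The sketch below uses toy 2D coordinates and an arbitrary threshold, then computes one simple graph-theoretic feature (mean degree) of the sort fed to such classifiers:

```python
import math

def contact_graph(coords, threshold):
    """Contact-map graph: connect residues i, j (non-neighbours on the
    chain) whose coordinates lie within the distance threshold."""
    n = len(coords)
    adj = {i: set() for i in range(n)}
    for i in range(n):
        for j in range(i + 2, n):
            if math.dist(coords[i], coords[j]) < threshold:
                adj[i].add(j)
                adj[j].add(i)
    return adj

def mean_degree(adj):
    """One simple graph-theoretic feature usable for classification."""
    return sum(len(nbrs) for nbrs in adj.values()) / len(adj)

# Toy 2D coordinates and a made-up cutoff (real contact maps use 3D
# C-alpha coordinates and a cutoff of roughly 8 angstroms):
coords = [(0.0, 0.0), (1.5, 0.0), (3.0, 0.0), (1.5, 1.0)]
adj = contact_graph(coords, threshold=2.0)
print(adj, mean_degree(adj))
```

In practice a whole vector of such graph properties (degree statistics, clustering coefficients, etc.) is extracted per structure and passed to the classifier.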
https://doi.org/10.1142/9781848161092_0010
To assess the physico-chemical characteristics of protein-protein interactions, protein sequences and overall structural folds have been analyzed previously. Beyond this, the discovery and examination of amino acid patterns at binding sites defined by structural proximity in 3-dimensional (3D) space are essential. In this paper, we investigate the interacting preferences of 3D pattern pairs discovered separately in transient and obligate protein complexes. These 3D pattern pairs are not necessarily sequence-consecutive, but each residue in one of two groups of amino acids from two proteins in a complex is within a certain distance threshold (in Å) of most residues in the other group. We develop an algorithm called AA-pairs, in which every pair of interacting proteins is represented as a bipartite graph, and which discovers all maximal quasi-bicliques from every bipartite graph to form our 3D pattern pairs. From the 112 and 2533 highly conserved 3D pattern pairs discovered in transient and obligate complexes respectively, we observe that Ala and Leu are the most frequently occurring amino acids in the interacting 3D patterns of transient (20.91%) and obligate (33.82%) complexes, respectively. A study of the dipeptide composition on each side of the interacting 3D pattern pairs shows that the dipeptides Ala-Ala and Ala-Leu are popular in the 3D patterns of both transient and obligate complexes. Interactions between amino acids with a large hydrophobicity difference are more frequent in transient than in obligate complexes. By contrast, in obligate complexes, interactions between hydrophobic residues account for the top five most frequently occurring amino acid pairings.
https://doi.org/10.1142/9781848161092_0011
Structural bioinformatics provides new tools for investigating protein-protein interactions at the molecular level. We present two types of structural descriptors for efficiently representing and comparing protein-protein binding sites and surface patches. The descriptors are based on distributions of distances between five types of functional atoms, thereby capturing the spatial arrangement of physico-chemical properties in 3D space. Experiments with the method are performed on two tasks: (1) detection of binding sites with known similarity from homologous proteins, and (2) scanning of the surfaces of two non-homologous proteins for similar regions.
https://doi.org/10.1142/9781848161092_0012
In this paper, we consider the problem of structurally aligning a target RNA sequence of length n with a query RNA sequence of length m whose known secondary structure may contain embedded simple pseudoknots. The best known algorithm for this problem (Dost et al. [13]) runs in O(mn^4) time with O(mn^3) space, which requires too much memory, making it infeasible for comparing ncRNAs (non-coding RNAs) of length several hundred or more. We propose a memory-efficient algorithm to solve the same problem. We reduce the space complexity to O(mn^2 + n^3) while maintaining the time complexity of Dost et al.'s algorithm. Experimental results show that our algorithm is feasible for comparing ncRNAs of length more than 500.
https://doi.org/10.1142/9781848161092_0013
Genomic sequence alignment is a powerful tool for finding common subsequence patterns shared by input sequences and for identifying evolutionary relationships between species. However, the running time and space requirements of genome alignment are often extensive. In this research, we propose a novel algorithm called Coarse-Grained AlignmenT (CGAT) to reduce the computational cost of cross-species whole-genome sequence alignment. CGAT first divides the input sequences into “blocks” of fixed length and aligns these blocks to each other. The resulting block-level alignment is then refined at the nucleotide level. This two-step procedure drastically reduces the overall time and space needed for an alignment. In this paper, we show the effectiveness of the proposed algorithm by applying it to whole genome sequences of several bacteria.
https://doi.org/10.1142/9781848161092_0014
As the sequence identity between a pair of proteins decreases, alignment strategies that are based on sequence and/or sequence profiles become progressively less effective in identifying the correct structural correspondence between residue pairs. This significantly reduces the ability of comparative modeling-based approaches to build accurate structural models. Incorporating into the alignment process predicted information about the local structure of the protein holds the promise of significantly improving the alignment quality of distant proteins. This paper studies the impact on the alignment quality of a new class of predicted local structural features that measure how well fixed-length backbone fragments centered around each residue-pair align with each other. It presents a comprehensive experimental evaluation comparing these new features against existing state-of-the-art approaches utilizing profile-based and predicted secondary-structure information. It shows that for protein pairs with low sequence similarity (less than 12% sequence identity) the new structural features alone or in conjunction with profile-based information lead to alignments that are considerably better than those obtained by previous schemes.
https://doi.org/10.1142/9781848161092_0015
Transition seeds exhibit a good tradeoff between sensitivity and specificity for homology search in both coding and non-coding regions. However, identifying good transition seeds is extremely hard. We study the hit probability of high-order seed patterns. Based on our theoretical results, we propose an efficient method for ranking transition seeds for seed design.
https://doi.org/10.1142/9781848161092_0016
Spaced seeds are a filtering method invented to efficiently identify regions of interest in similarity searches. It is now well known that certain spaced seeds hit (detect) a randomly sampled similarity region with higher probability than others. Assume each position of the similarity region is an identity with probability p, independently. The seed optimization problem seeks the seed with the highest hit probability for a given length and weight. Although the problem was previously shown not to be NP-hard, in practice it seems difficult to solve: the only algorithm known to compute the optimal seed is still exhaustive search in exponential time. In this article we offer some insight into the hardness of the seed design problem by demonstrating a relation between the seed optimization problem and the optimal Golomb ruler design problem, a well-known difficult problem in combinatorial design.
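The hit probability being optimized can be computed exactly by brute force for tiny regions, which makes the objective concrete even though practical seed evaluation uses dynamic programming. A sketch under the i.i.d. match model described above:

```python
from itertools import product

def hit_probability(seed, region_len, p):
    """Exact hit probability of a seed ('1' = required match, '0' =
    don't care) on an i.i.d. similarity region: each position matches
    with probability p, and the seed hits if some offset aligns all of
    its '1' positions with matches. Enumerates all 2^region_len match
    patterns, so this is feasible only for very small regions."""
    ones = [i for i, c in enumerate(seed) if c == "1"]
    total = 0.0
    for pattern in product((0, 1), repeat=region_len):
        if any(all(pattern[off + i] for i in ones)
               for off in range(region_len - len(seed) + 1)):
            m = sum(pattern)
            total += p ** m * (1 - p) ** (region_len - m)
    return total

print(round(hit_probability("1101", 8, 0.7), 4))
```

Ranking candidate seeds of the same length and weight by this quantity is exactly the seed optimization problem; the exponential cost of doing it naively is what motivates the hardness discussion above.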
https://doi.org/10.1142/9781848161092_0017
Many efforts at standardising terminologies within the biological domain have resulted in the construction of hierarchical controlled vocabularies that capture domain knowledge. Vocabularies such as the PSI-MI vocabulary capture both deep and extensive domain knowledge in the OBO (Open Biomedical Ontologies) format. However, hierarchical vocabularies represented in OBO, such as PSI-MI, capture only simple parent-child relationships between terms. By contrast, ontologies constructed using the Web Ontology Language (OWL), such as BioPAX, define many richer types of relationships between terms. OWL provides a semantically rich, structured language for describing classes and sub-classes of entities and properties, the relationships between them, and domain-specific rules or axioms that can be applied to extract new information through semantic inference. To fully exploit the domain knowledge inherent in domain-specific controlled vocabularies, they need to be represented as OWL-DL ontologies rather than in formats such as OBO. In this paper, we describe a method for converting OBO vocabularies into OWL, with class instances represented as OWL-RDF triples. This approach preserves the hierarchical arrangement of the domain knowledge whilst also making the underlying parent-child relationships available to inference engines. It also has clear advantages over existing methods, which incorporate terms from external controlled vocabularies as literals stripped of the context associated with their place in the hierarchy. By preserving this context, we enable machine inference over the ordered domain knowledge captured in OBO controlled vocabularies.
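The OBO-to-triples step can be illustrated with a tiny parser. This is a generic sketch covering only a minimal subset of [Term] stanza tags, not the authors' converter, and the sample stanzas are used purely as an example:

```python
def obo_to_triples(obo_text):
    """Convert [Term] stanzas of a (toy) OBO fragment into subject-
    predicate-object triples, preserving is_a parent links -- the kind
    of structure an OWL/RDF representation retains for inference."""
    triples, term = [], {}
    for line in obo_text.splitlines() + ["[Term]"]:  # sentinel flush
        line = line.strip()
        if line == "[Term]":
            if "id" in term:
                triples.append((term["id"], "rdfs:label", term.get("name", "")))
                for parent in term.get("is_a", []):
                    triples.append((term["id"], "rdfs:subClassOf", parent))
            term = {}
        elif ":" in line:
            key, _, value = line.partition(":")
            value = value.split("!")[0].strip()  # drop trailing comment
            if key == "is_a":
                term.setdefault("is_a", []).append(value)
            else:
                term[key.strip()] = value
    return triples

sample = """\
[Term]
id: MI:0001
name: interaction detection method
[Term]
id: MI:0045
name: experimental interaction detection
is_a: MI:0001 ! interaction detection method
"""
for t in obo_to_triples(sample):
    print(t)
```

The point of the conversion is that the `rdfs:subClassOf` triples keep the parent-child context machine-readable, instead of flattening terms into bare string literals.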
https://doi.org/10.1142/9781848161092_0018
The similarity of two gene products can be used to solve many problems in computational biology. Since one gene product corresponds to several GO (Gene Ontology) terms, one way to calculate gene product similarity is to use the similarity of their GO terms, which can be defined as a semantic similarity on the GO graph. There are many definitions of the similarity of two GO terms, but they do not use the information in the GO graph efficiently. This paper presents a way to mine more of that information by treating edges as carriers of information content and by using negation on the semantic graph. In a simple experiment, the accuracy increased by 8.3 percent on average compared with the traditional method, which uses nodes as the information source.
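As a point of reference, the traditional node-based approach that such work improves on is Resnik-style similarity: the information content of the most informative common ancestor. A toy sketch with a made-up DAG and made-up term probabilities:

```python
import math

# Toy GO-like DAG (child -> parents) with hypothetical annotation
# probabilities; real IC values come from corpus annotation counts.
parents = {"A": [], "B": ["A"], "C": ["A"], "D": ["B", "C"], "E": ["C"]}
prob = {"A": 1.0, "B": 0.5, "C": 0.6, "D": 0.2, "E": 0.3}

def ancestors(term):
    """A term together with all of its ancestors in the DAG."""
    result = {term}
    for p in parents[term]:
        result |= ancestors(p)
    return result

def resnik(t1, t2):
    """Node-based semantic similarity: information content (-log p)
    of the most informative common ancestor."""
    common = ancestors(t1) & ancestors(t2)
    return max(-math.log(prob[t]) for t in common)

print(round(resnik("D", "E"), 3))  # MICA is C, so -ln(0.6) ~ 0.511
```

Edge-based variants redistribute the information content onto the edges of the graph rather than the nodes, which is the direction the abstract above pursues.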
https://doi.org/10.1142/9781848161092_0019
We present a new direction of research that deploys text mining technologies to construct and maintain databases organized in the form of pathways, by associating parts of papers with relevant portions of a pathway and vice versa. To materialize this scenario, we present two annotated corpora. The first, Event Annotation, identifies the spans of text in which biological events are reported, while the other, Pathway Annotation, associates portions of papers with specific parts of a pathway.
https://doi.org/10.1142/9781848161092_0020
Protein sequences hold great potential for revealing protein function, structural families, and evolutionary information. Classifying protein sequences into different functional groups or families based on their sequence patterns has attracted much research effort in the last decade. A key issue in these classification systems is how to interpret and represent protein sequences, which largely determines the performance of the classifiers. Inspired by text classification and Chinese word segmentation techniques, we propose a segmentation-based feature extraction method. The extracted features include selected words, i.e., substrings of the sequences, as well as motifs specified in public databases. These are segmented out, and their occurrence frequencies are recorded as the feature vector values. We conducted experiments on two protein data sets: a set of SCOP families and the GPCR family. Experiments on the classification of SCOP protein families show that the proposed method not only yields an extremely condensed feature set but also achieves higher accuracy than methods based on the whole k-spectrum feature space. It also performs comparably to the most powerful classifiers for GPCR level I and level II subfamily recognition, with 92.6% and 88.8% accuracy, respectively.
https://doi.org/10.1142/9781848161092_0021
Many RNA functions are determined by specific secondary and tertiary structures. These structures are folded by the canonical G::C and A::U base pairings as well as by the non-canonical G::U complementary bases. G::U base pairings in RNA secondary structures may induce structural asymmetries between the transcribed and non-transcribed strands of the corresponding DNA sequences, likely because the corresponding C::A nucleotides of the complementary strand do not pair. As a consequence, the secondary structures that form from a genomic sequence depend on which strand is transcribed. We explore this idea to investigate the size and significance of both global and local secondary structure formation differentials in several non-coding RNA families and mRNAs. We show that both the thermodynamic stability of global RNA structures in the transcribed strand and RNA structure strand asymmetry are statistically stronger than in randomized versions preserving the same dinucleotide composition and length, and that the effect is especially pronounced in microRNA precursors. We further show that a measure of local structural strand asymmetry within a fixed window, as could be used to detect and characterize transcribed regions in a full genome scan, can predict the transcribed strand across ncRNA families.
https://doi.org/10.1142/9781848161092_0022
Non-coding RNAs (ncRNAs) are transcripts that do not code for proteins. Recent findings have shown that RNA-mediated regulatory mechanisms influence a substantial portion of typical microbial genomes. We present an efficient method for finding potential ncRNAs in bacteria by clustering genomic sequences based on homology inferred from both primary sequence and secondary structure. We evaluate our approach using a set of Firmicutes sequences, and the results show promise for discovering new ncRNAs.
https://doi.org/10.1142/9781848161092_0023
Clustering objects with respect to a given similarity or distance measure is a problem often encountered in computational biology. Several well-known clustering algorithms are based on transforming the input matrix into a weighted graph although the resulting WEIGHTED CLUSTER EDITING problem is computationally hard: here, we transform the input graph into a disjoint union of cliques such that the sum of weights of all modified edges is minimized.
We present fixed-parameter algorithms for this problem which guarantee to find an optimal solution in provable worst-case running time. We introduce a new data reduction operation (merging vertices) that has no counterpart in the unweighted case and strongly cuts down running times in practice. We have applied our algorithms to both artificial and biological data. Despite the complexity of the problem, our method often allows exact computation of optimal solutions in reasonable running time.
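The weighted cluster editing objective is easy to state by brute force, which makes it concrete even though the fixed-parameter algorithms above solve it far more efficiently. A sketch on a tiny made-up instance (positive weight = existing edge with deletion cost, negative weight = non-edge with insertion cost):

```python
def partitions(items):
    """All set partitions of a list (Bell-number many; tiny inputs only)."""
    if not items:
        yield []
        return
    first, rest = items[0], items[1:]
    for part in partitions(rest):
        for i in range(len(part)):
            yield part[:i] + [[first] + part[i]] + part[i + 1:]
        yield [[first]] + part

def editing_cost(weights, clustering):
    """Cost of turning the weighted graph into this disjoint union of
    cliques: pay w to delete an edge cut by the clustering, and -w to
    insert a missing edge inside a cluster."""
    cluster_of = {v: i for i, cl in enumerate(clustering) for v in cl}
    cost = 0.0
    for (u, v), w in weights.items():
        same = cluster_of[u] == cluster_of[v]
        if same and w < 0:        # insert a missing intra-cluster edge
            cost += -w
        elif not same and w > 0:  # delete an existing cut edge
            cost += w
    return cost

def optimal_clustering(vertices, weights):
    return min(partitions(list(vertices)),
               key=lambda cl: editing_cost(weights, cl))

w = {("a", "b"): 3, ("a", "c"): 2, ("b", "c"): -1, ("c", "d"): 4,
     ("a", "d"): -2, ("b", "d"): -2}
best = optimal_clustering("abcd", w)
print(sorted(sorted(cl) for cl in best), editing_cost(w, best))
```

Exhaustive enumeration of partitions explodes immediately, which is exactly why the data reduction (such as vertex merging) and fixed-parameter search described above matter in practice.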
https://doi.org/10.1142/9781848161092_0024
This paper proposes a series of methods for measuring the similarity of protein structures. In the proposed methods, an original protein structure is transformed into a distance matrix, which is regarded as a two-dimensional image. The similarity of two protein structures is then measured by a kind of compression ratio of the concatenated image. We employed several image compression algorithms (JPEG, GIF, PNG, IFS, and SPC) and audio compression algorithms (MP3 and FLAC). We applied the proposed method to the clustering of protein structures. The results of computational experiments suggest that SPC has the best performance.
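The "compression ratio of the concatenated image" idea is closely related to the normalized compression distance (NCD). A sketch with zlib standing in for the image and audio compressors used in the paper, applied to byte strings rather than distance-matrix images:

```python
import zlib

def ncd(x, y):
    """Normalized compression distance: similar objects make the
    concatenation compress almost as well as the larger part alone,
    so related pairs score lower."""
    cx, cy = len(zlib.compress(x)), len(zlib.compress(y))
    cxy = len(zlib.compress(x + y))
    return (cxy - min(cx, cy)) / max(cx, cy)

motif = b"ACDEFGHIKLMNPQRSTVWY"
seq_a = motif * 30                                  # highly regular
seq_b = motif * 28 + b"WY" * 20                     # mostly the same
seq_c = bytes(range(256)) * 3                       # unrelated content
print(ncd(seq_a, seq_b) < ncd(seq_a, seq_c))        # related pair is closer
```

Replacing zlib with an image codec applied to the concatenated distance-matrix images gives the kind of similarity measure the paper clusters with; the pairwise NCD values feed directly into any standard clustering algorithm.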
https://doi.org/10.1142/9781848161092_0025
The genome halving problem, previously solved by El-Mabrouk for inversions and reciprocal translocations, is here solved in a more general context that also allows transpositions and block interchanges, for genomes including multiple linear and circular chromosomes. We apply this to several data sets and compare the results to those of the previous algorithm.
https://doi.org/10.1142/9781848161092_0026
Rapidly increasing numbers of organisms have been completely sequenced and most of their genes identified; homologies among these genes are also becoming established. It has thus become possible to represent whole genomes as ordered lists of gene identifiers and to study the evolution of these entities through computational means, in systematics as well as in comparative genomics. While dealing with rearrangements is nontrivial, the biggest stumbling block remains gene duplications and losses, which lead to considerable difficulties in determining orthologs among gene families, all the more since orthology determination has a direct impact on the selection of rearrangements. None of the existing phylogenetic reconstruction methods that use gene orders is able to exploit the information present in complete gene families: most assume singleton families and equal gene content, limiting the evolutionary operations to rearrangements, while others enforce these assumptions by eliminating nonshared genes and selecting one exemplar from each gene family. In this work, we leverage our past work on genomic distances, on tight bounding of parsimony scores through linear programming, and on divide-and-conquer methods for large-scale reconstruction to build the first computational approach to phylogenetic reconstruction from complete gene order data, taking into account not only rearrangements, but also duplication and loss of genes. Our approach can handle multichromosomal data and gene families of arbitrary sizes, and scales up to hundreds of genomes through the use of disk-covering methods. We present experimental results on simulated unichromosomal genomes in a range of sizes consistent with prokaryotes.
Our results confirm that equalizing gene content, as done in existing phylogenetic tools, discards important phylogenetic information; in particular, our approach easily outperforms the most commonly referenced tool, MGR, often returning trees with less than one quarter of the errors found in the MGR trees.
https://doi.org/10.1142/9781848161092_0027
The SPR (subtree prune and regraft) operation is used as the basis for reconciling incongruent phylogenetic trees, particularly for detecting and analyzing non-treelike evolutionary histories such as horizontal gene transfer, hybrid speciation, and recombination. The SPR-based tree reconciliation problem has been shown to be NP-hard, and several efficient heuristics have been designed to solve it. A major drawback of these heuristics is that for the most part they do not handle non-binary trees appropriately. Further, their computational efficiency suffers significantly when computing multiple optimal reconciliations. In this paper, we present algorithmic techniques for efficient SPR-based reconciliation of trees that are not necessarily binary. Further, we present divide-and-conquer approaches that enable efficient computing of multiple optimal reconciliations. We have implemented our techniques in the PhyloNet software package, which is publicly available at http://bioinfo.cs.rice.edu. The resulting method outperforms all existing methods in terms of speed, and performs at least as well as those methods in terms of accuracy.
https://doi.org/10.1142/9781848161092_0028
In addition to the well-known edit operations, the alignment of minisatellite maps includes duplication events. We model these duplications using a special kind of spanning tree and deduce an optimal duplication scenario by computing the respective minimum spanning tree. Based on the best duplication scenarios for all substrings of the given sequences, we compute an optimal alignment of two minisatellite maps. Our algorithm improves upon previously developed algorithms in the generality of the model, in alignment quality, and in space-time efficiency. Using this algorithm, we derive evidence of a directional bias in the growth of minisatellites in the MSY1 dataset.
https://doi.org/10.1142/9781848161092_0029
We introduce a comparative analysis of metabolic reaction networks between different species. Our method systematically investigates full metabolic networks of multiple species at the same time, with the goal of identifying highly similar yet non-identical pathways which execute the same metabolic function, i.e. the transformation of a specific substrate into a certain end product via similar reactions. We present a clear framework for matching metabolic pathways, and propose a scoring scheme which combines enzyme functional similarity with protein sequence similarity. This analysis helps to gain insight in the biological differences between species and provides comprehensive information on diversity in pathways between species and alternative pathways within species, which is useful for pharmaceutical and industrial bioengineering targets. The results also generate hypotheses for improving current metabolic networks or constructing such networks for currently unannotated species.
https://doi.org/10.1142/9781848161092_0030
This paper presents a novel method for automatically recovering signaling pathways from protein-protein interaction networks. Given an undirected, weighted protein interaction network, finding signaling pathways is treated as searching for optimal subnetworks according to some cost function. To approach this optimization problem, we propose an integer linear programming model of signaling pathways in the protein interaction network. Numerical results on three known yeast MAPK signaling pathways demonstrate the efficiency and effectiveness of the proposed method.
https://doi.org/10.1142/9781848161092_0031
We present a new approach to segmenting multiple time series by analyzing the dynamics of cluster rearrangement around putative segment boundaries. By directly minimizing information-theoretic measures of segmentation quality derived from Kullback-Leibler (KL) divergences, our formulation reveals clusters of genes along with a segmentation such that clusters show concerted behavior within segments but exhibit significant regrouping across segmentation boundaries. This approach finds application in distilling large numbers of gene expression profiles into temporal relationships underlying biological processes. The results of the segmentation algorithm can be summarized as Gantt charts revealing temporal dependencies in the ordering of key biological processes. Applications to the yeast metabolic cycle and the yeast cell cycle are described.
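The idea of scoring a candidate segment boundary by how strongly clusters regroup across it can be sketched with a symmetrized Kullback-Leibler divergence between cluster-membership distributions. The cluster counts below are hypothetical, and the paper's actual objective differs in detail:

```python
import math

def kl(p, q):
    """Kullback-Leibler divergence between two discrete distributions
    (assumes q is strictly positive wherever p is)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def boundary_score(left_counts, right_counts):
    """Symmetrized KL between cluster-size distributions on either side of
    a candidate boundary: a high score means strong regrouping of genes."""
    def normalize(counts):
        total = sum(counts)
        return [c / total for c in counts]
    p, q = normalize(left_counts), normalize(right_counts)
    return kl(p, q) + kl(q, p)

# Hypothetical cluster sizes before/after two candidate boundaries.
print(boundary_score([10, 10, 10], [25, 4, 1]))   # strong regrouping
print(boundary_score([10, 10, 10], [11, 10, 9]))  # little regrouping
```

A segmentation procedure would place boundaries where this score peaks, so that genes behave coherently within segments but reshuffle between them.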
https://doi.org/10.1142/9781848161092_0032
We present algorithms for finding control strategies in Boolean Networks (BNs). Our approach uses symbolic techniques from the field of model checking. We show that, despite recent hardness results for finding control policies, a model checking-based approach is often capable of scaling to extremely large and complex models. We demonstrate the effectiveness of our approach by applying it to a BN model of embryogenesis in D. melanogaster with 15,360 Boolean variables.
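The control problem itself is easy to state on a toy network: find a sequence of external inputs that drives the network from a start state to a target state under synchronous updates. The three-node network below is invented for illustration (it is not the embryogenesis model), and the brute-force search is what symbolic model checking replaces at scale:

```python
from itertools import product

def step(state, control):
    """One synchronous update of a toy 3-node Boolean network with a
    single external control input (illustrative update rules)."""
    a, b, c = state
    return (b and not control, a or c, control and not a)

def find_control(start, target, horizon=4):
    """Brute-force search over all control sequences up to the horizon --
    the problem the paper solves symbolically for very large networks."""
    for seq in product([False, True], repeat=horizon):
        state = start
        for u in seq:
            state = step(state, u)
        if state == target:
            return seq
    return None

seq = find_control((True, False, False), (False, True, True))
print(seq)
```

Enumeration costs 2^horizon simulations, which is exactly why symbolic techniques that manipulate whole sets of states at once are needed for models with thousands of variables.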
https://doi.org/10.1142/9781848161092_0033
Estimates of population genetic parameters such as allele frequencies, heterozygosities, inbreeding coefficients, and genetic distances rely on the assumption that all sampled genotypes come from a randomly interbreeding population or sub-population. Here we show that small cross-generational samples may severely bias estimates of allele frequencies when a small number of progenies dominates the next generation or the sample. A new estimator of allele frequencies is developed for cases in which the kin structure of the focal sample is unknown and has to be assessed simultaneously. Using Monte Carlo simulations, we demonstrate that the new estimator delivers a significant improvement over the conventional allele-counting estimator.
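The conventional allele-counting estimator that the paper's method improves upon is simply the observed allele count over the 2N gene copies in a diploid sample. A minimal sketch on a hypothetical biallelic locus:

```python
def allele_frequency(genotypes, allele):
    """Conventional allele-counting estimator: copies of the allele over
    2N gene copies in a diploid sample. It is unbiased for a random sample,
    but degrades when a few families dominate the sample, which is the
    scenario the paper addresses."""
    copies = sum(genotype.count(allele) for genotype in genotypes)
    return copies / (2 * len(genotypes))

# Hypothetical sample of five diploid genotypes at a biallelic locus.
sample = [("A", "A"), ("A", "a"), ("a", "a"), ("A", "a"), ("A", "A")]
print(allele_frequency(sample, "A"))  # 6 copies / 10 = 0.6
```

When siblings are overrepresented, the gene copies are no longer independent draws from the population, so this ratio can sit far from the true population frequency even for a sizeable sample.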
https://doi.org/10.1142/9781848161092_0034
In this paper, we develop a probabilistic model for two practical aspects of the singular haplotype reconstruction problem: the incompleteness and inconsistency that occur in the DNA sequencing process used to generate the input haplotype fragments, and the common practice used to generate synthetic data in experimental algorithm studies. We design three algorithms in this model that reconstruct the two unknown haplotypes from the given matrix of haplotype fragments with provably high probability and in time linear in the size of the input matrix. We also present experimental results that confirm the theoretically efficient performance of these algorithms. The software implementing our algorithms is available for public access and for real-time online demonstration.
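The final step common to fragment-based haplotype reconstruction is a per-column vote: once fragments are partitioned between the two haplotypes, each SNP position is decided by majority among the fragments covering it. The fragments below are invented, and the paper's algorithms additionally choose the partition and handle errors probabilistically:

```python
def reconstruct_haplotypes(fragments, assignment):
    """Rebuild two haplotypes from SNP fragments ('0'/'1', '-' = missing)
    given an assignment of each fragment to haplotype 0 or 1, by
    per-column majority vote within each group."""
    n = len(fragments[0])
    haplotypes = []
    for group in (0, 1):
        hap = []
        for j in range(n):
            votes = [f[j] for f, g in zip(fragments, assignment)
                     if g == group and f[j] != "-"]
            # Ties and uncovered columns default to '0' in this sketch.
            hap.append("1" if votes.count("1") > votes.count("0") else "0")
        haplotypes.append("".join(hap))
    return haplotypes

# Hypothetical fragment matrix consistent with haplotypes 01110 and 10001.
frags = ["01-10", "0111-", "10-01", "1-001", "01110"]
print(reconstruct_haplotypes(frags, [0, 0, 1, 1, 0]))
```

Voting makes the reconstruction robust to missing entries, and with independent errors the majority is correct with high probability once enough fragments cover each column, which is the flavor of guarantee the paper proves.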
https://doi.org/10.1142/9781848161092_0035
Finding motifs and the corresponding binding sites is a critical and challenging problem in studying the process of gene expression. String and matrix representations are two popular models for representing a motif. However, both representations share an important weakness: they assume that the occurrence of a nucleotide in a binding site is independent of the other nucleotides. More complicated representations, such as HMMs or regular expressions, can capture nucleotide dependencies, but these models are impractical: they have too many parameters and require many known binding sites. Recently, Chin and Leung introduced the SPSP representation, which overcomes the limitations of these complicated models. However, discovering novel motifs in the SPSP representation is still an NP-hard problem. In this paper, based on our observations of real binding sites, we propose the Dependency Pattern Sets (DPS) representation, which is simpler than the SPSP model but can still capture nucleotide dependency. We develop a branch-and-bound algorithm (DPS-Finder) for finding optimal DPS motifs. Experimental results show that DPS-Finder can discover a length-10 motif from 22 length-500 DNA sequences within a few minutes, and that the DPS representation performs similarly to the SPSP representation.
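The independence assumption that the matrix representation makes, and that SPSP and DPS relax, is concrete in a position weight matrix scorer: each column contributes to the score on its own, so correlated positions cannot be modeled. The PWM below is a made-up example, not a motif from the paper:

```python
import math

def pwm_score(pwm, site):
    """Log-likelihood of a site under a position weight matrix. Each column
    is scored independently of the others -- precisely the assumption that
    dependency-aware representations such as SPSP and DPS relax."""
    return sum(math.log(pwm[i][base]) for i, base in enumerate(site))

# Hypothetical 3-column PWM (per-position nucleotide probabilities).
pwm = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
    {"A": 0.25, "C": 0.25, "G": 0.25, "T": 0.25},
]
print(pwm_score(pwm, "ATG") > pwm_score(pwm, "GGG"))  # True
```

A PWM cannot express a rule like "position 2 is T only when position 1 is A"; pattern-set representations encode such joint constraints with far fewer parameters than a full HMM.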
https://doi.org/10.1142/9781848161092_0036
Primer Approximation Multiplex PCR (PAMP) is a recently introduced experimental technique for detecting large-scale cancer genome lesions such as inversions and deletions from heterogeneous samples containing a mixture of cancer and normal cells. In this paper we give integer linear programming formulations for the problem of selecting sets of PAMP primers that minimize detection failure probability. We also show that PAMP primer selection for detection of anchored deletions cannot be approximated within a factor of 2 − ɛ, and give a 2-approximation algorithm for a special case of the problem. Experimental results show that our ILP formulations can be used to optimally solve medium size instances of the inversion detection problem, and that heuristics based on iteratively solving ILP formulations for a one-sided version of the problem give near-optimal solutions for anchored deletion detection with highly scalable runtime.
https://doi.org/10.1142/9781848161092_0037
We have developed a generic framework for combining introns from genomically aligned expressed-sequence-tag clusters with a set of exon predictions to produce alternative transcript predictions. Our current implementation uses ASPIC to generate alternative transcripts from EST mappings. Introns from ASPIC and a set of gene predictions from many diverse gene prediction programs are given to the gene prediction combiner GenePC, which then generates alternative consensus splice forms. We evaluated our method on the ENCODE regions of the human genome. In general, we see a marked improvement in transcript-level sensitivity because more than one transcript per gene may now be predicted. GenePC, which alone is highly specific at the transcript level, balances the lower specificity of ASPIC.
https://doi.org/10.1142/9781848161092_0038
High-resolution spatial information on gene expression over time can be acquired through whole-mount in-situ hybridisation experiments in animal model species such as fish, fly, or mouse. Expression patterns of many genes have been studied, and the data have been integrated into dedicated model organism databases such as ZFIN for zebrafish, MEPD for medaka, BDGP for Drosophila, and MGI for mouse. Nevertheless, a central repository that allows users to query and compare gene expression patterns across different species has not yet been established. For this purpose we have integrated gene expression data for zebrafish, medaka, Drosophila, and mouse into a central public repository named 4DXpress (http://ani.embl.de/4DXpress). 4DXpress allows users to query anatomy-ontology-based expression annotations across species and to jump quickly from one gene to its orthologs in other species based on Ensembl Compara relationships. We have set up a linked resource for microarray data at ArrayExpress. In addition, we have mapped developmental stages between the species to enable comparison of corresponding developmental time phases. We have used clustering algorithms to classify genes based on their expression pattern annotations. To illustrate the use of 4DXpress, we systematically analysed the relationships between conserved regulatory inputs and spatio-temporal gene expression derived from 4DXpress and found significant correlation between the expression patterns of genes predicted to have similar regulatory elements in their promoters.
https://doi.org/10.1142/9781848161092_0039
DNA replication is a key process in the cell division cycle and is initiated in a coordinated manner in several species. To understand DNA replication in a species, one needs to measure both the half-replication timing (or replication timing) and the replication efficiency, which vary across the genome in higher eukaryotes. Previous studies did not directly assess replication efficiency on a genomic scale, and replication timing was assessed only indirectly from average DNA content. In this paper, we present the first method for directly and simultaneously measuring both half-replication timing and efficiency from a single DNA microarray time-course data set. We achieve this by fitting a so-called near-sigmoid model to each locus of the DNA. We apply this model to S. pombe DNA replication microarray data and show that it is effective for genome-scale replication timing and efficiency profiling studies.
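Fitting a sigmoid per locus can be sketched with a toy stand-in for the paper's near-sigmoid model: an efficiency-scaled logistic whose midpoint is the half-replication time, fitted here by a simple grid search over synthetic data (the parameter values and grid are illustrative assumptions):

```python
import math

def sigmoid(t, t50, k, eff):
    """Fraction of cells that have replicated a locus by time t:
    an efficiency-scaled logistic with half-replication time t50
    and fixed steepness k."""
    return eff / (1.0 + math.exp(-k * (t - t50)))

def fit_locus(times, values, k=1.0):
    """Grid search for (t50, efficiency) minimizing squared error --
    a toy stand-in for the paper's per-locus model fitting."""
    best = (float("inf"), None, None)
    for t50 in [x / 10 for x in range(0, 201)]:        # 0.0 .. 20.0 min
        for eff in [e / 20 for e in range(1, 21)]:      # 0.05 .. 1.0
            sse = sum((sigmoid(t, t50, k, eff) - v) ** 2
                      for t, v in zip(times, values))
            if sse < best[0]:
                best = (sse, t50, eff)
    return best[1], best[2]

# Synthetic time course generated with t50 = 8 min, efficiency = 0.8.
times = list(range(0, 21, 2))
values = [sigmoid(t, 8.0, 1.0, 0.8) for t in times]
t50, eff = fit_locus(times, values)
print(t50, eff)
```

Because both parameters enter the one fit, timing and efficiency are read off the same curve, which is the sense in which a single time-course data set yields both quantities at once.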
https://doi.org/10.1142/9781848161092_bmatter
AUTHOR INDEX