GCA
GCA Attend a GCA Conference

TECHNICAL TRACK
TUESDAY, FEBRUARY 29

 

8:30 am
Morning Keynote: Grass-roots XML

Jon Bosak, Sun Microsystems, conference co-chair


Jon Bosak of Sun Microsystems organized and led the working group that created XML, and he continues to oversee the development of the XML family of standards as Chair of the W3C XML Coordination Group. He is a founding member of OASIS and was a primary force behind the development of the DocBook standard for Unix and Linux documentation.

Session #1: Documents


9:00 am
Publishing with XML and the Open eBook Specification
David Goldstein, Chief Technical Officer, Versaware, Inc.
davidg@versaware.com

Biography:
David Goldstein is the CTO of Versaware, Inc., one of the companies pioneering the electronic publication industry. Mr. Goldstein is active in XML based consulting and serves on the authoring committee of the Open eBook Specification. He also serves on both the executive and technical committees of the EBX group working on digital rights management for eBooks. Prior to his work at Versaware, he was a Product Manager and the Technical Evangelist of Accent Software, Inc., which produced some of the first multi-lingual Unicode-based word processors and translation tools.
Abstract:
Sparked by the growth in WEB access and the availability of low cost, hand-held reading devices, the electronic publication industry is on the verge of redefining our relationship with the "printed" word. We are witnessing a revolution in the way we read, access, search and purchase "books". However, there are several obstacles to overcome. These include the plethora of word processing and publishing formats, the interoperability gap separating traditional and electronic publishing, the representation and encoding of a myriad of languages and alphabets, and the high cost of re-processing content for various target media.
The Open eBook (OEB) Specification is part of an industry-sponsored initiative to resolve many of these issues. The specification seeks to have an immediate and direct impact on the creation, advancement, and growth of a flourishing eBook industry. Fully XML and Unicode compliant, this specification supports feature-rich and media-enhanced electronic publication. Content providers can now feel confident that their documents can be delivered on a wide range of reading devices without the need to reprocess. Any OEB-compliant reading system will display these publications in a manner that most faithfully represents the original content, even in consideration of limitations of a particular reading device.
This presentation will introduce the goals and design principles of the Open eBook Specification and will detail its features, limitations and relationship to other industry standards. Current tools for producing and displaying OEB publications will be demonstrated and future directions of the OEB authoring committee's work will be described.



9:30 am
Ęsop: A Browser for XML Documents and Open eBook Publications

Christopher Maden, Solutions Architect, Yomu
crism@exemplary.net
Deborah Hooker, Vice President of Engineering, Yomu

deb@exemplary.net

Biographies:
Chris Maden received his Sc.B. in electrical engineering from Brown University; he worked for Electronic Book Technologies for three years before taking a position with O'Reilly & Associates, where he developed a DSSSL-based print and on-line publishing system. He contributed to the development of HTML 2.0 and XML, as well as the DynaText and DynaWeb products at EBT, and is on the XSL Working Group. He now provides OEB solutions to publishers and works with the Ęsop architecture team at Yomu.

Deb Hooker has fifteen years of experience architecting and building software solutions in domains as diverse as astronomy, satellite remote sensing, telecommunications and finance. She is the Vice President of Engineering for Yomu.
Abstract:
Yomu plans to become the world's leading distributor of electronic books (ebooks). By ebooks, we mean any large documents that are not suited to HTML and Web-based viewing. This may include books previously published in paper form, new texts written specifically for this medium, internal corporate documents, or any other large, complex document.
Towards this end, we are developing sop, a high-quality ebook browser, written in Java. Ęsop incorporates technology to be a general-purpose XML browser, initially focused on the subset required by the OEB Publication Structure specification published by the Open eBook Consortium (URL:http://www.openebook.org/). We intend to provide high-quality cross-platform rendering, addressing desktop and handheld systems. The version of Ęsop seen at the conference will be the first publicly seen version of our product. We intend to have a stable release by June of 2000.
Ęsop will enable the reader to enjoy a product that takes into account human understanding of how reading works. To this end, Ęsop incorporates features that allow for accessibility, internationalization, as well as less commonly understood features to improve reading comprehension. We intend to demonstrate some of these features, and clearly delineate the advantages involved.
We will demonstrate the rendering features of Ęsop, including what we're calling "multi-view browsing". Multi-view browsing allows the same rendered version of a document to be flowed into different contexts; for example, a user who is more comfortable with traditional print books would prefer the illusion of on-screen pages, while a more experienced computer user may want a single continuous scroll. We will generate three views of Ęsop during this presentation: single page, two pages, and the scrolling view.
We will discuss the processing model that Ęsop uses. XML's primary strength, the separation of content and behavior, leads to a question that the standards have yet to answer: what happens when content and behavior must be unified? The order of processing a document, a stylesheet, and associated links can produce different results. In addition, the processing of "hints" in the instance (such as XLinks, HTML-style tags, SVG, etc.) is unclear, especially when coupled with stylesheets that may reference these behavior-specific tags a little, a lot, or not at all. We will share our thoughts that led to our current prcoessing model, which we believe produces the best results.
User annotations are a key feature of an ebook browser. Annotations can be shared among users of a book without depending on a particular rendered view of a book. A problem with document transformation is that it can cause information based on the reliability of a document's pre-transformed structure to become unreliable. Tracking annotation locations back to their origin in the source document and reliably re-locating them in a new rendered view is an interesting problem, and we will share our thoughts on the various approaches we have considered, where we arrived, and why.
We have also developed a stylesheet object model capable of representing the information contained in both CSS and XSL stylesheets. Considerations of performance and completeness that went into creating the model will be covered, as will our experiences implementing XSLT and XSL formatting objects.



10:00 am
Implementing the DOM for MathML in the IBM techexplorer
Hypermedia Browser

Sam Dooley, Staff Programmer, IBM Research
dooley@watson.ibm.com

Biography:
Sam Dooley is a member of the Advanced Internet Publishing Group at IBM Research, and is responsible for the current implementations of several XML technologies for the techexplorer Hypermedia Browser, including DOM Level 1 interfaces for MathML and LaTeX node types, parsing and rendering MathML presentation and content elements, core XML language support, and content markup extensions to the LaTeX language used in techexplorer's interactive courseware offerings. Research interests include applications of interactive mathematical documents for symbolic computation, scientific courseware, and the representation of mathematical inference.
Abstract:
The Document Object Model (DOM) allows an application to expose a standard collection of interfaces so that external programs, scripts, and applets can modify the internal structure of a document. MathML provides standard vocabularies for the communication of mathematical information contained in internet documents. Together they allow web content developers to create applications with an unprecedented level of sophistication in the treatment of the interaction between a user and the mathematical content of a document. These applications include interactive scientific and technical documents, interactive courseware, and mathematical software interfaces that begin to realize the promise of XML technologies for providing extensible documents with rich domain-specific semantic content.
These technologies have been brought together in the implementation of the DOM interfaces for the MathML node types supported by the IBM techexplorer Hypermedia Browser. IBM techexplorer is a browser plugin for rendering mathematical documents written in LaTeX and MathML, and stands as the first open-standards platform for developing interactive mathematical applications for the internet. By using widely-supported markup languages for mathematics (MathML, LaTeX) and industry standard interfaces (DOM) for interacting with the document structure, C++ and Java application developers have open access to the mathematical content of a document, and a high degree of control over its presentation to the user.

IBM techexplorer implements full support for the DOM Level 1 interfaces for both the MathML 1.0 presentation and content tag sets, and for a collection of element types corresponding to commonly used TeX and LaTeX syntactic structures. The implementation of these interfaces involved adding DOM support to a set of existing C++ classes for representing and rendering documents written in LaTeX and MathML. Several issues were addressed during the design of the DOM implementation for techexplorer, including: providing a set of application-independent DOM implementation classes for C++, using delegation instead of multiple inheritance, handling application-specific treatment of the representation of text through an application-independent DOM interface (Text), exposing application-specific representations of attributes through a uniform Attribute interface, and the use of namespaces for handling collisions between separate element vocabularies. Solutions to these issues point to implications and design requirements for the use of the DOM in other domain-specific areas.


Session #2: News and Information


11:00 am
XML Standards for News: The View at Halftime
Deren Hansen, Director, News Technologies & XML Evangelist, WAVO Corporation
dhansen@wavo.com

Biography:
Deren Hansen, WAVO's designated XML Evangelist, represents his company in various XML industry and standards activities. He serves as editor for the IDEAlliance's PRISM (Publisher Requirements for Industry Standard Metadata) initiative, and sits on the International Press Telecommunication Council's IPTC-2000 working group. An expert in the electronic news and information industry, Mr. Hansen created WAVO's content acquisition system and was one of the architects of WAVO's NewsPak service. Mr. Hansen holds degrees from the Massachusetts Institute of Technology and the University of Delaware.
Abstract:
Continuous news feeds are a priority for many web sites. If someone wanting to add a news feed were to search for XML news standards they would find a bewildering collection of acronyms - NML, NITF, XMLNews, PRISM, IPTC-2000, NewsML - whose associated specifications and proponents share suspicious similarities and tantalizing differences. Deren Hansen, whose responsibilities at WAVO Corp. have thrust him squarely in the middle of such things, explains the origins of, the relationships between, and likely future of the various XML standards for news.

11:30 am
An XML-Centric Architecture for Immersive Sports Coverage
Jeffrey E. Sussna, Chief Architect, Quokka Sports, Inc.

jeff.sussna@quokka.com

Biography:
Jeffrey E. Sussna has 15 years of experience designing and building advanced software products. He has led the development of cutting edge interactive television and Web applications and end-to-end content management systems for major media companies including Time Warner, News Corp., and the Chicago Tribune. Jeff has held technical leadership positions at Silicon Valley companies such as Apple and Oracle, and Multimedia Gulch companies such as Ikonic Interactive and Quokka Sports. Jeff is currently Chief Architect at Quokka Sports, Inc., where he is responsible for defining distributed systems and protocols for live production of online sports coverage.
Abstract:
Quokka Sports produces immersive coverage of world-class sporting events for Web, wireless, and broadband environments. This coverage blends web content, video, and multiple live data feeds, letting users control their own perspective on an event. Immersion production relies on a high-volume, low-latency production process that integrates multiple data types, data formats, tools, organizations, and delivery platforms. Quokka is using XML as the basis for a data-centric backbone for its end-to-end immersion production environment, the Quokka Sports Platform (QSP).
QSP uses XML for several fundamental types of production data:
* Metadata about individual assets (represented using RDF)
* Live data streams for such things as timing, position, telemetry, and biometrics
* Concept-based organizational and search structures (also represented using RDF)
* Platform-independent content publishing
* Content packaging for distribution channels
* Dynamic content serving
* Client-side navigation
XML-based production data enable interfaces between people organizations, tools, and components throughout the QSP lifecycle. These interfaces include:
* Event venues and the Quokka production studio
* Third-party content creators and the Quokka production studio
* Distributed Quokka production facilities
* The Quokka production studio and content distribution partners
* Dynamic and static content serving systems
* Event producers and end-users (via client-side content navigation)

This presentation describes XML's central role in the design of a loosely coupled distributed system that integrates multiple content types, tools, and organizations. It begins by introducing QSP and describing the various XML-based formats used by QSP. It explains how these formats help QSP meet its goals of openness, flexibility, and adaptability. Finally, the presentation discusses XML adoption issues, including design decisions and tradeoffs, internal knowledge transfer, and cross-organizational perceptual issues.

Session #3: Technologies

2:00 pm
Introduction to XUL, the XML-Based User interface Language: An XML Application for Defining the User Interface of Software Applications
Eric Krock, Senior Product Manager, Netscape Communications

ekrock@netscape.com

Biography:
Eric Krock is Senior Product Manager for Netscape Communicator at Netscape Communications. He is responsible for component technologies including the Netscape Gecko layout engine and its support for W3C standards, JavaScript, the Open JVM Interface, and the Mozilla Plug-in API. Previously he worked as a Technology Evangelist responsible for JavaScript, Dynamic HTML, and Commerce Applications.
Abstract:
Topics covered in this session:
* history of cross-platform GUI development: Windows, Macintosh, Motif, and cross-platform solutions
* maturation of web standards over 1994-2000
* power offered by web standards today
* HTML, XML, and XML Namespaces for representing structured data
* RDF for representing resources
* CSS for formatting
* DOM for positioning and event handling and its CSS interface for style control
* JavaScript as scripting language
* motivation for XUL
* XUL foundation concepts: packages, widgets, box and spring layout
* XUL markup
* XUL examples: toolbar, pulldown menus, bookmarks with RDF, dialogs, windows, trees
* real-world example: UI of Mozilla browser
* modifying the UI by editing XUL files; ChromeZone themes on mozillazine.org
* XUL futures: XUL overlays to enable modularity; XUL dynamically loaded from server
* resources for learning more


2:30 pm
Solaris WBEM: Sun's Implementation of CIM Over HTTP

Ed Mooney, Staff Engineer, XML Technology Center, Sun Microsystems, Inc.
Ed.Mooney@Sun.com

Biography:
Employed by Sun since 1993, most recently on the engineering staff in the XML Technology Center. Past assignments have all been in the realm of electronic publishing, including many dues paid converting Frame to SGML. Chair of the Interoperability Working Group within the Distributed Management Task Force.
Abstract:
Sun Microsystems will deliver support for Web Based Enterprise Management with Solaris(TM) 8 (Q1 2000)[1]. This technology enables Solaris 8 applications to manage other WBEM-enabled platforms, and other WBEM-enabled applications to manage the Solaris Operating Environment. Java and XML lie at the heart of this support.
WBEM is an initiative of the Distributed Management Task Force[2]. Of the major computer vendors among the DMTF's membership, the following sit on its board of directors:
* Cisco
* Compaq Computer Corp.
* Dell Computer Corp.
* Hewlett-Packard Company
* IBM/Tivoli Systems, Inc.
* Intel Corporation
* Microsoft Corporation
* NEC Corporation
* Novell
* Santa Cruz Operation, Inc. (SCO)
* Sun Microsystems, Inc.
* Symantec Corporation
XML forms one foot of the tripod on which WBEM rests. The Common Information Model (CIM) and a mapping of CIM to HTTP form the other two feet.
The Common Information Model (CIM) is an object-oriented, language-neutral data model. It comprises a specification[3] and schema[4].
The CIM specification defines a metaschema. Classes and instances occupy the metaschema along with other abstractions such as qualifiers, properties, methods and associations.
The schema describe the components to be managed. Schema are written in plain text using a language called Managed Object Format (MOF).
The CIM specification recognizes three types of schema. The core schema provide a small set of classes, associations and properties for analyzing and describing managed systems. The common schema provide definitions for technology-independent systems, applications, networks and devices. Extension schemas are technology-dependent extensions to the common model. The DMTF provides the core and common schema. Each vendor provides its own extension schema.
The "Specification for the Representation of CIM in XML"[5] provides a syntax both for declaring managed objects in XML and for XML encodings of operations against a WBEM-enabled device. The "Specification for CIM Operations over HTTP"[6] maps a set of 23 operations onto HTTP.
WBEM services will make the Solaris Operating Environment manageable by tools from enterprise vendors other than Sun. These services include 100% Pure Java(TM) implementations of:
* A manager for CIM objects
* A MOF compiler
* An API for writing object manager clients
The interfaces support two transports: Remote Method Invocation (RMI) and HTTP. The HTTP transport implements the "Specification for CIM Operations over HTTP." It was built using Project X, Sun's Java implementation of DOM and SAX, and the Java Servlet Development Kit, also from Sun.
The HTTP transport exposes no public interfaces. The goal is for any functionality accessible by way of RMI also to be accessible by way of HTTP. However, the HTTP transport awaits key technologies from the DMTF to be as functional as RMI. Among these are models for events and security.
Many of the non-public interfaces of the HTTP transport are devoted to mapping Sun's WBEM client API to the functional profiles specified in Section 2.6 of "Specification for CIM Operations over HTTP." The design favors DOM over SAX. Even so, it doesn't use DOM methods directly. In general, it parses strings into DOM at the latest possible opportunity. It uses Project X's TreeWalker class to parse DOM objects into CIM objects.
We have no hard performance statistics. Subjectively, performance over HTTP is noticeably slower compared to RMI, but acceptably responsive from a web-surfer's perspective.
Though perhaps a toddler today, WBEM technology is growing fast. The DMTF published the standards only after Sun, Microsoft, and Hewlett-Packard had shown that their prototypes could interoperate. It is hard at work on models for events, queries, security and discovery which it also plans to realize in XML.
1. http://www.sun.com/solaris/wbem/
2. http://dmtf.org/
3. http://dmtf.org/spec/cim_spec_v22/
4. http://dmtf.org/spec/cim_schema_v22.html
5. http://dmtf.org/download/spec/xmls/CIM_XML_Mapping20. htm
6. http://dmtf.org/download/spec/xmls/CIM_HTTP_Mapping10. htm


3:00 pm
What XML Schema Designers Need to Know About Measurement Units
Frank Olken, Computer Scientist , Lawrence Berkeley National Laboratory
olken@lbl.gov
John McCarthy, Computer Scientist, Lawrence Berkeley National Laboratory

jlmccarthy@lbl.gov

Biography:
Frank Olken is a database researcher at Lawrence Berkeley National Laboratory. He currently works on metadata and related standards (ISO 11179, XML Schema, XML Query Language) and various informatic issues (data exchange, distributed systems, etc.) related to electric power systems. His other interests include statistical data management, sampling from database, OLAP, genomic databases, workflow management, ....

John McCarthy is a database administrator and researcher at Lawrence Berkeley National Laboratory. He currently also works on metadata standards (ISO 11179, XML Schema Language).
Abstract:
On September 23, 1999, NASA lost the Mars Climate Observer spacecraft (at a cost of $125 Million). "The 'root cause' of the loss of the spacecraft was the failed translation of English units into metric units in a segment of ground-based, navigation-related mission software, as NASA has previously announced," said Arthur Stephenson, Chairman of the Mars Climate Orbiter Mission Failure Investigation Board.
This loss reminds us of the importance of measurement units for data exchange in various fields (navigation, engineering, architecture, medicine, science, and commerce). Misunderstandings of measurement units are not simply expensive - in medical settings and aircraft navigation they can be fatal. As the Failure Investigation Board noted, NASA contractors failed to fully specify the interface (i.e., measurement units) between the two software packages. While NASA contemplates various management review mechanisms to prevent recurrence of such problems, we believe the appropriate answer is to effectively extend the type systems used to specify data interchanges (e.g., XML Schemas for XML documents) to include measurement unit information. Hence, it would be possible to automatically detect (and often to automatically resolve via conversion) inconsistent measurement units, much as we now routinely detect the need to convert among integer, single precision floating point and double precision floating point numbers in computer programs.
We review the classical semantic theory of physical measurement units known as dimensional analysis. This theory was developed over the past 100 years by physicists, chemists and mathematicians. Each kind of measurement has a "dimensionality", e.g., mass, length, time, length/time (i.e., speed), etc. The dimensionality may be specified as the ratio of two monomials in which the variables (indeterminants) represent the basis dimensions e.g. in the SI system of units, mass, length, time, current, .... Thus dimensionality for speed would be (Mass**1/Length**1). The dimensionality for energy is (Mass**1 * Length**2 / Time**2 ). Note that dimensionality is a partial specification of the semantics of a measurement. Also note that given a specified list of basis dimensions, we can specify the dimensionality of a measurement as 2 exponent vectors, where we have restricted the exponents to be non-negative.
In contrast measurement units are representational aspect of a measurement. Typically they are specified as a scale factor with respect to a set of basis units (one basis unit for each basis dimension). Thus 1 inch is defined as 2.54 * 10**-2 meters.
The reason for using a ratio of two monomials for dimensionality is that this permits us to distinguish "dimensionless" quantities, such as mass ratios (mass/mass), molar ratios (moles/moles), or volume ratios (length**3/length**3). Such a requirement arises with respect to dimensionless (ratiometric) specification of concentrations.
Most common dimensionally consistent unit conversions can be accomplished by dividing by the ratio of the unit scale factors with respect to a common set of basis units. However, dimensionally inconsistent conversions, e.g., mass to volume are quite common in commercial applications. Such conversions are more complex, relying on knowledge of materials properties (such as density) which vary with the condition (e.g., temperature) of the material.
Not all unit conversions are simple multiplications. Temperature conversions are affine transformations, accounting for differing origins of the temperature scales. Also note that "temperature coordinates" and "temperature intervals" are different sorts of measurements and have differing unit conversions. Lamentably, this distinction is not always made explicitly. Similar distinctions must be made between spatial interval lengths and spatial coordinates as well as temporal intervals and coordinates. Temporal coordinates are commonly referred to as dates or datetimestamps.
Having reviewed this generic discussion of dimensionality and measurement unit conversion we proceed to discuss how measurement units (and dimensionality) could be systematically encoded in XML (either in schemas or documents). We observe that usage of conventional abbreviations (miles/hour) would require the development of an application specific parser. Instead we suggest a systematic mapping of conventional units designations into legal XML names (miles_Per_hour). Such names could be used to reference definitions (dimensionality monomials, scale factors, ...) via designated namespaces (e.g., ISOMeasurementUnits). Such an approach, while more verbose than conventional abbreviations, would be simpler to implement and maintain. We provide detailed examples of how this could be done.
We suggest that such measurement unit encodings should be standardized for use in conjunction with the W3C XML Schema Languages specification. We envision that such measurement units specifications would typically be specified in the XML schema for a data exchange document as an extension of the datatype specification. We consider the implications of this with respect to the extensibility and annotation mechanisms proposed for the XML Schema Language.

Finally, we discuss the relation of this work to previous efforts concerning measurement units and dimensionality specification in the standards, database, and programming language communities.


Session #4: Databases


4:00 pm
RDBMS in XML, XML in RDBMS: Architectural Considerations
Dale Hunscher, CEO, South Wind Design, Inc.
dale@swdi.com

Biography:
Mr. Hunscher has worked as a programmer, systems analyst/designer/architect, engineering manager, and executive in the software field since 1983. His company, South Wind Design, Inc., has provided OEM software development and engineering services since 1993.
South Wind originally specialized in ODBC driver and application tool development, and expanded beginning in 1996 into technical support tools and Internet development, especially Web-enabled database applications. For the past eighteen months Mr. Hunscher has played a central role in the development and deployment of an XML transaction processing system that applies 30,000 - 100,000 XML documents per day against a 100+ GB Oracle database.
Abstract:
As XML becomes a key enabling technology for electronic commerce (e-commerce) and electronic data interchange (EDI), interplay with enterprise relational data base management systems (RDBMS) is inevitable. How will this interplay be structured? Will the RDBMS paradigm drive document structure, or will XML document structure be mapped onto RDBMS structures? Or will there be synergy between the two?
This presentation starts with a review of the architecture and terminology of RDBMS entities and relationships and XML document structures. While many similarities exist in the two terminologies, do the terms held in common really mean the same thing? After examining this question in some detail, we identify the key differences between the two conceptual frameworks. With this as our foundation we can begin to look at some possible mappings between RDBMS and XML, and evaluate the fit and utility of each.
XML documents are usually transactional in nature, representing an event that will cause a change in state in the systems on each side of the transaction. For example, an online purchase triggers the simultaneous transfer of goods from seller to buyer and cash equivalent from buyer to seller, mediated by one or more financial institutions that handle the accounts of the buyer and the seller, as well as other aspects of the transaction, such as the financial network over which the transaction is transmitted. The transaction causes changes in the accounting mechanisms on all sides of the transaction. The transaction means different things to each of the involved parties. For one it may denote the movement of value from one asset account to another, and for another, the reduction or increase in both a liability and an asset account.
An XML document transaction set can easily represent the details of these financial conversations for all the transaction processing systems along the way. How are such transaction documents best mapped to RDBMS transactions? One approach is to use XSL to transform a transaction into one or more SQL statements. We'll look in some detail at a simple example of how this can be done, to expose and analyze some of the pitfalls involved.
Finally we'll consider the challenge of representing RDBMS data sets (SQL query result sets) as XML documents. There are a number of design aspects to consider in mapping the data set to a document:
* The SQL result set is tabular in structure. XML documents are inherently hierarchical. What is the best way to map from one to another?
* Because one of the great strengths of RDBMS is query flexibility, SQL result set meta-data is highly dynamic. For best results an XML document should have a fixed structure, whether in the form of a Document Type Definition or an XML-Data schema. What is the best way to manage the flexibility of the SQL result set in producing XML document structures?
* XML allows the document designer two mechanisms for storing instance data: attribute values and element content. Which are most appropriate for representing result set content?
* A result set can contain an arbitrary number of rows. Should each row be represented as an XML document, or should a single document "container" represent the entire result set? Moreover, most RDBMS APIs support block cursors, which provide for the retrieval of multiple-row subsets of the result set. What is the best way to handle block cursors in mapping to XML documents?
Along the way we'll learn about some of the RDBMS vendor approaches to handling interactions with XML documents and take a look at WDDX, a proposal for mapping RDBMS result sets to XML documents.



4:30 pm
Enabling Your Business Data for XML with DB2 XML Extender

Jane Xu, Senior Software Engineer, Member of DataBase Technology Institute, Software Group,
IBM Corporation
jxu@us.ibm.com
Josephine Cheng, Distinguish Engineer Manager of DataBase Technology Institute, Software Group, IBM Corporation

Biography:
Jane Xu is a senior software engineer at IBM Database Technology Institute. She is currently the architect and technical lead of DB2 UDB XML Extender. Jane Xu received her Ph.D in Computer Science from the University of Southern California in 1990 and since then has been worked for IBM. Prior to that, she obtained her B.S. and M.S. degree in Computer Science from University Hawaii, Manoa. She worked on architecture and strategy of distributed file and storage systems while she was with IBM Storage Systems Division. After joining IBM Software Group, she was the technical lead of IBM Digital Library multimedia support and Net.Data products.Jane has published many publications in technical magazines and conferences. She filed certain number of patents for IBM.

Josephine Cheng is an IBM Distinguished Engineer at Santa Teresa Laboratory and a member at IBM Academy of Technology.She is also a manager at the Database Technology Institute responsible for advanced technologies for IBM database products including the product development of DB2 WWW Connection, Net.Data, XML Extender and DB2 Everywhere. Josephine received YWCA's 1996 Tribute to Women and Industry Award. She is a co-editor and co-author of "Web Gateway Tools" published by Wiley Computer Publishing, 4/97. She serves in National Research Council Information Panel for technology assessment since 1998. Josephine received the B.S. degree in Mathematics and Computer Sciences and the M.S. in Computer Sciences from the University of California, Los Angeles.
Abstract:
E-business requires stability, scalability and security. When e-business is adopting XML, DB2 allows you to leverage the power of XML to your existing business data, where XML can be your Web document content or your B2B data interchange format. DB2 XML Extender is a new extender of the DB2 extender family. It is a SDK which provides XML support for DB2. It not only makes DB2 as the repository of XMLdocuments and DTDs, but also enables existing relational data for XML. With XML Extender, you can decompose XML data into relational tables, and generate XML documents from data stored in DB2.
When companies turn to XML for their Web content, a repository for XML documents is needed. XML Extender provides new data types that let you store XML documents in DB2 databases and new functions that assist you in working with these structured documents, including fast indexing on XML elements and attributes. Entire XML documents can be stored in DB2 database as character data up to 2GB each, or stored as external files but still managed by DB2. Retrieve functions allow you to retrieve either the entire XML document or individual elements and attributes. XML Extender provides ways to ensure high performance queries.Ż You can specify elements or attributes in the XML document to be converted into standard SQL data types, and to be indexed for facilitating the high performance search. XML Extender also works the DB2 Text Extender, which has a new section search feature that supports the structured text search for XML.
While electronic data interchange(EDI) has been around for many years, implementing EDI systems to communicate with supplies and trading partners among companies is still expensive. XML-based data interchange formats are opening up B2B communication to all companies with lower cost. Because of the flexibility and openness, XML is becoming the standard to define data interchange formats. DB2 XML Extender allows you to transform XML content to a collection of traditional data types in new or existing DB2 tables. It provides stored procedures that decompose the XML content, convert elements and attributes into SQL data types and stores them in relational tables. Stored procedures are also provided to compose XML documents from your existing business data in DB2.
The mapping between relational tables and XML documents via the Data Access Definition(DAD), an XML formatted specification. In DAD, you can specify how to extract XML elements and attributes from entirely stored XML documents to create indexing. You can specify the way to compose or decompose XML documents from or into relational tables via the explicit or implicit SQL query,Ż based on the XSLT data model. The mapping is directly from relational tables to desired XML structure without further intermediate transformation. This gives XML Extender better performance.
DB2 XML Extender is also the repository of DTDs. With DTDs stored in DB2 XML Extender provides validation feature to validate entire XML documents stored as DB2 column data, as well as to validate input or generated XML documents for decomposition and composition.
The DB2 XML Extender comes with a visual administration tool is provided to easily define the mapping of elements and attributes from and XML document into columns and tables.

In this presentation, we will discuss the following topics:
* How to determine XML data storage and access method.
* How to stored entire XML documents in DB2 and create index on elements and attributes for fast search.
* How to generate XML documents from data in relational tables, and stored un-tagged data from XML documents into traditional SQL data columns.
* How to validate XML documents which XML documents stored in the same column with more than DTDs.



5:00 pm
Building Oracle8i XML Applications with XML, XSLT, XSQL Pages,
and Java

Steve Muench, Lead XML Evangelist, Oracle Corporation
smuench@oracle.com

Biography:
Steve Muench is Consulting Product Manager on the Business Components for Java Development Team, and Oracle's lead technical evangelist for XML. In his ten years at Oracle, he's been involved in the support, development, and evangelism of Oracle's application development tools and database. In addition to being Oracle's primary representative to the XSL Working Group at the W3C, he's been a driving force in helping Oracle development teams from the database server, to application server, to tools, to packaged applications weave XML sensibly into their future development plans. He's busy working on a book on using Oracle8i, XML and XSLT for O'Reilly, due out sometime in the year 2000. Steve has presented at the following industry conferences: XML98 (keynote), XMLEurope99 (Keynote), XML99 ("Relational Databases to XML"), and two XMLOne conferences, in addition to JavaOne 1999 and countless Oracle OpenWorld conferences.
Abstract:
Oracle's XSQL Servlet is a free yet amazingly powerful utility that can be used with any Servlet Engine to more easily build applications that combine the power of SQL, XML, and XSLT without programming.
In the presentation we walk through numerous demonstrations of applications you can build using XSQL Pages by combining your knowledge of SQL with XSLT Transformations to make magic happen.

We'll explore all of XSQL's built-in action elements and step through the details of building your own custom actions to implement virtually any kind of server-side XML processing your particular application might need.

MANAGEMENT TRACK --

TECHNICAL TRACK ( March 1, March 2)


Attend a GCA ConferenceBecome a GCA MemberBuy a GCA Publication
Today's News Digest
What is XML?What is SGML?ICEGCA's Mail.dat
Technical CommitteesTechnical ResourcesTargeted InitiativesGCA's GRACol
What is GCA?GCA Press ReleasesGCA MembersGCA's ICCContact GCA
GCA - Phone: +1 703-519-8160   Click Here For Legal And Technical Information
Click Here For Legal And Technical Information email: info@gca.org