ACM CIKM'98/Workshops/Tutorials Registration Form ================================================= Please complete this form (TYPE or PRINT), and return with your payment to the address given below (Please copy this form for additional attendees). Your paper number, if any:__________________________ Your paper title, if any:_____________________________________________________ ______________________________________________________________________________ Paper authors:___________________________________________________________ Your First Name:_________________________ Last Name:_______________________________ Title (Dr/Mr/Ms/Prof.):_____ Position:________________________________________ Company/Univ.:___________________________________ Dept.:______________________ Address:______________________________________________________________________ City:_______________________________ State:_____ Zip/Postal Code:_____________ Country:______________________ Phone:_________________________________________ Fax:__________________________________E-mail:_________________________________ ============================================================================== Advance Registration RATE ("received" by October 1, 1998 by CIKM98 or by the date set by a program chair of any workshop): CIKM98 Reg. Fee: ACM ____Member($350.00) ____Non-Member($400.00) ___Student($250.00) Tutorial Reg. Fee: ACM (each half day)____Member($200.00) ____Non-Member($300.00) ___Student($150.00) Circle your choice: T1 T2 T4 T5 (see below for tutorial titles) Workshop Reg. Fee: (DOLAP98 - W4) ACM ____Member($250.00) ____Non-Member($300.00) ___Student($200.00) (see below for workshop titles) (each one day) ACM ____Member($200.00) ____Non-Member($250.00) ___Student($150.00) Circle your choice: W3 W5 (see below for workshop titles) (each two days)____Member($350.00) ____Non-Member($400.00) ___Student($250.00) Circle your choice: W1 W2 (see below for workshop titles) ------------------------------------------------------------------------------- LATE registration RATE ("received" AFTER October 1, 1998 by CIKM or by the date set by a program chair of any workshop): CIKM98 Reg. Fee: ACM ____Member($400.00) ____Non-Member($450.00) ___Student($280.00) Tutorial Reg. Fee: ACM (each half day)____Member($250.00) ____Non-Member($350.00) ___Student($180.00) Circle your choice: T1 T2 T4 T5 (see below for tutorial titles) Workshop Reg. Fee: (DOLAP98 - W4) ACM ____Member($300.00) ____Non-Member($350.00) ___Student($250.00) (see below for workshop titles) (each one day) ACM ____Member($250.00) ____Non-Member($300.00) ___Student($180.00) Circle your choice: W3 W5 (see below for workshop titles) (each two days)____Member($400.00) ____Non-Member($450.00) ___Student($280.00) Circle your choice: W1 W2 (see below for workshop titles) ------------------------------------------------------------------------------- Total Reg. Fees (from above) $_______________ Extra Page Charge:(US$150.00 per page; for author) $_______________ Extra Conf. Reception Tickets: ($30/each) $_______________ Extra Conf. Banquet Tickets: ($50/each) $_______________ Extra copy of conf./Wkshop proceedings: ($50/each) $_______________ Extra Workshop Reception Tickets:($30/each) $_______________ Total Fees Enclosed: US$_____________________ ** Make check/money order payable to ACM-CIKM98 ** Check method of payment: ___Check/money order (payable to ACM-CIKM98) ___Company Purchase Order number:_____________________ (we accept P.O only from companies in USA and "ONSITE RATE" (see above) is applied to each P.O) Signature:________________________________________ Date:_______________________ ACM membership number (required for member rate):______________________________ * Please let us know of your plan for the following events (please put check mark): ________________________________________________________________________ | | WILL ATTEND | WILL NOT ATTEND | |________________________________________|_____________|_________________| |Nov. 3 CIKM Reception 6:00-8:00PM | | | |----------------------------------------|-------------|-----------------| |Nov. 4 CIKM Banquet 6:30-9:30PM | | | |----------------------------------------|-------------|-----------------| |Nov. 5 Workshop Reception (Workshop | | | | attendees only) 6:30-8:00PM | | | |________________________________________|_____________|_________________| ------------------------------------------------------------------------------- * TUTORIALS: (for more info please see below/end of this form or visit tutorial home page or contact Prof. I.Y. Song at songiy@dunx1.ocs.drexel.edu, Tutorial Chair; http://www.pages.drexel.edu/faculty/songiy/CIKM98tut.html) All Tutorials are Half-Day on Tuesday, Nov. 3, 1998 (Morning: 8AM-12NOON; Afternoon: 1PM-5PM): T1. Data Mining on Large Databases (Morning) Rajeev Rastogi and Kyuseok Shim T2. Models in Information Retrieval (Morning) Fredric C. Gey T4. Data Warehousing Design Techniques for ROLAP (Afternoon) Il-Yeol Song T5. Metadata Repositories: Enabling Information Asset Management (Afternoon) Sandra Heiler and Gail Mitchell ------------------------------------------------------------------------------- * WORKSHOPS: (One "Symposium" and four "workshops" are planned. For more info, please visit CIKM home page below or contact Prof. Kia Makki at kia@usl.edu, Workshop Coordinator) W1: GIS'98 Symposium (TWO Days: Nov. 6-7, 1998) - Advances in Geographic --------- Information Systems (Program Chair: Robert Laurini; laurini@if.insa-lyon.fr) W2: NPIVM'98 (TWO Days: Nov. 6-7, 1998) - New Paradigms in Information Visualization and Manipulation (Program Co-Chair: Charles Nicholas; Nicholas@cs.umbc.edu) W3: WIDM'98 (ONE Day: Nov. 6, 1998) - Web Information and Data Management (Program Chair: Fereidoon Sadri; sadri@uncg.edu) W4: DOLAP'98 (ONE Day: Nov. 7, 1998) - ACM First International Workshop on Data Warehousing and OLAP (Program Co-Chair: Il-Yeol Song; song@drexel.edu) (http://www.pages.drexel.edu/faculty/songiy/dolap.html) -------------------------------------------------------------------------------- NOTES for authors: At least one author PER ACCEPTED PAPER must advance register by August 17, 1998 for CIKM98 papers (or by the deadline set by the workshop program chairs for workshop papers), or the paper will be excluded from the proceedings. The proceedings will be published by ACM. The maximum length is 8 pages per CIKM98 paper with a registration fee. Authors can buy at most 4 extra pages at a cost of U.S. $150.00 per page (so total length limit is 12 pages if you buy extra pages). The maximum length is 4 pages (GIS98 is 6 pages) per WORKSHOP paper with a registration fee. Authors can buy at most 2 extra pages at a cost of U.S. $150.00 per page (so total length limit is 6 pages (8 pages for GIS98)if you buy extra pages). The Primary Author is responsible/liable for full registration fees for the accepted paper and at least one author per paper (if not the primary author) must register by above deadline (this fee will not be refunded at any case) and must come to give presentation about the paper at the CIKM98 (or Workshops as appropriate). ---------------------------------------------------------------------------------- * Send this registration form with payment (in US Dollars ONLY and make checks or money order payable to "ACM-CIKM98") to: CIKM/Workshops Treasurer C/O Dr. EK Park Computer Science Telecommunications University of Missouri - Kansas City 4747 Troost Ave., Room 207 Kansas City, MO 64110 USA (816)235-1497; (816)235-5159(fax); ekpark@cstp.umkc.edu * The conference registration fee covers a copy of proceedings, conference reception, refreshments during the conference, and the dinner banquet. Additional reception and banquet tickets can be purchased. Each tutorial and workshop registration fees are indicated above as well. Each Workshop reg. fee covers a copy of proceedings, workshop reception and refreshments during the workshop. All payments must be in U.S. dollars. All checks or international money orders from banks outside the United States should be cashable at a branch of that bank in the United States or at any U.S. bank. If you send us check or international money order, it should have complete "micro encoding line" at the bottom of it (ask your bank about this). You can also send Traveler's check of American Express or Visa or MasterCard (be sure that you sign each check and make it payable to "ACM CIKM98" AND mail it via EXPRESS Carrier Service company such as Federal Express, DHL, etc for insured delivery). We accept purchase orders from U.S. organizations only and ONSITE RATE (higher rate !!) is applied to each purchase order. You are responsible for paying fees to get the check or money order from your bank AND any fees imposed on us from your bank to issue/deliver your check to CIKM. Student rate attendees must have proper ID. Please ASK your organization if they need TAX ID number for the reg. fee payment. ACM is the sponsoring organization and so ACM's federal tax id will be provided as necessary if it is requested by your organization for accounting purpose. * CIKM98/Workshops/Tutorials will be held at: Holiday Inn Bethesda (renamed to "Crown Plaza" soon) 8120 Wisconsin Avenue Bethesda, Maryland 20814 USA Phone: (301) 652-2000; 301-652-3806(fax) Room Rate: $109.00. All room rates are net per room, per night, single/double/ triple/quad occupancy plus taxes for Nov. 2, 1998 through Nov. 7, 1998. Individuals to make their own reservations by calling 1-800-HOLIDAY (465-4329) and use the group code "CIK" to receive CIKM98 rate. Individuals are on their own for payment of room, tax and any incidental charges. All reservations must be made prior to the Cut-off date of October 3, 1998. After the cut-off date, reservations will be accommodated based on rate(s) and rooms available. All reservations must be guaranteed for late arrival by a valid credit card or an advance deposit of one night's room and tax. Check-in time is 4:00PM. Acknowledgement of receipt of the registration form with payment will be sent out by e-mail only if you provide your e-mail address. Conference registration materials including receipts and proceedings can be picked up only at the registration desk on site (there will be extra handling/shipping /mailing charges of $200.00 if it needs to be mailed to any paid registrants with "no show"). * Refund Policy: Paid registrants who cannot attend, and do not send a substitute, are entitled to a refund of paid fees (less a US$200.00 processing fee) if a request is received in writing on or before September 30, 1998. Registrants are liable for their full fees after that date (i.e., NO Refund will be made !!). All no-show registrations will be billed in full. * If you have any questions contact appropriate person: on registrations: Dr. Park: ekpark@cstp.umkc.edu; on technical programs: Program Chairs; Prof. French at french@cs.virginia.edu or Prof. Gardarin at georges.gardarin@prism.uvsq.fr); on other conference related matters: General Chairs; Dr. Charles Nicholas at nicholas@cs.umbc.edu; or Dr. Niki Pissinou at pissinou@cacs.usl.edu. * Please visit our WWW site for up-to-date CIKM98 program information and related activities: http://www.cs.umbc.edu/cikm/1998 ---------------------------------------------------------------------------------- Detailed Tutorial Information: T1. Data Mining on Large Databases (Morning, Nov. 3, 1998) by Rajeev Rastogi and Kyuseok Shim Bell Laboratories Tutorial Outline Following topics will be discussed during the tutorial. 1. Introduction: Brief overview and discussion on data mining techniques developed for large databases. 2. Association Rules: Presents the association rules, optimized association rules and correlations. 3. Classification: Describes the state of the art classifiers for large databases. These include PUBLIC, Rain-Forest, SLIQ and SPRINT algorithms. Break : (15 minutes) -------------------- 4. Clustering: Illustrates the characteristics of traditional clustering algorithms and present techniques developed for large databases. We cover CLARANS, BIRCH, CURE and ROCK algorithms. 5. Similar Time Sequences: Illustrates the existing techniques developed for similar time sequences. 6. Other Applications and Future Research: Discusses other interesting problems and research issues Biography of Speaker : ---------------------- (1) Rajeev Rastogi Rastogi Rastogi received the B. Tech degree in Computer Science from the Indian Institute of Technology, Bombay in 1988, and the masters and Ph.D. degrees in Computer Science from the University of Texas, Austin, in 1990 and 1993, respectively. He joined Bell Laboratories in Murray Hill, New Jersey, in 1993 and is currently a member of technical staff (MTS) in the Information Sciences Research Center. Rajeev Rastogi is active in the field of databases and has served as a program committee member for several conferences in the area. His writings have appeared in a number of ACM and IEEE publications and other professional conferences and journals. His research interests include database systems, storage systems and knowledge discovery. His most recent research has focused on the areas of high-performance transaction systems, continuous-media storage servers, tertiary storage systems, data mining, and multidatabase transaction management. (2) Kyuseok Shim ---------------- Kyuseok Shim is currently a member of technical staff (MTS) in the Database Systems Research Department of Bell Laboratories. Before that, he worked for Rakesh Agrawal's Quest Data Mining project at IBM Almaden Research Center. He also worked as a summer intern for two summers at Hewlett Packard Laboratories. He received B.S. degree in Electrical Engineering from Seoul National University, and the MS and Ph.D. degrees in Computer Science from University of Maryland, College Park. Kyuseok Shim has been working in the area of databases focusing on data mining, query processing and query optimization, constraint-based database systems and dataware housing. He has published several research papers in prestigious database conferences and journals. He has also served as a program committee member on database and knowledge discovery conferences. T2. MODELS IN INFORMATION RETRIEVAL (Morning, Nov. 3, 1998) by Fredric C. Gey Ph.D University of California-Berkeley COURSE DESCRIPTION: Information retrieval algorithms have emerged as the key to effective search of large collections of unstructured text such as found on the Internet. Vector space algorithms are used by Lycos and AltaVista, while Inktome uses a probabilistic document retrieval algorithms. The three major theoretical models in information retrieval=20 are Boolean/logic, vector space, and probabilistic. This tutorial=20 will explain the unique characteristics and problems of each model and how each model has evolved along different lines. Modern variants of the basic models are explained. The attendees of this tutorial will obtain a basic understanding=20 of the major theoretical models upon which modern text retrieval =20 software is based. The tutorial should provide each participant=20 with a starting point for further self-education. 1/2 hour Background and historical development Luhn and statistical text characteristics Statistical weights and the IDF concept 1 hour Boolean set and logic models Fuzzy logic (RUBRIC/TOPIC) Weighted boolean and P-Norm (INQUERY) Recent logic models 1 hour Vector space and geometric models Basic vector similarity measures Generalized vector space model Latent Semantic Indexing Pivoted normalization similarity 1 hour Probabilistic models Probabilistic indexing and querying 2- Poisson and OKAPI Relevance weights and relevance feedback Inference nets and neural network approaches Regression models 1/2hour Performance measurement and analysis=20 Recall, precision, fallout measures Limitations to performance assessment -- interjudge consistency, completeness Statistical significance tests Materials: 110 Course overheads, and a 40 page survey article "Models in Information Retrieval" will be provided WHO SHOULD ATTEND: This course is designed to provide a fast-paced yet rigorous introduction to the basic models of Information Retrieval for academic and industrial research and development computer scientists whose background lies outside the Information Retrieval area. ABOUT THE INSTRUCTOR: Fredric Gey's research specializes in in probabilistic document retrieval using logistic regression techniques. He is principal investigator of NSF grant IRI 9630765 Probabilistic Retrieval of Full-Text Document Collections Using Logistic Regression. He is Co-principal Investigator for the ARPA research contract "Search Support for Unfamiliar Metadata Vocabularies," July 1997-June 2000. He directs the UC Berkeley entries to the TREC conferences, and is designated as General Chairman for SIGIR99 to be held at the University of California, Berkeley during the summer of 1999. He holds a PhD in Information Science from UC Berkeley. T4. Data Warehousing Design Techniques for ROLAP (Afternoon, Nov. 3, 1998) by Il-Yeol Song Drexel University Level : Beginning to Intermediate. Intended Audience : Professionals who are working or thinking for data warehousing based on relational database systems. Tutorial Abstract : A data warehouse is an integrated data repository containing historical data of a corporate for supporting decision-making processes. Recently, data warehouses became the focus of corporate information management with the most advanced database technology. The basic strategy for accessing individual and aggregate data in a data warehouse using relational databases is known as ROLAP (Relational OLAP). This tutorial presents technology overview for the development of data warehousing. It compares ROLAP and MOLAP (Multidimensional OLAP) then discusses techniques for designing star schema. We will look at the multiple variations of the star schema that exist and the differences in the properties of these different schema. It also discusses the techniques for optimizing the performance of data warehouse systems based on relational database systems. Specifically, the discussion includes storage, parallel processing technology, indexing technology, including bit map indexes, join indexes, multi-table join indexes, indexing strategies, query optimization based on star schema, and partitioning techniques. It concludes with the survey of commercial markets, tools, trends, research issues and challenges. Biography of Speaker : Il-Yeol Song is an associate professor in the College of Information Science and Technology at Drexel University, Philadelphia, PA. He received his M.S. and Ph.D degrees in Computer Science from Louisiana State University in 1984 and 1988, respectively. His current research areas include database modeling and design, data warehousing, object-oriented database systems, and object-oriented analysis and design. He has published over 60 refereed technical articles in various journals, international conferences, and books. In 1992, he received an exemplary teaching award as well as a research scholar award from Drexel University. He has won eight Sigma Xi research awards from the Drexel Sigma Xi scientific research competition. He has worked as a program committee member for over twenty five international conferences and workshops. He was the guest editor for a 1995 special issue of Journal of Computer and Software Engineering entitled "Methodologies and Tools for Intelligent Information Systems." He will be the guest editor for a special issue of Journal of Computer Science and Information Management entitled "Applications and Technologies for Next Generation Database Systems," scheduled for Fall 1998. He is the program co-chair of First ACM Int'l Workshop on Data Warehousing and OLAP (DOLAP that will be held with CIKM98 in November 7, D.C. T5. Metadata Repositories: Enabling Information Asset Management by Sandra Heiler and Gail Mitchell (Afternoon, Nov. 3, 1998) GTE Laboratories, Inc. Metadata repositories have long been used by software engineering tools to store and manage descriptions of system components, and by data administrators to document information stores. More recently, they are being used to support the integration of various tools, databases, and applications, and their use is being expanded to manage metadata for many more kinds of applications, including data warehousing. In this half-day tutorial, we present an industrial perspective on repository technology and its uses in managing an enterprise's information assets. The tutorial starts with a description of repository technology. It examines requirements for managing metadata and describes how these are met by the technology. In particular, we discuss repository architectures, integration mechanisms, repository metamodels, and associated tools for populating, accessing, maintaining, and administering the repository. We identify various implementation strategies for repositories, and look at the state-of-the-art in repository products. The second part of the tutorial examines the use of repositories. We begin with a discussion of issues in populating a repository and in implementing applications using repositories. We then describe a number of applications of repository technology, including software lifecycle support, production planning and management, and decision support systems and data warehousing. Finally, we look at how the repositories supporting these applications combine to provide for enterprise-wide information asset management, and we identify research issues in moving to this broader use. Repository Technology: Metadata and Metadata Management: Motivation for the use of metadata to document business models (data, processes, systems, relationships) and the management requirements for this type of documentation. What is a Repository?: Description of the basic functionality expected of a metadata repository. Integration and Relationships: Three perspectives of integration: 1) data integration through shared models, 2) tightness of integration of repository system with other tools and applications, 3) breadth of applications/tools that are integrated by a repository. Background and Requirements: Database data dictionaries, CASE tool repositories, data warehouses, etc. all manage metadata for their particular application. General look at the requirements for managing metadata (configuration control, views, versioning, long-running transactions,...). Repository Architecture: Components of a repository system. How these components work together, and work with outside applications and tools. Implementation Issues: Some of the issues addressed: in what ways are repository databases implemented...commercial DBMSs? specialized data stores? What makes a repository portable? Should there be one centralized database, or a federation of repositories? etc. Implementing Applications: How does data get into the repository? How is it accessed? What are some of the administrative functions that need to be done to maintain a repository? etc. State of the art: Product overview: What is available in the products? What don't the products have today? What directions are vendors taking? Repository Applications Data Integration: How heterogeneous database systems use repositories to integrate data. DDL in the repository. Schema evolution. Capturing semantics in metadata. Data Analysis, Decision Support and Data Warehouses: Information captured in the repository for warehouse data derivation, analysis, maintenance, and interpretation. Software Lifecycle: Use of repository and CASE tools in design, development, testing, tuning, reuse, maintenance. Production planning and management: Simulation models and production statistics/logs managed in repository. Legacy System Migration: Collect legacy system documentation (code and data) in a repository and store relationships to new systems for determining retirement (partial or complete) eligibility. Maintaining Enterprise Knowledge: Expanding the application of repositories to track relationships across heterogeneous repositories; integrating metadata and metadata managers. Summary and Conclusions: Standards and Ontologies Research Issues Sandra Heiler is the Principal Investigator of the Data and Database Research project at GTE Laboratories, where her research focuses on the use of metadata repositories to support enterprise-wide management of information and software components. In particular, her work is directed to the use of metadata and repository technology to integrate distributed, heterogeneous systems and databases, and to support data warehousing. She is also involved in the application of this technology to legacy system migration and data archiving in a large SAP rollout. Ms. Heiler's earlier work at GTE Laboratories was with the Distributed Object Management Department, where she did research on object model integration and interoperability frameworks, and on object views and identifiers. She joined GTE from CCA and Xerox Advanced Information Technology, where she developed object models and object management systems for VLSI and software engineering environments, as well as transaction models to support cooperative work in those environments. Ms. Heiler has more than 35 years of experience in database research and development and in applications of database and metadata technology. Her previous work includes developing data management systems for statistical databases and Decision Support Systems, and for managing other specialized data types, including engineering, statistical, and bibliographic data. She has authored papers and presented tutorials on object models and object views, semantic interoperability, integration frameworks, and object-oriented systems. Her current research interests include the use of metadata to capture data semantics and to support data warehousing. Gail Mitchell is a Principal Member of the Technical Staff at GTE Laboratories where she works on problems in integrating enterprise information. Current research interests include querying in heterogeneous systems (including over the web), data warehousing, and data integration. Recent activities focus on metadata repository technology to support legacy system migration and information integration. Dr. Mitchell received her PhD from Brown University for her research on extensible query optimization for object-oriented database systems. She has authored a number of papers in the areas of object query languages, extensible query processing, and distributed object management, and has taught courses, workshops and tutorials on these topics at such venues as OOPSLA, MIT, OGI, DEC, and NATO ASI. She is active on program committees, more recently SIGMOD, VLDB and CIKM, and was editor of a TAPOS issue on Distributed Object Systems and co-editor of the book Persistent Object Bases. -----------------------------------------------------------------------------