Wednesday, November 6, 2013

Continental Breakfast

8:00 a.m. - 8:45 a.m.

Building Collaborative Organizations

8:45 a.m. - 9:45 a.m.

Nicco Mele, Co-Founder, EchoDitto and Faculty, Harvard Kennedy School, & Author, The End of Big

Our ability to connect instantly, constantly, and globally is altering the exercise of power with dramatic speed. Governments, corporations, centers of knowledge, and expertise are eroding before the power of the individual. Based on ideas from his recent book, internet pioneer Mele provides insights and ideas for building collaborative organizations using revolutionary technology and more!

KEYNOTE: A New Search Architecture for the Big Data Era

9:45 a.m. - 10:00 a.m.

Kamran Khan, Managing Director, Accenture

Search engines, distributed processing and content processing pipelines are not new. However enabling technologies of mature search engines, powerful content processing pipelines and cheap distributed processing are coming together to empower a next generation of information access, analysis and presentation much closer to the holy grails of knowledge management. Hear from the founder of Search Technologies how modern search engines are currently being combined with powerful independent content processing pipelines and the distributed processing technologies from big data to form new and exciting enterprise search architecture, delivering results only available to the biggest companies with the deepest pockets in the past.

TRUE TALES OF TAXONOMY USE

10:15 a.m. - 11:00 a.m.

Corporate Folksonomy for Collaborative Teams

Joanne du Hommet, Knowledge Manager, Ubisoft Entertainment PhD student, Paris 8 University, INDEX-Paragraphe laboratory
Beatrice Cacace, Senior Program Manager, Knowledge Management, Ubisoft Entertainment

At Ubisoft, collaboration and sharing are key factors to successfully creating great games. On this journey to success, expertise recognition and knowledge sharing can become major challenges.To facilitate knowledge accessibility and discovery, Ubisoft teams implemented a common referential of keywords, usable on collaborative platforms and internal applications.What is the vision the folksonomy will help to achieve? How does the tag system connect and interact with Ubisoft’s other internal applications? These are described along with how Ubisoft handled this mix of folksonomy and taxonomy, the interactions with other KM tools, and the governance behind the Ubisoft tag system.

Creating a Unified Front: Taxonomy Automatic Indexing

Ashleigh N. Faith, Librarian Data Scientist, EBSCO Information Services

SAE International uses automatic indexing software. Before its project was under way, “use” of the software was loosely applied at best and the taxonomy was in a poor state for automatic indexing. SAE created its own taxonomy, based in engineering mobility and science terminology, from scratch. Developing a cohesive taxonomy that would also facilitate automatic indexing on content reaching more than 136,000 pages (and growing) across eight different content types was a challenge. The nature of scientific content makes automatic indexing difficult. Faith discusses the process that SAE used to establish a taxonomy to capture content and create the bedrock in which the indexing software could be trained, as well as the trials and iterations of training the software and validating the assignments. SAE improved its taxonomy assignment of content to 89% accuracy, well above the typically accepted 75% accuracy rate of automatic indexing, and established a repeatable process that can be used as the taxonomy grows. Learn from concrete examples, lessons learned, and how to duplicate the process with any automatic indexing software.

User Experience Testing for Content Types & Retention Rules for Records & Econtent

Kyle Stannert, Assistant Director - City Clerk's Office, Public Records, City of Bellevue

As the City of Bellevue embarked on implementing new technologies and compliance requirements, it faced a challenge. With a retention schedule made up of more than 6,000 records series, the ability to support emerging business and technology requirements seemed next to impossible. The city’s records management program took on this challenge and refined the agency retention schedule into a format that would work for users and could be implemented in systems including email archiving, instant messaging, and SharePoint 2010/Gimmal Compliance Suite. This talk shares lessons learned in developing retention rules and a content type framework that is as easy to navigate as a visit to Disneyland. Learn how to consider the value of a functional retention schedule in your organization; connect the value of a simplified schedule in implementing email archiving, unified communication and/or ECM technologies; and apply multiple ideas to simplify your retention schedule at your place of work when you get back to the office.

AUTO-CATEGORIZATION: MACHINES VS. HUMANS IN BIG DATA

11:15 a.m. - 11:45 a.m.

Pattern Analysis & Categorization: Big Metadata Toolkits

Joseph Busch, Principal Analyst, Taxonomy Strategies
Vivian Bliss, Independent Consultant, USA

The exploding volume, complexity, and velocity of structured and unstructured data and their interactions presents challenges and opportunities to derive valuable insights. Among the challenges of managing massive datasets are gathering, validating, preserving, analyzing, and maintaining linkages from those analyses to the source dataset. Identifying patterns in datasets using information retrieval methods and writing out the results as metadata are well-established information management processes that should be adopted by organizations working with today’s Big Data sets. This presentation provides an overview of pattern analysis and categorization methods, including keyword and regular expression matching, business rules, pattern categorizers, entity extraction, and trained categorizers that are the key building blocks of analytics toolkits for big metadata applications.

Using Example-Based Auto-Categorization to Tame Big Data

John Felahi, Founder, JGF Strategies LLC

There are at least two schools of thought regarding Big Data. Part of the organization wants to take advantage of it.The other part views it as “Dark Data”—undiscovered, unanalyzed, and unreachable without the proper analysis tools and skills. It’s both. Concept-based auto-categorization,which automatically categorizes documents based on their actual content, not keywords or terms, is the fastest, easiest, and most repeatable way to pinpoint only the most important documents and emails among libraries spanning millions of files and messages. It is an established standard in legal e-discovery and U.S. intelligence, having proved defensible and highly scalable. Learn how companies are beginning to step up to big data analytics using example-based auto-categorization in order to take advantage of all their data, no matter where—or how—it resides. Analytics for Big Data can bring great value to many business applications—social media, market analysis, internal information analysis. It is also being applied toward governance issues and regulatory and legal compliance matters. We’ll also look at empowering the entire information life cycle; how the “infomediaries” gain business advantages; and why traditional search engines have not been able to keep up.

Constructing a Focused Taxonomy

11:45 a.m. - 12:00 p.m.

William Pieser, VP, Chief Marketing Officer, Pingar

This session describes a new method for constructing custom taxonomies from document collections. It includes identifying relevant concepts and entities in text; linking them to knowledge sources like Wikipedia, DBpedia, Freebase, and any supplied taxonomies from related domains; disambiguating conflicting concept mappings; and selecting semantic relations that best group them hierarchically.

ATTENDEE LUNCH & KEYNOTE: File Sync/Share Is Not Endpoint Backup

12:15 p.m. - 12:30 p.m.

Ann Fellman, Director, Product Marketing, Code42 Software

Due to its “social” essence, file sync/share requires a more open, less rigorous data security approach than enterprise endpoint backup, which demands the highest measures of data security. Attempting to marry the two business challenges through a single application results in an unhappy union that jeopardizes the safety and privacy of your organizational data.

INDEXING IMAGES

1:15 p.m. - 2:00 p.m.

Practical Aspects of Natural Language Processing

Daniel Joseph Vasicek, Programmer, Access Innovations, Inc. SIAM
Kathryn Brown, Editor, Access Innovations, Inc.

Recognizing data in medical records includes forming a regular expression and using it to extract a list of tags from the data. You probably need to do several iterations of this process to tune the regular expression(s).The next two steps involve classifying the tags into useful groups (recognizing context) and extracting the data to build your database. Vasicek presents a case study and walks through these steps for a set of medical records containing images, demonstrating the overall process used for extracting tags. The next talk shows how this process could be used to index images from those records.

Image Indexing (UF=Indexing Images)

Bob Kasenchak, Information Architect, Factor, USA

Many corporations, publishers, content providers, and other organizations have large stores of images: photographs, graphs, tables, pictures, diagrams. It is useful to be able to find and retrieve these images on demand without browsing through pages and volumes of documents and files. How can we index an image? Until optical recognition software is far more advanced, indexing an image itself is not practical. We can, however, examine and extract concepts from the text associated with an image using a thesaurus and indexing software to tag the image with metadata. In this way we can implement an easy, accurate image search.What text would be useful for this purpose? And how much text should be captured and indexed without generating too much noise, rendering the search meaningless? Get some answers here!

Taxonomy Meet Art, Art Meet Taxonomy

Dave Clarke, CEO, Squirro

The vast majority of taxonomy development discussion and effort is applied toward accessing textual data. This talk focuses on how taxonomy may be used to enhance access to visual data. It discusses current research into relevant technologies, illustrates examples using stunning high-definition imagery of masterpiece works of art, and concludes with a summary of taxonomy’s untapped potential to come to the aid of information access for visual data.

USER EXPERIENCE (UX) IN TAXONOMY DESIGN

2:15 p.m. - 3:00 p.m.

Benefits of Integrated UX-IM Design

Michael Rudy, VP Business Imagineering, Factor

Customers today face market-driven design challenges that span multiple content formats and sources, multiple application platforms, and multiple user platforms. These challenges often require the services of two types of designers: user experience (UX) designers and the information modelers (IM) of taxonomies and metadata. An integrated UX-IM design team is one approach to accelerating solution implementation. When approaching the overall design with a collaborative team, the project can have a common methodology and shorter design cycles.This presentation examines how collaboration between these teams ultimately benefits the entire design cycle while resonating with executive sponsors.

Ensuring Consistent & Accurate Tagging: Interface Design & Metadata Application

Ben Licciardi, Manager, PwC

As taxonomists, we spend a lot of energy designing vocabularies that are browsable and user-friendly, but we often overlook the system interfaces that taggers use to access and apply taxonomy terms to objects. The irony is, even the best vocabulary is of little value if a poorly designed tagging interface impedes a user from tagging consistently and accurately. This presentation looks at how tagging interfaces impact the metadata application process, exploring examples from content management systems, product information management systems, and crowdsourcing sites from around the web. It discusses how interface design can influence data quality, findability, and the long-term viability of a taxonomy project and concludes with some high-level principles to consider if you’re evaluating, building, or revamping a tagging interface.

Applying User Research to Designing Info Models

Bram Wessel, Principal, Factor Board Member, Information Architecture Institute

User experience design isn’t the only practice that can benefit from user research. Exploring the way users natively organize information in their minds gives taxonomists a means to precisely identify and define information structures, as well as a powerful guiding mechanism for difficult information model design decisions. This talk demonstrates how to construct an effective research plan that blends quantitative and qualitative methods to explore and analyze user mental models and how to transform research insights into viable, flexible, and sustainable information models.

Coffee Break

3:00 p.m. - 3:15 p.m.

SEMANTIC SEARCH

3:15 p.m. - 4:00 p.m.

Utilizing Ontologies for Taxonomy & Content Organization

Anthony Rhem, CEO/Principal Consultant, A. J. Rhem & Associates Author, Essential Topics in Artificial Intelligence

This presentation focuses on the design and implementation of ontologies and how they are leveraged to implement taxonomies and provide better content organization. It demonstrates through case studies how this implementation improves unstructured content search and retrieval. Case studies illustrate how this approach to content organization has improved “findability” and reuse of content and knowledge in several organizations. Along with examples of ontology and taxonomy adoption, an underlying view of the card sort results, keyword, and controlled vocabulary building used to meet the business expectations of the KM solution are shared.

Enhancing Searches With Taxonomies & Semantic Tech

Bob DuCharme, Technical Writer, Commonwealth Computer Research, Inc.

We’ve all seen how the major search engines sometimes second-guess—often correctly—what we meant to search for; taxonomies using standards-based semantic technology can help your own applications do this and more.While semantic technologies rarely try to store the complete meanings of words, data about the relationships of words and phrases to other words and phrases (for example, “broader than” or “alter- native label”) can often store enough semantics to automate search enhancement.When this data is stored using the W3C SKOS standard, it can more easily be aggregated, queried, and used by a variety of tools. Because SKOS is based on RDF, the growing amount of publicly available RDF data about terms and term relationships can be an especially big help to drive improved searches with your own systems. This talk looks at how taxonomies based on semantic technology can help with focusing and augmenting searches, correcting terms, disambiguation, and using other vocabulary metadata sources to improve the data driving your search enhancements.

Developing a Semantic Search Application: A Pharma Case Study

Tom Reamy, Chief Knowledge Architect & Founder, KAPS Group

Adding semantics to search can be a daunting task since it involves dealing with language (messy and not really an IT core competency) and relatively new technologies such as text analytics, and, to work well, requires an interdisciplinary team. Given the complexity and uncertainty of developing semantic search, it usually makes sense to start with a small, focused pilot or POC.This is what one pharmaceutical company decided to do. This talk describes an initial pilot carried out by a varied and diverse team using three different search and text analytics products. It discusses the pluses and problems this diversity caused, what worked best, and what didn’t work so well and covers the key issues and approaches that were needed for success.

VISUALIZING ONTOLOGIES AND METADATA

4:15 p.m. - 5:00 p.m.

Visualization Illuminates Data & Convinces Stakeholders

Suzanne Carroll, Product Director, Data Intelligence, XO Group (The Knot)

Most people’s eyes glaze over at the mention of organizing information,metadata, or taxonomies. Show off a graphic illustrating how the organization fits together, and watch how people are automatically drawn in, finding how they fit in the big picture and asking questions. Put it up by your desk, and they’ll stop by and ask you what it is and—bingo!—instant connection and conversation on your terms. Visualizations are eye-catching conversation starters. This session shares several types of visualizations and discusses which ones work for showing off taxonomies and metadata. It showcases available online tools to build your own visualizations.

Ontology Diagrams for Successful Knowledge Capture & Transfer

Brandon Olson, Associate Professor, School of Business and Technology, College of St. Scholastica

Knowledge rarely resides in a single location. It may be embedded within hundreds or thousands of documents throughout the organization and within the employees’ implicit understandings and experiences. While many knowledge management approaches seek to quickly locate individual knowledge artifacts or facilitate collaboration across knowledge holders, these methods are limited by the number of documents or collaborators involved. Greater value may be realized through a more holistic view of the domain area. This presentation describes the use of ontology diagrams as means to capture knowledge from across many sources and to depict the knowledge in a manner that is easily managed and communicated.

Ontology-Driven Search and Information Access: How Abstractions Become Actionable

Seth Earley, CEO, Earley Information Science

Taxonomies have come a very long way through the years, from rudimentary application in navigation and search thesauri to systems driven by complex domain models with intricate linked knowledge and data. Today, ontologies form the underpinning of unified information access and search-based application development. In this final session of the conference, we see how all of these pieces can come together with examples from the areas of finance, healthcare, and media/entertainment.The applications may look very different, but the end result is the same: getting users to the content and information they need in the context of their processes, helping to reveal knowledge structures and relationships, and allowing on-the-fly synthesis of structured data and unstructured content. We end the conference by providing the ammunition needed to make this case at the highest levels in your company.

Enterprise Solutions Showcase GRAND OPENING RECEPTION

5:00 p.m. - 7:00 p.m.