Semi-Automated Identification of Faceted Categories from Large Corpora.

By Academy of Information and Management Sciences Journal

Release Date: 2009-01-01
Genre: Computers

Description

INTRODUCTION

This paper describes FFID (Fast Facet Identifier), a system for computing facets from a corpus of documents. FFID uses a fast, simplified clustering algorithm that identifies hundreds of facet clusters in a corpus of hundreds of thousands of sentences within seconds. Automatic facet identification may be a powerful tool for designing better information retrieval systems.

The goal of information retrieval is to support people in searching for the information they need. Given an information problem, finding relevant (let alone high-quality) documents is difficult, and the sheer amount of information available online makes it harder still. The size of the web is debatable (Markoff, 2005), but by now it must contain at least 12,000 million pages. If each of these web pages were printed on a standard A4 sheet of paper (21 cm wide) and the sheets were laid side by side in a straight line, the line would wrap around the Earth about 60 times. This is a lot of information.

People learn about their information problem, and about the information resource they are using, through interaction with that resource. Human-computer interaction is the crucial phenomenon of the information retrieval process. Fast algorithms, hardware for storage and processing, and data and knowledge structures are important, but they are useless if we do not understand how humans interact with machines when looking for information. Every technique we use must first take into account whom we are doing this for: the user. Users encounter several problems when they approach an information resource:
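
The paper-sheet estimate in the introduction can be checked with a few lines of arithmetic. The sketch below is only an illustration: the page count and sheet width come from the text, while the Earth's circumference (about 40,075 km at the equator) and the function name are assumptions introduced here.

```python
# Back-of-the-envelope check of the "about 60 Earth circumferences" estimate.
PAGES = 12_000_000_000           # 12,000 million web pages (figure from the text)
SHEET_WIDTH_M = 0.21             # A4 sheet width, 21 cm (figure from the text)
EARTH_CIRCUMFERENCE_KM = 40_075  # assumption: approximate equatorial circumference


def circumferences(pages: int, sheet_width_m: float, circumference_km: float) -> float:
    """Return how many times a line of sheets would wrap around the Earth."""
    total_km = pages * sheet_width_m / 1000  # metres to kilometres
    return total_km / circumference_km


laps = circumferences(PAGES, SHEET_WIDTH_M, EARTH_CIRCUMFERENCE_KM)
print(f"{laps:.1f} Earth circumferences")
```

Under these assumptions the result is a little over 60, which is consistent with the rounded figure quoted in the text.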

More by Academy of Information and Management Sciences Journal

Employee Performance Evaluation Using the Analytic Hierarchy Process (Manuscripts)

Heuristics for Scheduling Operations in MRP: Flowshop Case (Material Requirements Planning)

The eBay Factor: The Online Auction Solution to the Riddle of Reverse Logistics (Manuscripts)

Six Sigma and Innovation (Manuscripts)

E-Commerce Security Standards and Loopholes (Manuscripts)

Customer Relationship Management Strategies for the Internet (Company Overview)

Toward an Understanding of MIS Survey Research Methodology: Current Practices, Trends, and Implications for Future Research (Manuscripts)

Artificial Neural Network Application to Business Performance with Economic Value Added (Manuscripts)

Predicting Leadership Success in Agile Environments: An Inquiring Systems Approach (Report)

Functional Requirements for Secure Code: The Reference Monitor and Use Case.

The Role of Consultants in the Implementation of Enterprise Resource Planning Systems (Report)

Integrating SAP R/3 Applications Into a Total Quality Management Course (Manuscripts)

Expansion Into the Future: Healthcare and Information Systems Technology.

Production & Operations Quality Concepts: Deficient Diffusion Into the Service Sector (Manuscripts)

Understanding the Lack of Minority Representation in Graduate Programs in Computer Science and Information Technology: A Focus Group Study of Student Perceptions (Report)

Students' Personality Type and Choice of Major.

Exploratory Research to Apply Leadership Theory to the Implementation of Radio Frequency Identification (RFID) (Report)

A Social Engineering Project in a Computer Security Course.

Domain Names and Trademarks--the Unhappy Marriage Continues But the Rules are Clearer.

Objects-First vs. Structures-First Approaches to OO Programming Education: An Empirical Study.

Interpretation of Shifted Binary Interpretive Framework Coefficients Using a Classical Regression Problem (Manuscripts)

Comparisons of Performances Between Online Learners and Offline Learners Across Different Types of Tests (Report)

2+2 Tier Banded Frameworks of Interconnectedness: Industry Structure Determinants.

Comment Generation with Three Electronic Brainwriting Techniques.

The Probability of Winning and the Effect of Home-Field Advantage: The Case of Major League Baseball.

Case Use: Mixed Signals from the Marketplace (Manuscripts)

Managers' Perceptions of the Role of IT in Organizational Change (Public Service Organizations)

Online Trading: Problems and Challenges (Manuscripts)

GSS Anonymity Effects on Small Group Behavior (Group Support Systems) (Report)

The Case for Measuring Supplier Satisfaction (Manuscripts)

The Impact of General and System-Specific Self-Efficacy on Computer Training Learning and Reactions.

Quantitative Methods Professors' Perspectives on the Cost of College Textbooks.

Impact of Mastery Based Learning Approaches on Student Performance in an Undergraduate Management Science Course.

Designing EDA/SQL Middleware Systems to Integrate Web Database and Legacy Database Systems for E-Business in an Electric Utility Company (Manuscripts)

Enterprise Resource Planning Software As an Organizing Theme for MBA Curricula.

The Routinization of Web-Based Supplier Diversity Initiatives (Manuscripts)

Instant Messenger Communication in a Multinational Corporation (Survey)

Experiments on Design Models Supplied by Their Manufacturer: An Example of Management Science in Practice (Manuscripts)

The MIS Academic Area: The State of the Profession (Manuscripts)

Testing the Validity of Miles and Snow's Typology.

Optimizing Metal Cutting Cost by Integration of Cost of Quality Using Taguchi's Loss Function.

Return-To-Scale In Production-Service-Demand System Application to the Airport Congestion's Problem (Manuscripts)

A Parsonian Perspective on Change Given the Orthodox Paradigm of Functionalism.

Component-Oriented Middleware for E-Business: COM+ and EJB Application Servers.

Examining the Differences in Gender Perception in the Use of Speech Recognition As a Tool in Group Support Systems.

A Review of the Interrelationship Among Management, Information Technology, and In-House End-User: Empirical Propositions.

Internet Pricing: Best Effort Versus Quality of Service.