A Form-based approach to natural language query processing

Nabil R. Adam, Aryya Gangopadhyay, James Clifford

Research output: Contribution to journal > Article

6 Scopus citations

Abstract

We describe a methodology for processing data retrieval and update queries using a form-based natural language interface. For the purpose of illustration, we use computer integrated manufacturing (CIM) as the application domain. The interface consists of a set of fourth-generation interface tools (SQL forms), a set of form definitions, a lexicon, and a parser. The forms are developed from the functional and data models of the system. A form definition consists of a form name, a form object, a set of form fields, and a set of fragment grammars. A form object is a single or composite entity that uniquely identifies a form. Form fields consist of database fields whose values can be entered by users (user-defined), and others whose values can be derived by the system (system-defined). Fragment grammars are templates that identify the information requested by user queries. The lexicon consists of all words recognized by the system, their grammatical categories, synonyms, and associations (if any) with database objects and forms. The parser scans a natural language query to identify a form in a bottom-up fashion. The information requested by the user query is determined in a top-down manner by matching the fragment grammars associated with a form against the user query. Extragrammatical inputs with limited deviations from the grammar rules are supported. Elliptical queries are supported by deriving the missing information from that specified in previous queries and forms. Combining a natural language processor with SQL forms allows update queries and prevents violation of database integrity constraints, duplication of records, and invalid data entry.
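
The components named in the abstract can be pictured with a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the form `PART_FORM`, its fields, the lexicon entries, and the encoding of a fragment grammar as a regular expression are all hypothetical. Handling of extragrammatical and elliptical queries is not covered.

```python
import re
from dataclasses import dataclass


@dataclass
class FormDefinition:
    name: str                # form name
    form_object: str         # entity that uniquely identifies the form
    user_fields: list        # fields whose values users enter
    system_fields: list      # fields whose values the system derives
    fragment_grammars: dict  # requested item -> regex template (assumed encoding)


# Hypothetical lexicon: word -> (category, canonical synonym, associated form or None)
LEXICON = {
    "list": ("verb", "show", None),
    "show": ("verb", "show", None),
    "vendor": ("noun", "supplier", "PART_FORM"),
    "supplier": ("noun", "supplier", "PART_FORM"),
    "part": ("noun", "part", "PART_FORM"),
}

# Hypothetical form definition for a manufacturing parts database
FORMS = {
    "PART_FORM": FormDefinition(
        name="PART_FORM",
        form_object="part",
        user_fields=["part_no", "supplier"],
        system_fields=["quantity_on_hand"],
        fragment_grammars={
            "supplier": r"supplier (?:of|for) part (?P<part_no>\w+)",
        },
    ),
}


def tokenize(query):
    """Split a query into lowercase alphanumeric words."""
    return re.findall(r"[a-z0-9]+", query.lower())


def identify_form(query):
    """Bottom-up step: each recognized word votes for its associated form."""
    votes = {}
    for word in tokenize(query):
        form = LEXICON.get(word, (None, None, None))[2]
        if form is not None:
            votes[form] = votes.get(form, 0) + 1
    return max(votes, key=votes.get) if votes else None


def process(query):
    """Top-down step: match the chosen form's fragment grammars against the query."""
    form_name = identify_form(query)
    if form_name is None:
        return None
    # Replace known words with their canonical synonyms before matching.
    text = " ".join(LEXICON[w][1] if w in LEXICON else w for w in tokenize(query))
    for item, pattern in FORMS[form_name].fragment_grammars.items():
        match = re.search(pattern, text)
        if match:
            return form_name, item, match.groupdict()
    return None
```

In this sketch, a call such as `process("List the vendor for part P100")` first selects `PART_FORM` from the lexicon associations of "vendor" and "part", then uses the form's fragment grammar to determine that the supplier is being requested for a specific part number.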

Original language: English (US)
Pages (from-to): 109-135
Number of pages: 27
Journal: Journal of Management Information Systems
Volume: 11
Issue number: 2
State: Published - Jan 1 1994

All Science Journal Classification (ASJC) codes

  • Management Information Systems
  • Computer Science Applications
  • Management Science and Operations Research
  • Information Systems and Management

Keywords

  • 4GL
  • Database management systems
  • Natural language interface
  • Query processing
