Research Commons
      • Browse 
        • Communities & Collections
        • Titles
        • Authors
        • By Issue Date
        • Subjects
        • Types
        • Series
      • Help 
        • About
        • Collection Policy
        • OA Mandate Guidelines
        • Guidelines FAQ
        • Contact Us
      • My Account 
        • Sign In
        • Register
      View Item 
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      •   Research Commons
      • University of Waikato Research
      • Computing and Mathematical Sciences
      • Computing and Mathematical Sciences Papers
      • View Item
      JavaScript is disabled for your browser. Some features of this site may not work without it.

      Utilizing lexical data from a Web-derived corpus to expand productive collocation knowledge

      Wu, Shaoqun; Witten, Ian H.; Franken, Margaret
      Thumbnail
      Files
      Wu Witten Franken 2010.pdf
      437.6Kb
      DOI
       10.1017/S0958344009990218
      Find in your library  
      Citation
      Export citation
      Wu, S., Witten, I.H. & Franken, M. (2010). Utilizing lexical data from a Web-derived corpus to expand productive collocation knowledge. ReCALL (22), 83-102.
      Permanent Research Commons link: https://hdl.handle.net/10289/3531
      Abstract
      Collocations are of great importance for second language learners, and a learner’s knowledge of them plays a key role in producing language fluently (Nation, 2001: 323). In this article we describe and evaluate an innovative system that uses a Web-derived corpus and digital library software to produce a vast concordance and present it in a way that helps students use collocations more effectively in their writing. Instead of live search we use an off-line corpus of short sequences of words, along with their frequencies. They are preprocessed, filtered, and organized into a searchable digital library collection containing 380 million five-word sequences drawn from a vocabulary of 145,000 words. Although the phrases are short, learners can browse more extended contexts because the system automatically locates sample sentences that contain them, either on the Web or in the British National Corpus. Two evaluations were conducted: an expert user tested the system to see if it could generate suitable alternatives for given text fragments, and students used it for a particular exercise. Both suggest that, even within the constraints of a limited study, the system could and did help students improve their writing.
      Date
      2010
      Publisher
      European Association for Computer Assisted Language Learning
      Rights
      Copyright © European Association for Computer Assisted Language Learning 2010
      Collections
      • Computing and Mathematical Sciences Papers [1455]
      • Education Papers [1416]
      Show full item record  

      Usage

      Downloads, last 12 months
      117
       
       
       

      Usage Statistics

      For this itemFor all of Research Commons

      The University of Waikato - Te Whare Wānanga o WaikatoFeedback and RequestsCopyright and Legal Statement