what is corpus linguistics ?

زائر

The term corpus ,derived from the Latin word for body ,was first encountered in the 6th century to referto a collectio of legal texts.The term corpus has preserved this initial meaning i.e.a body of text;according to one of the five definitions provided by the Oxford Dictionary ,acorpus is 'the body of written or spoken material upon which a linguistic analysis is based'.Franceis(1992) mentions three main areas in which corpora has traditionally been used :lexicographicalstudies in the creation of dictionaries ,dialectological studies and the creation of grammars.Corpus research continued and even strengthened due to technological advances in computer software.Now it is possible to process texts of several million words in length (Sinclair 1991( .

الموضوع : what is corpus linguistics ?

المصدر : منتديات تخاطب ta5atub.com

اضغط هنا لتحميل كتب الأدب العربي والعالمي

اضغط هنا لتحميل كتب فكرية أو ثقافية أو فلسفية

اضغط هنا لتحميل أي كتاب في تخصصات أخرى

اضغط هنا لتحميل كتب د محمد محمد يونس علي

للذهاب إلى الفهرس والمنتديات اضغط هنا

زائر

Corpus ,meaning 'a collection of writings', has a plural corpora, although corpuses is increasingly found. In the domain of language and linguistics it has used to refer to a collection of texts of all kinds, written and spoken , which are read and analysed by a computer program designed to produce statistics and sort the material into accessible forms, usually as a screen concordance of consecutive lines with the word being studied(the key word) in the centre of each line. The best known corpora in current use are the British National Corpus, the Cobuild Corpus (Collins-Birmingham University International Language Database),now called the Bank of English. The survey of English Usage (at University College London),and the Oxford English Corpus on which the present work extensively draws.

Source:
-Oxford Pocket Fowler's Modern English Usage
-Oxford University Press, second Edition 2008

الموضوع : what is corpus linguistics ?

المصدر : منتديات تخاطب ta5atub.com

اضغط هنا لتحميل كتب الأدب العربي والعالمي

اضغط هنا لتحميل كتب فكرية أو ثقافية أو فلسفية

اضغط هنا لتحميل أي كتاب في تخصصات أخرى

اضغط هنا لتحميل كتب د محمد محمد يونس علي

للذهاب إلى الفهرس والمنتديات اضغط هنا

زائر

What is corpus linguistics?
Corpus linguistics is viewed by some as an empirical method of linguistic analysis and description, using real-life examples of language date stored in corpora as the starting point (Crystal, 1992; Jackson, 2007).(p,29) Corpus linguistics is maturing methodologically (McEnery and Wilson, 2001); it is an approach or methodologically for studying language use (Bowker and Pearson, 2002: 9). Other view corpus linguistics as theory, and so much more than methodology (Sinclair, 1994, 1996, 2001, 2004). Halliday (1993:4) asserts that corpus linguistics 're-unites data gathering and theorizing and this is leading to a qualitative change in our understanding of language'. Teubert and Krishnamurthy (2007), suggest that corpus linguistics is a 'bottom-up' approach that looks at 'the full evidence of the corpus', analyses the evidence with the aim of finding probabilities, trends, patterns, co-occurrences of elements, features or groupings of features. Corpus linguistics is regarded as a new philosophical approach to linguistic enquiry (Tognini - Bonelli, 2001: 1).

A corpus is not just any collection of texts; it is a collection of naturally occurring language texts, chosen to characterize a state or variety of a language (Sinclaire, 1991:171). In other words, a corpus is designed and compiled based on corpus design principles. Sinclair (2005a) details a set of core principles and these are listed below:
1. Corpus contents are selected based on their communicative purpose in the community without regard for the language that they contain.
2. The control of subject matter in the corpus is imposed by the use of external, and not internal, criteria.
3. Only components in the corpus that are designed to be independently contrasted are contrasted (i.e., 'orientation').
4. Criteria determining the structure of the corpus are small in number, separate from each other, and efficient at delineating a corpus that in representative.
5. Samples of language for the corpus, whenever possible, consist of entire texts.)
6. Any information about a text, such as part-of-speech tags and the typography and layout of a printed document, should be stored separately from the plain text (i.e., the words and punctuation of the text) and only merged when needed.
7. The design and composition of the corpus are fully documented with full justifications.
8. The corpus design includes, as target notions, representativeness and balance.
9. The corpus aims for consistency in its components while maintaining adequate coverage.

الموضوع : what is corpus linguistics ?

المصدر : منتديات تخاطب ta5atub.com

اضغط هنا لتحميل كتب الأدب العربي والعالمي

اضغط هنا لتحميل كتب فكرية أو ثقافية أو فلسفية

اضغط هنا لتحميل أي كتاب في تخصصات أخرى

اضغط هنا لتحميل كتب د محمد محمد يونس علي

للذهاب إلى الفهرس والمنتديات اضغط هنا