Corpus

Description

Mind map about linguistic corpora
Mind Map by Jenny Valdez, updated more than 1 year ago
Created by Jenny Valdez almost 9 years ago

Resource summary

Corpus
  1. Collection of texts
    1. Large
      1. Computer readable
        1. Designed for linguistic analysis
        2. Applications
          1. Depend on
            1. Design of corpora
              1. Observational methods of analysis
                1. Interpretation of analysis
                2. Translation studies
                  1. Stylistics
                    1. Forensic linguistics
                      1. Cultural representation & key words
                        1. Psycholinguistics
                          1. Theoretical linguistics
                          2. Modern corpora & software
                            1. Principles
                              1. Observer must not influence what is observed
                                1. Repeated events are significant
                                2. Available corpora
                                  1. 1960s --> 1990 (First generation)
                                    1. Small but carefully designed
                                    2. Carefully designed REFERENCE corpora
                                    3. Corpus design
                                      1. Must be balanced
                                        1. Must include considerable data
                                          1. Running words
                                            1. Size of the audience for the text in corpus
                                            2. Must combine
                                              1. Large general corpora
                                                1. Small corpora for specific knowledge
                                                  1. Opportunistic text collections
                                                2. Some types of corpora (according to process)
                                                  1. Raw
                                                    1. Lemmatized
                                                      1. Annotated
                                                    2. Empirical linguistics
                                                      1. Computer technology is essential
                                                        1. Requires observation
                                                          1. No single method
                                                            1. Uses concordances
                                                              1. Concordance lines
                                                                1. Concordance data
                                                            2. New findings & descriptions
                                                              1. Word frequency
                                                                1. Varies according to text type
                                                                  1. May have different senses
                                                                    1. Requires interpretation by material designers & teachers
                                                                      1. AWL (Academic Word List) can be used as a guide
                                                                      2. Phrase frequency
                                                                        1. Determines word frequency
                                                                          1. Phrase-like units
                                                                            1. Basic units of meaning
                                                                          2. Phrases
                                                                            1. Collocations
                                                                              1. 1, 2, or 3 words co-occurring frequently
                                                                              2. Recurrent phrases
                                                                                1. Frequent multi-word strings
                                                                                  1. Identified using computer programs
                                                                                    1. Identify patterns
                                                                              3. Semantic preference, discourse prosody, and extended lexical units
                                                                                1. Collocation
                                                                                  1. Colligation
                                                                                    1. Semantic preference
                                                                                      1. Discourse prosody
                                                                                        1. Strength & attraction between nodes & collocates
                                                                                          1. Position of nodes & collocates
                                                                                            1. Distribution
                                                                                            2. Grammar, co-text, and text-types
                                                                                              1. Corpus can reveal characteristics
                                                                                                1. Type-token ratio
                                                                                                  1. Lexical density
                                                                                                    1. % of everyday & academic vocabulary
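
Several of the measures named in the map (word frequency, concordance lines, recurrent phrases, type-token ratio) can be illustrated with a few lines of Python. The following is only a minimal sketch, not a method taken from the map: the function names, the naive regular-expression tokenizer, the bracket notation for the node word, and the sample sentence are all assumptions made for illustration. Real corpus software uses far more careful tokenization and much larger texts.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into word tokens (naive, illustrative only)."""
    return re.findall(r"[a-z']+", text.lower())


def word_frequencies(tokens):
    """Count how often each word form occurs (word frequency)."""
    return Counter(tokens)


def type_token_ratio(tokens):
    """Number of distinct word forms (types) divided by running words (tokens)."""
    return len(set(tokens)) / len(tokens) if tokens else 0.0


def concordance(tokens, node, width=3):
    """Return concordance lines: each hit of `node` with `width` words of co-text per side."""
    lines = []
    for i, tok in enumerate(tokens):
        if tok == node:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append(f"{left} [{tok}] {right}".strip())
    return lines


def bigram_counts(tokens):
    """Count adjacent word pairs, a crude stand-in for recurrent two-word phrases."""
    return Counter(zip(tokens, tokens[1:]))


if __name__ == "__main__":
    # Hypothetical sample text; a real corpus would hold many thousands of running words.
    sample = "The corpus is large. The corpus is computer readable."
    tokens = tokenize(sample)
    print(word_frequencies(tokens).most_common(3))
    print(type_token_ratio(tokens))
    for line in concordance(tokens, "corpus", width=2):
        print(line)
    print(bigram_counts(tokens).most_common(2))
```

Raw counts like these are only the observational step. As the map notes, frequency varies by text type and the results still require interpretation.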