A data-driven learning experiment in the legal English classroom using the FLAX platform


  • María José Marín Pérez, Universidad de Murcia
  • María Ángeles Orts Llopis, Universidad de Murcia
  • Alannah Fitzgerald, Concordia University, Montreal




Keywords: legal English, data-driven learning (DDL), corpus linguistics, learner corpora, open access


This research presents a data-driven learning experiment in the field of legal English in which FLAX, an open-source online self-learning platform, is assessed for its efficacy in helping a group of non-native undergraduate students of legal English (divided into an experimental and a control group) to use legal terminology, amongst other language items, more consistently. The experimental group were instructed to resort only to FLAX and to exploit all of its functionalities. Conversely, the control group could access any information source at hand except the learning platform to complete the same task. Two learner corpora were gathered and analysed on a lexical and pragmatic level to evaluate term usage and distribution, lexical diversity, lexical fundamentality and the use of discourse markers. The results show a tendency on the part of the experimental group towards a more consistent usage of legal terminology, which also appears to be better distributed than the terms in the non-FLAX corpus. By contrast, the lexicon in the FLAX-based corpus tends, on average, to be slightly more basic. Concerning the use of metadiscourse (MD) markers, the experimental group appears to use a marginally greater number of evidentials, endophoric markers and interactional markers.








How to Cite

Marín Pérez, M. J., Orts Llopis, M. Á., & Fitzgerald, A. (2017). A data-driven learning experiment in the legal English classroom using the FLAX platform. Miscelánea: A Journal of English and American Studies, 55, 37–64. https://doi.org/10.26754/ojs_misc/mj.20176814



ARTICLES: Language and linguistics