Corpus Brasileiro User license
  • Corpus Brasileiro User License Form

    CEPRIL, LAEL, PUCSP, Fapesp
  • The Corpus Brasileiro (Brazilian Corpus) is a 1 billion word corpus of Brazilian Portuguese compiled in 2008-2010. It is currently available for searching on both SketchEngine and Linguateca. The compiliation was supported by CEPRIL, LAEL, and PUCSP, as well as by grants from Fapesp and CNPq. If you want to have access to the texts of the corpus for your own research projects, you can submit a request to download the corpus. 

    Fill out this form and submit it. If your form is approved, you will receive a download link. The whole process of receiving, verifying and approving or rejecting requests is not automatic. This might take at least a week. We apologize for not being able to respond sooner.

  • Terms of the license:

    (1) The corpus will be used for non-profit purposes only.

    (2) The corpus will be used for academic purposes only.

    (3) The corpus will not be distributed in part or whole by any means.

    (4) The download link to the corpus will not be distributed by the person whose name is identified in this license.

    (5) All research that uses the corpus will make reference to the corpus as follows: Corpus Brasileiro (CEPRIL, LAEL, PUCSP, Fapesp)

  •  -
  • 0/80
  • Information about the research project and team that will use the corpus:

  • 0/80
  • 0/500
  • Reload
  • Should be Empty: