The company exploration licence reports are a subset of the Department for Energy and Mining publications and reports database, filtered by the Reference Code ENV (Envelope - open file). The original reports are in PDF format. In addition, the AWS Textract optical character recognition (OCR) tool has been used to create more readily searchable versions of most of these reports. This is a basic OCR process that cannot reproduce all the text in every document exactly, but it is useful for producing a purely text-based version of the reports. Textract produces two outputs: a plain-text version of the PDF with the suffix '.txt', and a JSON version of the PDF with the suffix '.json'. The JSON versions also record the position on the page of each piece of text. The OCR process failed for a small number (<100) of the reports in this folder; for these, only the PDF version is available. The index_20200227.zip file in the mer-env AWS bucket contains a list of the reports and related metadata, such as report title and abstract, in CSV format.
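As an illustration of how the positional information in the '.json' files might be used, the Python sketch below pulls each line of text and its bounding box out of a Textract result. It assumes the files follow the standard Textract response layout (a top-level "Blocks" list whose entries carry "BlockType", "Text", "Page" and "Geometry"); this has not been checked against the files in this folder, and the function name extract_lines, the SAMPLE data and the file name report.json are hypothetical.

```python
import json  # only needed when loading a real file (see note below)


def extract_lines(textract_doc: dict) -> list[dict]:
    """Return the text and page position of every LINE block.

    Assumes the standard Textract layout: a top-level "Blocks" list.
    Bounding box values are fractions of the page width/height (0 to 1).
    """
    lines = []
    for block in textract_doc.get("Blocks", []):
        # Textract also emits PAGE and WORD blocks; keep whole lines only.
        if block.get("BlockType") != "LINE":
            continue
        box = block.get("Geometry", {}).get("BoundingBox", {})
        lines.append(
            {
                "page": block.get("Page", 1),
                "text": block.get("Text", ""),
                "left": box.get("Left"),
                "top": box.get("Top"),
                "width": box.get("Width"),
                "height": box.get("Height"),
            }
        )
    return lines


# Hand-made sample in the assumed layout (not taken from a real report).
SAMPLE = {
    "Blocks": [
        {"BlockType": "PAGE", "Page": 2},
        {
            "BlockType": "LINE",
            "Page": 2,
            "Text": "Drill hole summary",
            "Geometry": {
                "BoundingBox": {
                    "Left": 0.125,
                    "Top": 0.25,
                    "Width": 0.5,
                    "Height": 0.0625,
                }
            },
        },
        {"BlockType": "WORD", "Page": 2, "Text": "Drill"},
    ]
}

for line in extract_lines(SAMPLE):
    print(line["page"], line["top"], line["text"])
```

For a downloaded file, the same function could be applied to the parsed contents, for example extract_lines(json.load(open("report.json"))). If a file turns out to hold a list of responses rather than a single one, each element would need to be passed in separately.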
To download files: click on the file displaying the required file type to start the download process.