Data flow from submission to results

The graph below summarizes the MGnify data flow from submission to analysis results:

_images/submit_graph_08_web032.png

(1) Submissions are handled by the European Nucleotide Archive (ENA) and therefore users must have an ENA Webin account.

In addition, users submitting private data must provide an expressed agreement that MGnify can access their data for analysis, as described under Submit data. Otherwise, we will not be able to access their data. MGnify will, of course, handle this data confidentially.

(2) Access to the ENA submission page requires a login using a registered email address or a Webin identifier (Webin-XXXX).

(3 and 4): upload and submission.

These steps are described in detail in the ENA online guides. The MGnify and EMBL-EBI online tutorials provide a step by step guide to submission. Please also check our FAQs.

Note that all queries concerning data submission should be directed to ENA dedicated help desk

The following ENA submission criteria must be fulfilled. Please note, MGnify will not have access to retrieve data for analyses until these criteria are met.

Assemblies:

The associated sample taxonomy must be in the metagenomic tax tree 408169. Please see the environmental taxonomy guidelines for further details.

Raw metagenomic or metatranscriptomic reads:

Same taxonomy guidelines as assemblies apply AND/OR the library source should be either ‘METAGENOMIC’ or ‘METATRANSCRIPTOMIC’. Please see the library source guidelines for further details.

After validation by ENA, if the above criteria are met we will be able to access the submitted data.

To request private analysis with MGnify, navigate to the home page, click ‘Submit and/or Request’ and complete the form. You will need to login to request analysis of private data. To request analysis of a public dataset in ENA click ‘Request’ and complete the form. Once we have received the requests, they will be queued for analysis (more details about our Analysis pipeline).

The length of time required for analysis varies according to the number of projects in the queue and the nature and number of runs in the submission. However, we aim to have most analyses completed in less than a week once validated by ENA.

(5) Upon completion of analysis, data will be uploaded on the website

MGnify pipeline will generate a number of charts and downloadable files (Files available to download on the MGnify website).

(6) For private data, users will have to login to the MGnify website to access their data, until they become public

(7) Private data will become public after an initial confidential period of two years. Submitters will receive an email from ENA prior to the public release of their data, giving them the opportunity to extend the confidential period which is set to two years as default (as indicated at Can I change the release date of my project?).