Hi guys,
I had 22 samples which for four groups which I merged (following DADA2) using the following code:
qiime feature-table group
--i-table dada2-test-last-table.qza
--p-axis 'sample'
--m-metadata-file tissue-metadata.tsv
--m-metadata-column "group"
--p-mode 'mean-ceiling'
--o-grouped-table grouped-table.qza
I am now trying to create the taxa barplots for my groups, but am struggling to format the new metadata file without getting the following (or similar) error:
There was an issue with loading the file tissue-data.tsv as metadata:
Metadata file must be encoded as UTF-8 or ASCII. The following error occurred when decoding the file:
'utf-8' codec can't decode byte 0x8e in position 10: invalid start byte
There may be more errors present in the metadata file. To get a full report, sample/feature metadata files can be validated with Keemei: https://keemei.qiime2.org
Find details on QIIME 2 metadata requirements here: Metadata in QIIME 2 — QIIME 2 2022.2.0 documentation
My initial metadata file is very simple as below:
sample-id | barcode-sequence | group |
---|---|---|
#q2:types | categorical | categorical |
T11 | NAAGGCGA+NGTCTAAT | A |
T12 | NGTACTAG+NGTCTAAT | A |
T13 | NGGCAGAA+NGTCTAAT | A |
T14 | NCCTGAGC+NGTCTAAT | A |
T15 | NGACTCCT+NGTCTAAT | A |
T21 | NAGGCATG+NGTCTAAT | B |
T22 | NTCTCTAC+NGTCTAAT | B |
T23 | NGAGGCTG+NGTCTAAT | B |
T24 | NAGAGGCA+NGTCTAAT | B |
T25 | NTAGAGGA+NGTCTAAT | B |
T28 | NCTCATGA+NGTCTAAT | B |
T29 | NCTCATGA+NGTCTAAT | B |
T31 | NAAGGCGA+NCTCTCCG | C |
T32 | NGTACTAG+NCTCTCCG | C |
T33 | NGGCAGAA+NCTCTCCG | C |
T34 | NCCTGAGC+NCTCTCCG | C |
T36 | NGACTCCT+NCTCTCCG | C |
T41 | NAGGCATG+NCTCTCCG | D |
T44 | NTCTCTAC+NCTCTCCG | D |
T45 | NGAGGCTG+NCTCTCCG | D |
T46 | NAGAGGCA+NCTCTCCG | D |
T47 | NTAGAGGA+NCTCTCCG | D |
And I have been trying the following metadata file for my merged groups, with the sample ID as the original group from the previous metadata tsv, and assigning new group names:
sample-id | group |
---|---|
#q2:types | categorical |
A | control |
B | DMSO |
C | MCC950 |
D | Z-IETD-FMK |
I am just wondering how to rectify this, should I have something in the barcode-sequence file? There doesn't appear to be any barcode identifier in the Feature table summary from my merged groups (below)
Sample ID | Feature Count |
---|---|
A | 126365 |
B | 121736 |
C | 113228 |
D | 98207 |
Any help much appreciated!!
-Cian