’utf-8’ codec can’t decode byte 0xca in position 227: invalid continuation byte"
I already searched this forum and came across some similar threads, where people manually looked for non-ASCII characters in their fastq-files… But I don’t know where to look for, it’s like searching a needle in a haystick. And what does “position 227” mean? The 227th character?
thank you for your reply. I have already looked through all the threads and none of those threads is really resolved, except for the one, where @nick-youngblut found the “ƒ”.
I have opened my fastq-files as .txt files and checked in Excel in the given position (227 in my case). In that cell I can only see a “+”. Even deleting this read completely from the fastq-file gives me the same error ( ’utf-8’ codec can’t decode byte 0xca in position 227: invalid continuation byte ") - even though I should have deleted that line.
I also tried re-converting the files to UTF-8 with the Os text-editor, still the same error.
But I cannot seem to find this character in my fastq-file, whether I search for it in word, excel oder text-edit. (Nor can I find it the decimal code 9577 when I look through the fastq via od -c).
Yes, it was my manifest file, which was corrupted. Somehow an “É” had slipped into it - although I could only see an empty space in my Excel .csv file - after deleting that empty space, everything was fine. (the file command helped me see that my text was not ASCII, so I deleted that space and used iconv to convert it into UTF-8 and everything worked fine. I’m just writing this down in case someone else is experiencing the same problem)
Thank you very much for your support @thermokarst!