MATLAB: Gene contains only ATGC

atgcgenesnucleotides

I have a list of genes (strings of letters) and I want delete the ones that has letters other than ATGC (N for example). how can I do it?
Thank you!

Best Answer

  • You just want to remove the invalid letters? The code below will do that. If you want to check if a gene string is invalid you can check that with any(~L).
    gene='ATTCAGN';
    L=ismember(gene,'ATGC');
    gene=gene(L);