Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Better handling of mRNAs with multiple CDSs #165

Open
standage opened this issue Feb 2, 2016 · 0 comments
Open

Better handling of mRNAs with multiple CDSs #165

standage opened this issue Feb 2, 2016 · 0 comments

Comments

@standage
Copy link
Member

standage commented Feb 2, 2016

Technically, an mRNA feature with multiple coding sequences is not invalid. In fact, the canonical gene from the GFF3 spec includes such a case (alternative start codons). In practice, different CDSs often have their own dedicated mRNA features, even if those mRNAs have identical structure.

Recently I've come across a couple of cases in NCBI GFF3 files where mRNAs have multiple CDSs, but in each case it appeared to be an erroneous duplicated annotation. Diagnostic messages from AEGeAn weren't that helpful: basically stating that the following assertion had failed in agn_typecheck_feature_combined_length.

agn_assert(strcmp(id, fid) == 0);

This particular issue should probably be handled somewhere upstream, so that mRNAs with multiple CDSs are flagged before this function is called on them. Whether to issue a warning and proceed or to halt at once needs to be considered.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant