-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 'transcript_id' with Ensemble human annotation #2
Comments
The problem is that we assume the @abreschi what do you suggest? |
Hi! Sorry about this issue. Unfortunately the transcript_id is a required field in the GTF format (https://genome.ucsc.edu/FAQ/FAQformat.html#format4), even in gene rows. So, I would modify the Ensembl file like @emi80 said. Hope it helps. |
Here, |
Just wanted to add that gencode gtf runs into same issue. |
I'm sorry it didn't fix the issue for me, I have a new error :
I am using the Mus_musculus.GRCm38.83 annotation. If you have a solution, I would appreciate it. Thanks, |
Dear Lea @bellenger-l, You could try using GENCODE annotation files. The release corresponding to mouse ensembl 83 is GENCODE M8. Alternatively, could you provide some lines of your GTF to check what is the problem? As stated in previous comments, make sure that the file follows the proper format. Specially, the |
Dear @dgarrimar, Thanks a lot ! It works like a charm with the Gencode annotation, but it doesn't print the different transcripts under sashimi plots. I can't figure it out which option can do that. Best, |
In principle it should, could you send the command that you are using and the output that you generated? Thanks! |
I'm sorry, I didn't check gencode GTF and the chromosome names were different from Ensembl GTF ("chr1" against "1"), I remove "chr" from first column and now I have the transcripts... Thanks a lot for your help anyway, |
just use transcript_id = d.get("transcript_id","transcript_id_missing") to replace the original code 284th line. |
@kylinson this hack didn't work for me. The following error occured:
I'll edit my gtf as suggested earlier. |
@tangchao7498
|
Dear @archana433, have you tried to use gencode anotation? I believe this is the equivalent to the one you use. Give it a try and let me know! |
thanks , it worked. now got this error
|
Great, as the annotation issue is solved, let's continue the discussion regarding this error in #33. |
Hi, I faced the same problem. I'm trying to run the python script, but instead of changing the GTF file, I added a couple of code lines to ignore the absence of "transcript_id" and, also concatenate gene names with a space:
You might consider adding these couple of lines (from lines: 283-297) to your python script. I know that is not very pythonic. António |
I've fixed this issue with gencodeID, but still works as originally intended with a try statement. This is easier than editing a GTF file. replace: with try statement below.
|
Thx @KrotosBenjamin is by far much more pythonic and elegant. António |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions. |
Following suggestions in PR #52 by @ygidtu, with minor modifications, GTFs with gene rows without the |
From @sridhar0605 originally posted in #1:
When using Homo_sapiens.GRCh37.75.gtf as reference from ensembl, I see this error
Using default tag: latest
latest: Pulling from guigolab/ggsashimi
915665fee719: Pull complete
1a0814f59c8e: Pull complete
b3b71680ed5d: Pull complete
1c3c8afa6ada: Pull complete
2fbeb903a5b4: Pull complete
Digest: sha256:82590f821978568e948ad4861ce009fcb26e7543263bea9d7b78c17667f8d675
Status: Downloaded newer image for guigolab/ggsashimi:latest
Traceback (most recent call last):
File "/sashimi-plot.py", line 592, in
transcripts, exons = read_gtf(args.gtf, args.coordinates)
File "/sashimi-plot.py", line 278, in read_gtf
transcript_id = d["transcript_id"]
KeyError: 'transcript_id'
few lines from gtf:
The text was updated successfully, but these errors were encountered: