Feature request: Parse DMs, add user names and handles #6
Comments
I see lots of usernames and handles in both the Twitter archive and the markdown produced by this script. Can you give an example of where data is missing? The Twitter archive does lose a lot of context: it only contains your tweets and replies, not entire threads. This script doesn't currently attempt to parse the DMs. |
Same here, would love to see handles/usernames in the DM section. |
I'm struggling to understand what's being asked here. If this is a bug report: Please be precise about what the script did and what you were expecting it to do instead. It sounds like you are talking about missing data in Twitter's archive? If so, then I would have thought that's a bug for Twitter? If this is a feature request: Please give more details on what you would like the script to do. Currently it doesn't do anything with DMs. |
I can only speak for myself, but from my POV this is a feature request. |
OK. I have changed the title to reflect my understanding. I don't have any immediate plans to address this, but maybe someone else would want to take a look. There are many other Twitter archive parsers out there that may well already do this. |
Great :) |
I made a tool to turn those archive IDs into name, bio, and real URL: https://gist.github.com/n1ckfg/df70c6fa1dabac4fe55cb551364adcc5 |
I made a script to parse user IDs and map them to handles. It is different from the scripts linked above in that it doesn't need login or access to Twitter's API, because it uses the TweeterID web service to look up the handles. It also finds some of the handles in the archive itself (looking in mentions and retweets). Sometimes it also finds display names and links, but it can't look up the bio or profile picture yet. Currently, it just writes the mappings into a JSON file, but you might want to use it already anyway, in case Twitter goes down even faster than expected... The script is available in the userids branch in my fork of this project: @lenaschimmel and I are working on integrating it into the main parser script and will probably be making a pull request to the main project here later. But integrating it properly might take a few days, so if you're in a hurry, feel free to use my version in the meantime :) |
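For readers following along, here is a minimal sketch of the "find handles in the archive itself" idea described above. It is not the code from the userids branch. It assumes that the archive's tweet file is JavaScript of the form `window.YTD.tweets.part0 = [...]` and that each tweet carries `entities.user_mentions` entries with `id_str` and `screen_name` fields; the function names `parse_ytd_js` and `collect_handles` are hypothetical, and the sample data is made up.

```python
import json


def parse_ytd_js(text):
    """Parse an archive .js file, assuming it is 'window.YTD.<name> = ' followed by a JSON array."""
    start = text.index('[')  # skip the assumed JavaScript assignment prefix
    return json.loads(text[start:])


def collect_handles(tweets):
    """Build a user id -> handle mapping from the mentions found in each tweet."""
    handles = {}
    for entry in tweets:
        # Assumption: each entry wraps the tweet in a 'tweet' key; fall back to the entry itself.
        tweet = entry.get('tweet', entry)
        for mention in tweet.get('entities', {}).get('user_mentions', []):
            user_id = mention.get('id_str')
            handle = mention.get('screen_name')
            if user_id and handle:
                handles[user_id] = handle
    return handles


# Made-up sample standing in for the contents of the archive's tweet file.
sample = (
    'window.YTD.tweets.part0 = [{"tweet": {"entities": {"user_mentions": '
    '[{"id_str": "12", "screen_name": "jack", "name": "jack"}]}}}]'
)
handles = collect_handles(parse_ytd_js(sample))

# Write the mapping out as JSON text, as the comment above describes.
mapping_json = json.dumps(handles, indent=2)
print(mapping_json)
```

Only accounts that were mentioned at least once can be resolved this way, which is presumably why a lookup service is still needed for the remaining IDs.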
@flauschzelle Thanks for looking into this. I was just looking at the JSON for this myself:
For my archive this gives me 234 handles and is enough for making a start on parsing DMs, followers/followings. Maybe we should get that basic functionality working and then add the lookup feature afterwards? |
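As a rough illustration of how such a handle mapping could be used when parsing DMs, the sketch below labels each message with the sender's handle when it is known and falls back to the raw ID otherwise. The layout (`dmConversation`, `messages`, `messageCreate`, `senderId`, `text`, `createdAt`) is an assumption about the archive's DM file, and `format_dm` and `format_conversations` are hypothetical helper names, not part of the script discussed here.

```python
def format_dm(message, handles):
    """Render one DM as a line of text, using the handle if the sender id is known."""
    sender_id = message['senderId']
    handle = handles.get(sender_id)
    label = f'@{handle}' if handle else f'user {sender_id}'
    return f"{message['createdAt']} {label}: {message['text']}"


def format_conversations(conversations, handles):
    """Flatten all conversations into a list of formatted lines."""
    lines = []
    for conversation in conversations:
        # Assumption: messages live under dmConversation -> messages -> messageCreate.
        for item in conversation.get('dmConversation', {}).get('messages', []):
            message = item.get('messageCreate')
            if message:  # skip other event types such as joins or leaves
                lines.append(format_dm(message, handles))
    return lines


# Made-up data: one known sender (id 12) and one unknown sender (id 34).
handles = {'12': 'jack'}
conversations = [{
    'dmConversation': {
        'conversationId': '12-34',
        'messages': [
            {'messageCreate': {'senderId': '12', 'recipientId': '34',
                               'text': 'hello', 'createdAt': '2022-11-20T10:00:00.000Z'}},
            {'messageCreate': {'senderId': '34', 'recipientId': '12',
                               'text': 'hi', 'createdAt': '2022-11-20T10:01:00.000Z'}},
        ],
    },
}]
lines = format_conversations(conversations, handles)
for line in lines:
    print(line)
```

Falling back to the numeric ID keeps the output usable with a partial mapping, so the lookup feature can be added afterwards without changing the DM parsing.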
I'm trying to understand what you are currently doing and if / how much it overlaps with what @flauschzelle and I have already done / are about to do... So this is already done now by @flauschzelle:
Currently working on:
Things I/we still plan to do:
Things that seem useful, but that I didn't really look into:
|
@lenaschimmel Yes, there was some overlap. The branch looks good. To avoid calamity let's tackle it in small PRs:
|
I've just made a note about getting full user data from the API (without a key!) on the followers issue: |
Updated roadmap with current progress:
|
Is there any way to add code to pull deleted DMs from your own personal account? |
The current Twitter archive download omits all the user names and handles. It only contains the IDs of the accounts that someone interacted with. As a result, the archive loses context, especially for the DMs and replies.