You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for open-sourcing the ChineseWebText2 dataset! I’m very interested in using it for my research. Could you please provide more details about the data source(s) used for this dataset? Additionally, I’d like to know the method or tools used to parse the data into the jsonl format.
The text was updated successfully, but these errors were encountered:
Thank you for open-sourcing the ChineseWebText2 dataset! I’m very interested in using it for my research. Could you please provide more details about the data source(s) used for this dataset? Additionally, I’d like to know the method or tools used to parse the data into the jsonl format.
The text was updated successfully, but these errors were encountered: