In this case, I put a PDF file named WuJWEBSCI2012-crawling.pdf in the repository and configured PDFMEF to load from the file system, and I got the following error. It looks like the current code expects file names made up only of numeric characters, which is not the case in general.
Traceback (most recent call last):
  File "main.py", line 140, in <module>
    ids = wrapper.get_document_ids()
  File "/home/jxw394/github/pdfmef/src/extractor/python_wrapper/wrappers.py", line 75, in get_document_ids
    ids.append(utils.file_name_to_id(docPath[docPath.rfind('/') + 1 : docPath.rfind('.pdf') + 4]))
  File "/home/jxw394/github/pdfmef/src/extractor/python_wrapper/utils.py", line 10, in file_name_to_id
    return int(ID)
ValueError: invalid literal for int() with base 10: 'WuJWEBSCI2012-crawling'
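One possible direction for a fix, sketched under assumptions: the traceback only shows that `file_name_to_id` ends in `return int(ID)`, so the body below is a hypothetical stand-in and not the actual PDFMEF code. It strips the directory and extension, returns an integer when the remaining name is numeric, and otherwise falls back to the name itself as a string ID.

```python
import os


def file_name_to_id(file_name):
    """Hypothetical replacement for utils.file_name_to_id (not the PDFMEF source).

    Returns an int for purely numeric names and the bare name otherwise.
    """
    # Drop any directory part and the extension, e.g. '/repo/123.pdf' -> '123'.
    stem, _ext = os.path.splitext(os.path.basename(file_name))
    try:
        return int(stem)
    except ValueError:
        # Names such as 'WuJWEBSCI2012-crawling' are kept as string IDs
        # instead of raising.
        return stem


print(file_name_to_id("WuJWEBSCI2012-crawling.pdf"))
print(file_name_to_id("000123.pdf"))
```

Whether string IDs are acceptable depends on how the rest of the pipeline uses the values returned by `get_document_ids()`; any code that assumes integers would need to be checked as well.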