Create tests for Vakya Analyzer #155

kmadathil · 2021-02-03T01:00:57Z

Locate the DCS10K and DCS4K datasets mentioned in this paper. Also, look at the larger dataset mentioned in this later paper.

From these, create a set of testcases for the vakya analyzer.

Also, figure out their actual definitions for Precision, Recall and F-Score
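
As a baseline to compare against while reading the papers, the usual set-based definitions can be sketched as below. This assumes the standard convention and is not confirmed to be the definition the papers use; the function name `precision_recall_f1` and the example word lists are hypothetical.

```python
def precision_recall_f1(predicted, gold):
    """Standard set-based precision, recall and F1.

    Assumed convention only; the papers may define these differently.
    """
    predicted, gold = set(predicted), set(gold)
    true_positives = len(predicted & gold)
    # Guard against empty inputs to avoid division by zero.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical example: words proposed by an analyzer vs. a gold segmentation.
predicted_words = ["sA", "tu", "eva", "muKam"]
gold_words = ["sA", "tu", "mahASvetAyAH", "eva"]
p, r, f = precision_recall_f1(predicted_words, gold_words)
```

If the papers score at the sentence level or weight partial matches, only the counting of true positives would need to change.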

kmadathil · 2021-02-03T01:13:47Z

An even better option may be the smaller 1300-sentence test set found in this even later paper. Its advantage is that it is available on GitHub.

The paper author has provided another publication with a better description of this work

avinashvarna · 2021-02-05T16:09:57Z

Thanks for the investigation. I've been a bit busy, but I will try to catch up this weekend by reading the papers.

The KISS paper has some superficial similarities. I found the supplementary material helpful in understanding the methodology, but probably need to read it a few more times to completely understand it.

We should probably plan out a proper sequence of next steps (based on priorities). Once we are ready to discuss this step, it may be helpful to have a call.

kmadathil · 2021-02-11T22:09:25Z

I have received the DCS10K and KISS datasets from Amrith Krishna. KISS has been committed into the DB. DCS10K will be added after I figure out how to do so (it has too many directories).

I have added basic test infrastructure and added a test_parser.py. I will close this after I get KISS tests working.

gasyoun · 2021-03-09T04:49:37Z

https://zenodo.org/record/803508# is from 2017, so there has been a DCS update after it.

> smaller 1300 sentence testset

There is this set of sentences that J. Huet trained on as well.

apte-verified.txt

gasyoun · 2021-03-16T15:20:32Z

https://kmadathil.github.io/sanskrit_parser/ui/index.html?api_url_base=https://sanskrit-parser.appspot.com/ What did I do wrong? Nothing was ever returned.

drdhaval2785 · 2021-03-16T15:47:33Z

The web service is closed because it had few users. As far as I remember, the README was also updated recently to say so explicitly, but I am not able to locate it now.

avinashvarna · 2021-03-16T16:54:36Z

@drdhaval2785 Actually, we created a different web service on Google App Engine which is always enabled.

@gasyoun Thanks for reporting this issue. Looking at the logs, it does seem to be related to the parsing. I see logs of the form:

ERROR:sanskrit_parser.parser.datastructures:Partition 4: eva went to zero length!

@kmadathil can you please take a look to see if this works from the command line? I can also take a look, but probably not until the weekend.

kmadathil · 2021-03-16T17:15:45Z

@gasyoun Please try a different input. This is an error condition that is somehow hanging the API.

avinashvarna · 2021-03-16T17:38:08Z

Actually, sorry. The log I was looking at was for a slightly shorter input than what was in the reported issue. It appears that this input is causing the parse to take > 30s (which is the time limit on App Engine), and the process gets killed. GAE instances are not super-high performance, so we may need further optimizations.
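
A quick way to check locally whether an input would fit within that limit is to time the call. The sketch below is only an illustration: `fake_parse` is a hypothetical stand-in for the real analyzer call, and `timed_call` and `fits_budget` are made-up helper names. Local timings will also differ from those on a GAE instance.

```python
import time

TIME_LIMIT_SECONDS = 30.0  # request limit on App Engine mentioned above


def timed_call(func, *args, **kwargs):
    """Run func and return (result, elapsed_seconds)."""
    start = time.perf_counter()
    result = func(*args, **kwargs)
    elapsed = time.perf_counter() - start
    return result, elapsed


def fits_budget(elapsed, limit=TIME_LIMIT_SECONDS):
    """True if the measured time is under the request limit."""
    return elapsed < limit


def fake_parse(sentence):
    # Hypothetical stand-in for the real analyzer call.
    return sentence.split()


words, elapsed = timed_call(fake_parse, "sA tu mahASvetAyA eva muKam avalokitavatI")
print(f"{len(words)} words, {elapsed:.3f}s, fits budget: {fits_budget(elapsed)}")
```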

gasyoun · 2021-03-16T22:19:54Z

> It appears that this input is causing the parse to take > 30s

How many words can I input?

kmadathil · 2021-03-20T03:58:36Z

I've sped this case up using on_the_fly constraint checking (explained in the Sphinx document). This case now takes about 8 seconds on my computer:

time python scripts/sanskrit_parser vakya "sA tu mahASvetAyA eva muKam avalokitavatI" --input SLP1  --min-cost --max-paths 10
...
real    0m8.508s
user    0m8.256s
sys     0m0.248s

@avinashvarna - thanks for the idea! Please update appspot to v0.2.3

avinashvarna · 2021-03-21T22:09:43Z

I updated, but the online version still times out for this input (runs in a container after all).

gasyoun · 2021-03-23T21:45:38Z

> I updated, but the online version still times out for this input (runs in a container after all).

So no way to test the scripts on the web, only locally?

kmadathil · 2021-03-23T21:50:22Z

Please hold on while we update the web service. We are working through some deployment issues with the sped-up code. It should work for you after that.

gasyoun · 2021-03-23T23:53:04Z

> It should work for you after that.

Oh, ok, I can wait for a few hours anyway ))

avinashvarna · 2021-03-24T03:01:06Z

> So no way to test the scripts on the web, only locally?

If you are comfortable with python notebooks, you can use Binder and modify this notebook for your input to test it out online.

gasyoun · 2021-04-01T11:21:54Z

> python notebooks, you can use Binder and modify this notebook

I would ask for a video intro, if possible, please.
