Instead of scraping Jisho.org for this, I would recommend going straight to the source dictionary (JMdict) and looking at the entries tagged news1, ichi1, spec1, spec2 or gai1 in the ke_pri/re_pri fields. Alternatively, you can directly download the wordfreq file that the news1 and gai1 tags are based on.
You can find all these files and many more here: http://ftp.monash.edu/pub/nihongo/00INDEX.html#oth_fil
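If it helps, here is a rough sketch of how that filtering could look in Python. It assumes the English-only JMdict XML file (JMdict_e) with the usual entry / k_ele / r_ele / sense layout; the function names and the tiny sample document are made up for illustration and are not part of any official tool.

```python
import io
import xml.etree.ElementTree as ET

# Priority tags mentioned above; an entry carrying any of them is kept.
COMMON_TAGS = {"news1", "ichi1", "spec1", "spec2", "gai1"}

def common_entries(source):
    """Yield (word, reading, glosses) for entries with a common-word tag.

    `source` is a file name or a binary file object containing JMdict XML.
    """
    for _event, entry in ET.iterparse(source, events=("end",)):
        if entry.tag != "entry":
            continue
        # Collect every ke_pri / re_pri value found anywhere in this entry.
        pri = {e.text for e in entry.iter() if e.tag in ("ke_pri", "re_pri")}
        if pri & COMMON_TAGS:
            keb = entry.findtext("k_ele/keb")   # first kanji form, if any
            reb = entry.findtext("r_ele/reb")   # first reading
            glosses = [g.text for g in entry.iterfind("sense/gloss") if g.text]
            yield (keb or reb, reb, glosses)
        entry.clear()  # drop the entry's children to keep memory use low

def write_wordlist(entries, path):
    """Write one tab-separated line per entry: word, reading, glosses."""
    with open(path, "w", encoding="utf-8") as out:
        for word, reading, glosses in entries:
            out.write(f"{word}\t{reading}\t{'; '.join(glosses)}\n")

# Invented miniature document, only for trying the function out.
SAMPLE = """<JMdict>
<entry><k_ele><keb>田</keb><ke_pri>ichi1</ke_pri></k_ele>
<r_ele><reb>た</reb></r_ele>
<sense><gloss>rice field</gloss></sense></entry>
<entry><r_ele><reb>ぴよぴよ</reb></r_ele>
<sense><gloss>cheep cheep</gloss></sense></entry>
</JMdict>""".encode("utf-8")

result = list(common_entries(io.BytesIO(SAMPLE)))
```

With the real file, something like write_wordlist(common_entries("JMdict_e"), "common.txt") should produce the text file, assuming the unpacked dictionary is saved under that name in the current directory.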
Thank you!
The link doesn't seem to work anymore. Can anyone help?
For anyone in the future looking for JMdict files: go to your search engine of choice and search for "JMdict", and you're highly likely to find the page you're looking for. If you cannot find the files, check Wikipedia to see what happened: https://en.wikipedia.org/wiki/JMdict If Wikipedia doesn't exist anymore, well... greetings from the past. Or just click on the "JMdict" link in the footer.
A way to get a text file of common words?
Hi, I'm trying to get a simple text file of the words that the dictionary classes as common. I know this may have been asked before, but the one post that seemed promising did not have a working link to the file. If someone could point me to a suitable page, or tell me how to use the API to get definitions a bit like this:
'の
indicates possessive
wild
indicates a confident conclusion
た
multi-
rice field'
that would be great. I'm a beginner in Japanese, and I think a list of the words rated as #common (no matter how many) would help me in the future.
Thank you