How can one process natural language (English) to extract named entities, i.e. perform named entity recognition (NER)? Or, at least, extract entities that can be mapped to a Mathematica `Entity`? Note that trying to map entities via simple tokenization, POS tagging, or the dependency tree is not the same as NER. Also, I am aware that one can train a neural network to do NER, but I would hope there is an easier solution within Mathematica.
1 Answer
As mentioned in the comments, Mathematica v12 does indeed provide named entity recognition via the `TextContents` function. See the documentation for details.
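For illustration, here is a minimal sketch of how this might look in practice. The sample sentence and the chosen content types (`"Person"`, `"City"`) are assumptions made for the example, and the exact results will depend on the version and on the sentence. `TextCases`, used in the last line, is a related built-in function rather than part of `TextContents`.

```mathematica
(* Sample sentence, made up for this example *)
text = "Ada Lovelace was born in London and later worked with Charles Babbage.";

(* Dataset of everything recognized in the text *)
TextContents[text]

(* Restrict recognition to particular content types *)
TextContents[text, {"Person", "City"}]

(* Ask for the interpretation of each match, which is typically an Entity *)
TextCases[text, "City" -> "Interpretation"]
```

The `"Interpretation"` property is what addresses the second half of the question, since it returns the matches as Wolfram Language objects rather than as plain strings.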
… `WordData` as its head instead of `List` on POS queries. Your mileage may vary.

… `JLink` and connect to, say, the Java Stanford NER library. (And with the results from those, do `Entity` mapping...)