I have developed a NER model to detect all address and property price independently in a pdf document which have property address and its prices in natural language. There are lots of variations in how property address and prices are mentioned. It could be in described sentencse or sometimes like and many more
One possibility
address 1 details about address 1 details about address 1 price 1 address 2 details about address 2 details about address 2 price 2 So the model in a document would predict say 5 different address and 5 different property prices.
Questions
- Now how to build model to assign the price to the correct address?
- How to encode this link in the training data and learn that?

