Questions tagged [natural-language]
Natural Language Processing is a set of techniques from linguistics, artificial intelligence, machine learning and statistics that aim at processing and understanding human languages.
1,143 questions
1 vote
0 answers
63 views
In the original InstructGPT paper, why is the loss divided by K choose 2?
In the original InstructGPT paper, the loss of the reward model is as follows: Why do the authors divide by ${K}\choose{2}$? If, for example, we have $7$ prompts and $5$ completions per prompt, the ...
0 votes
0 answers
75 views
Conditional independence assumption for Naive Bayes with Multinomial distribution
I was going through Naive Bayes Classifier (from Cornell Machine Learning course (link here) and I found quite confusing the use of the Naive Bayes classifier for bag-of-words with the Multinomial ...
2 votes
0 answers
41 views
How to conduct A/B testing for AI models properly with limited dataset (NLP)
Situation: I want to compare the performance of two models on the same task. I have a dataset of around 400 manually curated samples. The task is relatively niche (targeted sentiment analysis on ...
1 vote
0 answers
41 views
Parsing maritime location ranges
I'm attempting to train a model to parse maritime location ranges. These are strings that can be resolved into a geographical area or a list of shipping ports. An example could be ...
0 votes
0 answers
55 views
Calculating Precision and Recall in Spell Correction when the input sentence has no errors
I am doing a project on spell correction. While evaluating the model results, I came across this situation: the input sentence has no errors, and the model outputs the input sentence as it is, which ...
2 votes
2 answers
143 views
Why we don't mask other layers besides the multihead attention in transformers?
Typically when training for NLP tasks, we need to pad our sequences to a max_len, so they can be processed efficiently in a batch-wise manner. However, these padded ...
1 vote
0 answers
37 views
Stanza bad performance on Named Entity Identification
I'm using both SpaCy and Stanza to identify named entities in very short string (brand names and business names): ...
2 votes
0 answers
40 views
Unsupervised clustering of short texts with covariates
I posted this on the Data Science Stack Exchange and didn’t get any responses (that sight seems pretty dead). So I’m trying here! I'm working on a project where I have to categorise short texts. I don'...