Questions tagged [machine-learning]
Machine learning algorithms build a model of the training data. The term "machine learning" is vaguely defined; it includes what is also called statistical learning, reinforcement learning, unsupervised learning, etc. ALWAYS ADD A MORE SPECIFIC TAG.
20,437 questions
0 votes
0 answers
20 views
Modeling recurring monthly transactions with weekend-shift effects: DBSCAN vs rule-based temporal detection?
I have 3 months of categorized bank transaction data and need to identify recurring cash inflows and outflows for lending risk modeling. Complications: 1. Income dates shift earlier when payday falls ...
0 votes
0 answers
22 views
Is the figure showing margin violation for the support vector machine correct?
I am listening to a lecture on soft margin SVM https://youtu.be/XUj5JbQihlU?si=b66SblRnw9mmczVU&t=2969 The lecturer says that the blue dot represents a violation of the margin. I don't really ...
1 vote
0 answers
28 views
Is the strong duality of the hard-margin SVM really trivially satisfied all the time?
It is widely known that if you were to calculate the maximizer of the dual SVM program (denote as $\alpha^*$), then the primal minimizer of the hard-margin SVM program, \begin{aligned}&{\underset {...
0 votes
0 answers
25 views
Time-based regression: is it leakage if training includes snapshots closer to the event than those used at prediction?
I’m building a regression model that predicts the final number of vehicles booked for a ferry trip. Each training row represents the state of bookings for a given trip N days before departure. Example ...
0 votes
0 answers
35 views
Extending the TVD-MI mechanism beyond information-based questions for scalable oversight
TVD-MI (Total Variation Distance–Mutual Information) has been proposed as a mechanism for evaluating the trustworthiness of judges (such as LLMs scoring code correctness or theorem validity) without ...
1 vote
0 answers
22 views
Designing a demand forecasting model with a dynamic daily update and a final horizon prediction — best practices to avoid leakage?
I am working on a demand forecasting problem for ferry vehicle capacity. For each voyage, I have daily snapshots of the cumulative reservations from the opening date until departure day. So each ...
1 vote
0 answers
48 views
What is the best statistical approach to forecast cash flow from run-off debt vintages with a growing balance?
community. I'm facing a modeling problem for cash flow forecasting and would like to know what the most robust mathematical/statistical approach is to solve it. The Problem: Debt Recovery Forecasting ...
1 vote
0 answers
24 views
When testing a specific hypothesis regarding HTE with "best_linear_projection" in a Causal Forest, is it valid to halve the p-value?
I’m using the "grf" package in R and its "best_linear_projection" function, which regresses doubly robust (AIPW) scores on a set of covariates/features. I have a directional ...