
Questions tagged [artificial-intelligence]

Questions about design and properties of agents that act in a dynamic environment and make decisions towards some goal without user control.

0 votes
0 answers
25 views

I am trying to understand how the reduce operation that PyTorch performs in its backward pass for broadcasted tensors actually works under the hood. I am trying to make a C++ library for neural networks ...
Rishabh Agarwal
0 votes
0 answers
30 views

I'm building a U-Net for predicting density maps. The ground truth maps are generated by labeling centroids in the objects of interest in the original image (they are all of the same class), forming a ...
jojusuar
0 votes
0 answers
64 views

Let us say that in the not-so-distant future, people might have to convince an LLM that they are worthy of a job. Is it possible to use prompt injection to convince the LLM that you are worthy of a ...
Neil Meyer
-3 votes
1 answer
93 views

I gave this prompt to ChatGPT o3-mini-high: Give me a novel algorithm not discovered yet that reduces the complexity of matrix multiplication. Do you think this algorithm is correct and truly novel? ...
MP123
  • 9
0 votes
0 answers
34 views

I am building a CNN from scratch. I built all the layers: a first layer of convolution->pooling, a second layer of convolution->pooling, then flatten, then feed into a deep network (built that ...
BIBIDH SUBEDI
0 votes
1 answer
112 views

I would like to implement a genetic algorithm to solve the matchmaking problem between offers and demands in a marketplace. I found a research paper which proposes the following encoding: each ...
Fabio Nardelli
1 vote
1 answer
87 views

I am struggling to understand this algorithm. Is anybody willing to explain to me how the tree is built in RETE and how it helps with concrete inputs?
BanikPyco
  • 111
3 votes
2 answers
215 views

Even if it may be complicated, is it possible with the present technology?
Ayush
  • 33
1 vote
0 answers
115 views

In the GT (Graph Transformer) model presented by Dwivedi & Bresson in "A Generalization of Transformer Networks to Graphs", the following equations are used to update node and edge ...
td12345
  • 111
-1 votes
1 answer
80 views

I have a bunch of transcripts from online videos, but I suspect the transcripts aren't formatted well. There is no punctuation, sentences are broken off abruptly, and some words aren't complete (for example ...
Mr.A
  • 1
1 vote
2 answers
256 views

Can we assume that a 7 GB LLM knows 7 GB of text information? (By 7 GB I mean a 7-billion-parameter model at Q8 quantization, or a 1.75-billion-parameter model at full FP32 precision.) For example, a 70,...
Zenytt
  • 61
2 votes
2 answers
137 views

I was studying AI when a question came to my mind. I know that one of the objections to the possibility of a thinking machine examined by Turing is the so-called mathematical objection, ...
Amanda Wealth
1 vote
1 answer
95 views

In this post I asked why the sigmoid/softmax function is used in classification: "Binary Classification- Non-Differentiable Loss Function". But I have a follow-up question: We're assuming that the ...
Allan Henriques
2 votes
3 answers
3k views

A lot of AI hardware coming out lately has its performance quoted in TOPS, i.e. trillion operations per second. Does anyone have an idea how to estimate LLM performance on such hardware in tokens ...
Zenytt
  • 61
1 vote
1 answer
72 views

For binary classification using linear regression, we pass the output z of the linear regression through the sigmoid function so that if the linear regression takes an input x which should be ...
Allan Henriques
