Skip to main content

Questions tagged [cuda]

CUDA is a parallel computing platform and programming model for Nvidia GPUs (Graphics Processing Units). CUDA provides an interface to Nvidia GPUs through a variety of programming languages, libraries, and APIs.

2 votes
1 answer
90 views

I was recently working on my CUDA wrappers library, and this particular class is one of the oldest pieces of code in the entire project. Since that time, I added tons of other features (for example <...
NeKon's user avatar
  • 641
12 votes
1 answer
799 views

When I discovered that CUDA device memory was represented by plain old void* I was horrified by having to deal with C-style type safety and resource ownership (i.e. ...
Toby Speight's user avatar
  • 88.6k
7 votes
1 answer
265 views

I've implemented a resource management class for CUDA interop using RAII to ensure exception safety. The goal is to handle the registration/unregistration and mapping/unmapping, of graphics resources (...
NeKon's user avatar
  • 641
1 vote
0 answers
94 views

This is some kind of follow up to my previous question, this question will be more focused on the actual tessellating pipeline. What I changed from previous question Implemented the async sphere ...
NeKon's user avatar
  • 641
1 vote
0 answers
67 views

I was working on my version of "Universe Sandbox" and first thought comes to your mind is "where the hell are my planets?" so I thought loading models sucks and made this thing, It'...
NeKon's user avatar
  • 641
8 votes
1 answer
291 views

I've implemented a feature in my C++ fractal explorer application to switch between CUDA and NVRTC. The main reason for the NVRTC/Driver API context is to support runtime compilation of custom CUDA ...
NeKon's user avatar
  • 641
15 votes
1 answer
2k views

I'm looking for feedback and suggestions on improving the performance and quality of my CUDA kernel for rendering the Mandelbrot set. I've implemented a "ping-pong" style coloring and ...
NeKon's user avatar
  • 641
3 votes
1 answer
103 views

I'm developing a fractal renderer in CUDA and need advice on tracking the total number of iterations performed during rendering. This is important for real-time dragging and zooming performance. ...
NeKon's user avatar
  • 641
6 votes
0 answers
168 views

I am doing a fractal renderer using CUDA, SFML, C++, recently optimized it to eat less memory, now I am going to optimize the actual fractals, because for some reason, it is the most holding back ...
NeKon's user avatar
  • 641
2 votes
1 answer
85 views

One instance of the following module uses up to almost 75% of my vram. So, I was wondering how I could improve that without slowing down runtime too much. The code is below: ...
Jayson Meribe's user avatar
3 votes
1 answer
129 views

I'm a new student in reinforcement learning. Below is the code that I wrote for deep Q learning: ...
Jahid Chowdhury Choton's user avatar
1 vote
0 answers
252 views

To multiply the matrices A and B using the outer product of vectors, we can express each row of matrix A as a row vector and each column of matrix B as a column vector. Then, we can take the outer ...
user366312's user avatar
2 votes
1 answer
173 views

I need to apply the coint function from the statsmodels library to 207 times series with 1397 points each, two by two. Currently, it takes between 35-40 minutes on my computer with an Intel 24 Cores ...
Begoodpy's user avatar
  • 135
5 votes
3 answers
237 views

Do you have any suggestions for improving the efficiency of the code below? I believe that better optimization can be implemented in the GPU function cuKer_sum, which is located in the ...
Anomalous Physicst's user avatar
5 votes
1 answer
223 views

My first time writing anything significant in CUDA. This kernel takes two arrays representing square matrices and compares them pair-wise. It takes into consideration large input arrays, and ...
l3utterfly's user avatar

15 30 50 per page