__global__

In the main code, the function is called like:
vec_add<<<blocks, threads(d_a, d_b, d_c, n);
This function runs on the GPU, executes many threads at the same time. Each thread runs the same code, but with difference IDs. This is SIMT: singe instruction, multiple threads.