Cuda migration #122
base: master
Conversation
cava/samples/cudart/cudart.cpp (Outdated)
```diff
 }

 __host__ cudaError_t CUDARTAPI
 cudaMemcpy(void *dst, const void *src, size_t count, enum cudaMemcpyKind kind)
 {
     ava_argument(dst) {
         if (kind == cudaMemcpyHostToDevice) {
-            ava_opaque;
+            ava_handle;
```
As this change requires annotating devptrs as handles, which don't support the offsetting (+/-) operation, I think we should merge this after we redesign the handle.
Do you know of any test applications that this breaks?
There is another issue. Devptrs should be handles everywhere or nowhere. I don't think this spec makes the devptrs in arguments to kernels handles.
Do you know of any test applications that this breaks?
Not for the Rodinia CUDA benchmarks. But for the supported AI frameworks (like TF), copying memory at offsets is normal behavior.
There is another issue. Devptrs should be handles everywhere or nowhere. I don't think this spec makes the devptrs in arguments to kernels handles.
@arthurp The spec for cuLaunchKernel already checked if the argument was a handle and applied the annotation accordingly. Is this not doing the right thing?
```
CUresult CUDAAPI
cuLaunchKernel(CUfunction f,
               unsigned int gridDimX,
               unsigned int gridDimY,
               unsigned int gridDimZ,
               unsigned int blockDimX,
               unsigned int blockDimY,
               unsigned int blockDimZ,
               unsigned int sharedMemBytes,
               CUstream hStream,
               void **kernelParams,
               void **extra)
{
    ava_argument(hStream) ava_handle;
    ava_argument(kernelParams) {
        ava_in; ava_buffer(ava_metadata(f)->func->argc);
        ava_element {
            // FIXME: use the generated index name in the spec to
            // reference the outer loop's loop index at this moment.
            if (ava_metadata(f)->func->args[__kernelParams_index_0].is_handle) {
                ava_type_cast(void *);
                ava_buffer(ava_metadata(f)->func->args[__kernelParams_index_0].size);
                ava_element ava_handle;
            } else {
                ava_type_cast(void *);
                ava_buffer(ava_metadata(f)->func->args[__kernelParams_index_0].size);
            }
        }
    }
    ava_argument(extra) {
        ava_in; ava_buffer(__helper_launch_extra_size(extra));
#warning The buffer size below states that every kernelParams[i] is 1 byte long.
        ava_element ava_buffer(1);
    }
}
```
If you haven't touched that piece of spec, then it treats devPtr kernel args as non-handle.
What should it look like? It seems like it checks if each argument is a handle and if it is adds the ava_handle annotation. Is this doing something different?
cudaLaunchKernel spec:

```
ava_argument(args) {
    ava_in;
    ava_buffer(ava_metadata(func)->func->argc);
    ava_element {
        // FIXME: use the generated index name in the spec to
        // reference the outer loop's loop index at this moment.
        if (ava_metadata(func)->func->args[__args_index_0].is_handle) {
            ava_type_cast(void *);
            ava_buffer(ava_metadata(func)->func->args[__args_index_0].size);
            // ava_element ava_handle;
        } else {
            ava_type_cast(void *);
            ava_buffer(ava_metadata(func)->func->args[__args_index_0].size);
        }
    }
}
```
Oh, it's commented out now. Should it be uncommented?
Uncommented `ava_element ava_handle` and the behavior is unchanged.
If you haven't touched that piece of spec, then it treats devPtr kernel args as non-handle.
@yuhc can you elaborate on this?
Force-pushed from 3313d5e to b63b9cf
Force-pushed from b63b9cf to f5e3728
Force-pushed from f5e3728 to 6c955d7 ("Memory allocations must be handles")
This needs a range map to handle pointer arithmetic correctly. In what part of the source tree should the files for the range map go?
Authors: @ArnavMohan and @arthurp
Added annotations to cudaMalloc, cudaFree for migration support.
Memory allocations must be handles.