Long long fix #191

TomMelt · 2024-11-18T09:32:53Z

closes #183

Problem

This issue also affected Windows and is also fixed by this PR.

TL;DR: FTorch expects a long int*, whilst the libtorch binary seems to return a long long int*.

Example output below.

[ 25%] Building CXX object CMakeFiles/ftorch.dir/ctorch.cpp.o
/Users/user/FTorch/src/ctorch.cpp: In function 'const long int* torch_tensor_get_sizes(torch_tensor_t)':
/Users/user/FTorch/src/ctorch.cpp:240:25: error: invalid conversion from 'const long long int*' to 'const long int*' [-fpermissive]
  240 |   return t->sizes().data();
      |          ~~~~~~~~~~~~~~~^~
      |                         |
      |                         const long long int*
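
The error arises because long int and long long int are distinct C++ types even where both are 64 bits wide, so a pointer to one never converts implicitly to a pointer to the other. Assuming libtorch stores sizes as int64_t (which is believed to be the case, but is not confirmed in this thread), the builtin type behind int64_t is a platform choice: typically long on 64-bit Linux with glibc and long long on Windows. A minimal sketch for checking this on a given machine; the helper names are hypothetical and not part of FTorch:

```cpp
#include <cassert>
#include <cstdint>
#include <type_traits>

// long and long long are always distinct types, even when both are 64-bit,
// so `const long long*` never converts implicitly to `const long*`.
static_assert(!std::is_same<long, long long>::value,
              "long and long long are distinct types");

// Hypothetical helpers (not part of FTorch): report which builtin type
// std::int64_t is an alias for on the platform being compiled.
constexpr bool int64_aliases_long() {
  return std::is_same<std::int64_t, long>::value;
}

constexpr bool int64_aliases_long_long() {
  return std::is_same<std::int64_t, long long>::value;
}

// Human-readable name of the type behind std::int64_t.
const char *int64_underlying_name() {
  return int64_aliases_long()        ? "long int"
         : int64_aliases_long_long() ? "long long int"
                                     : "other";
}

// Runtime sanity check: on mainstream platforms exactly one alias holds.
void check_int64_alias() {
  assert(int64_aliases_long() != int64_aliases_long_long());
}
```

Printing int64_underlying_name() on each target platform shows which pointer type a function returning the sizes array would need there.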

Solution

The solution is to check which system we are running on (OS and architecture) and define a preprocessor macro, UNIX (passed to the compiler as -DUNIX), that we can use to switch between code for the long long int and long int versions of the libtorch library.

FTorch/src/CMakeLists.txt

Lines 57 to 62 in 5fb0887

    
    if(UNIX)
      message(STATUS "CMAKE_SYSTEM_PROCESSOR = ${CMAKE_SYSTEM_PROCESSOR}")
      if(CMAKE_SYSTEM_PROCESSOR STREQUAL "x86_64")
        target_compile_definitions(${LIB_NAME} PRIVATE UNIX)
      endif()
    endif()

Then, in the source code, we check for that macro, e.g.:

FTorch/src/ctorch.h

Lines 128 to 132 in 5fb0887

    
    #ifdef UNIX
    EXPORT_C const long int* torch_tensor_get_sizes(const torch_tensor_t tensor);
    #else
    EXPORT_C const long long int* torch_tensor_get_sizes(const torch_tensor_t tensor);
    #endif
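
The same switch can be illustrated without libtorch. The sketch below is a stand-in, not FTorch's actual code: MockTensor, mock_size_t and mock_tensor_get_sizes are hypothetical names, and the UNIX macro is assumed to be supplied by the build system as in the CMake snippet above. It routes the macro through a single type alias so that the function is declared only once:

```cpp
#include <cassert>
#include <vector>

// The build system is assumed to define UNIX on x86_64 Unix-like systems.
#ifdef UNIX
typedef long int mock_size_t;
#else
typedef long long int mock_size_t;
#endif

// Hypothetical stand-in for a tensor; it stores its shape using the same
// element type that the macro selected.
struct MockTensor {
  std::vector<mock_size_t> shape;
};

// One definition covers both cases because the return type follows the alias.
extern "C" const mock_size_t *mock_tensor_get_sizes(const MockTensor *tensor) {
  assert(tensor != nullptr);
  return tensor->shape.data();
}
```

Unlike this mock, the real library fixes the element type itself, so the macro has to be defined exactly on the platforms where libtorch returns long int, otherwise the conversion error shown in the Problem section reappears.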

TODO

works on Windows
works on Linux
works on Mac (M1/M2)
create Windows CI (this will be a separate PR): Windows CI build #168

TomMelt · 2024-11-19T10:15:44Z

@jwallwork23 @jatkinson1000, to make reviewing easier this PR should be merged before #137.

TomMelt · 2024-11-19T13:55:59Z

src/CMakeLists.txt

+add_library(${LIB_NAME} SHARED ctorch.cpp ftorch.F90 ftorch_test_utils.f90)
+
+if(UNIX)
+  message(STATUS "CMAKE_SYSTEM_PROCESSOR = ${CMAKE_SYSTEM_PROCESSOR}")


Suggested change

message(STATUS "CMAKE_SYSTEM_PROCESSOR = ${CMAKE_SYSTEM_PROCESSOR}")

Just remembered I left this print statement in for debugging. I should take this out.

Did you remove this?

Not yet. I will look at fixing the Mac build before I take it out completely.

jwallwork23

This looks sensible to me, @TomMelt. I haven't tested it on my laptop running Windows but I can do if needed.

TomMelt · 2024-11-21T13:53:59Z

This looks sensible to me, @TomMelt. I haven't tested it on my laptop running Windows but I can do if needed.

Thanks @jwallwork23. Wouldn't hurt if you get time, but I did test on a Windows VM so it should be fine 👌

I do plan on setting up a Windows CI soon, so that should also catch any issues.

jatkinson1000 · 2024-12-03T08:40:14Z

It seems that this incompatibility still exists on Mac 12 and 13; see #192.

So the CMake/#ifdef logic might need some refinement.
IIRC 12 and 13 are x86 images rather than Arm, but I'd need to check that.
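
One possible refinement, offered as an assumption rather than something this PR does: if the libtorch sizes are int64_t under the hood (believed to be the case, but not verified here against every release), the declaration could use int64_t directly and let each platform's headers decide whether that means long or long long. A minimal sketch with hypothetical names (SketchTensor, sketch_tensor_get_sizes) and no libtorch dependency:

```cpp
#include <cassert>
#include <cstdint>
#include <type_traits>
#include <vector>

// Hypothetical stand-in for a tensor whose shape is stored as int64_t.
struct SketchTensor {
  std::vector<std::int64_t> shape;
};

// Returning const int64_t* avoids choosing between long and long long by
// hand, so no OS or architecture detection is needed for this declaration.
extern "C" const std::int64_t *
sketch_tensor_get_sizes(const SketchTensor *tensor) {
  assert(tensor != nullptr);
  return tensor->shape.data();
}

// The return type is the same spelling on every platform.
static_assert(std::is_same<decltype(sketch_tensor_get_sizes(nullptr)),
                           const std::int64_t *>::value,
              "sizes are exposed as const int64_t*");
```

With this approach the Fortran interface would presumably need a matching 64-bit kind (for example c_int64_t from iso_c_binding) instead of a kind tied to long or long long.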

jwallwork23 · 2024-12-03T15:00:19Z

src/ctorch.h

+EXPORT_C const long long int *
+torch_tensor_get_sizes(const torch_tensor_t tensor);


I'm surprised the linter likes these not being on the same line.

honestly... it makes me sad. I had them on the same line, but the linter forces it to split :(

Can maybe use a config file to extend the column limit?
https://clang.llvm.org/docs/ClangFormatStyleOptions.html

I'm a little confused, when I run clang-format locally it moves them to be placed on one line...

I wonder if this is to do with the default settings being different for different clang-format versions? We observed this in nextSIM-DG and so pinned the version used by the CI.

I have just opened #199 into this branch, which should resolve this by fixing the Cpp ColumnLimit to 88.

jwallwork23

Approved once, will approve again. Thanks @TomMelt!

jatkinson1000

As discussed in-person, I think this can go in now and the issues with mac can be resolved in a separate set of issues.

Thanks @TomMelt!

fixes [#183](#183) There is an issue when building on Mac (arm64) or Windows. The version of `libtorch` there exposes a torch tensor's shape (`t->sizes().data()`) as a `const long long int*` instead of a `const long int*` as on Linux and Mac (x86). This commit adds a preprocessor macro to switch between the two implementations, with the correct version detected automatically at the CMake build stage.

jatkinson1000 · 2024-12-04T16:40:08Z

Have rebased #199 on this if you want to merge.

TomMelt self-assigned this Nov 19, 2024

TomMelt requested a review from jatkinson1000 November 19, 2024 08:51

TomMelt force-pushed the long-long-fix branch 3 times, most recently from e491c33 to 5fb0887 Compare November 19, 2024 10:04

TomMelt changed the base branch from main to melt-windows-fix November 19, 2024 10:05

TomMelt added bug Something isn't working enhancement New feature or request hackathon labels Nov 19, 2024

TomMelt requested a review from jwallwork23 November 19, 2024 10:12

TomMelt marked this pull request as ready for review November 19, 2024 10:12

jatkinson1000 mentioned this pull request Nov 19, 2024

[TEST: DO NOT MERGE] Long long fix mac test #192

Closed

jwallwork23 approved these changes Nov 21, 2024

TomMelt force-pushed the melt-windows-fix branch from 594e938 to 4d66327 Compare December 2, 2024 10:24

TomMelt force-pushed the long-long-fix branch from 5fb0887 to a005817 Compare December 2, 2024 10:31

TomMelt mentioned this pull request Dec 3, 2024

Windows Installation Issue #124

Closed

jwallwork23 reviewed Dec 3, 2024

jwallwork23 approved these changes Dec 3, 2024

jatkinson1000 approved these changes Dec 3, 2024

TomMelt force-pushed the long-long-fix branch from a005817 to 6ec89a0 Compare December 4, 2024 15:37

Add clang-format config to set ColumnLimit for Cpp to 88.

5aff7db

TomMelt merged commit fad331e into melt-windows-fix Dec 9, 2024
4 checks passed

TomMelt deleted the long-long-fix branch December 9, 2024 15:34
