Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

WIP: Metal backend #1287

Open
wants to merge 389 commits into
base: main
Choose a base branch
from
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
389 commits
Select commit Hold shift + click to select a range
ccd72bf
implement gather sampling
SamoZ256 Aug 29, 2024
a4ff855
do buffer copying in a void vertex function
SamoZ256 Aug 29, 2024
2403cf9
always use blit commands for buffer copies on non-apple GPUs
SamoZ256 Aug 29, 2024
1cfb841
do buffer cache uploading on the GPU
SamoZ256 Aug 29, 2024
2f822d0
support game pad view in a separate window
SamoZ256 Aug 30, 2024
41ee2e7
Merge branch 'main' into metal
SamoZ256 Aug 30, 2024
1412d1e
enable triangle fan support
SamoZ256 Aug 30, 2024
9c29acc
synchronize buffer copying correctly
SamoZ256 Aug 30, 2024
cda4799
disable writes for color attachments which are not in the active FBO
SamoZ256 Aug 30, 2024
a9a4d7b
disable depth write if active FBO doesn't have a depth attachment
SamoZ256 Aug 30, 2024
5eb3026
skip unnecessary draws
SamoZ256 Aug 31, 2024
f9f6260
monitor clears
SamoZ256 Aug 31, 2024
5d07d11
emulate unsupported shadow sampler types
SamoZ256 Sep 1, 2024
491ac69
fix: object shader error when used with rect primitive
SamoZ256 Sep 1, 2024
45737e8
disable shader write usage on textures
SamoZ256 Sep 2, 2024
c4eb195
make all textures be at least 1x1x1
SamoZ256 Sep 2, 2024
953975f
don't jit compile vertex shaders
SamoZ256 Sep 3, 2024
2ee92e5
use the correct fetch shader
SamoZ256 Sep 3, 2024
b13ba58
correct the object shader hash
SamoZ256 Sep 4, 2024
8a74445
don't compile fragment shaders just-in-time
SamoZ256 Sep 4, 2024
cd72ad8
include color format data types in ps hash
SamoZ256 Sep 5, 2024
4251f3f
check for invalid color formats
SamoZ256 Sep 6, 2024
6a3bdd4
refactor pixel format support
SamoZ256 Sep 6, 2024
548ffb6
add: todo notices
SamoZ256 Sep 6, 2024
5e50592
Merge pull request #3 from SamoZ256/metal-shaders
SamoZ256 Sep 6, 2024
3dc233f
support rasterization kill
SamoZ256 Sep 7, 2024
cf56024
optimize rasterization kill
SamoZ256 Sep 7, 2024
8404981
make binary archives device and os version specific
SamoZ256 Sep 7, 2024
e8f726e
use lcr
SamoZ256 Sep 7, 2024
395cd1c
handle rasterization kill for mesh shaders
SamoZ256 Sep 11, 2024
950f04d
support instancing for mesh shaders
SamoZ256 Sep 11, 2024
e7f8f0e
simplify uniform names
SamoZ256 Sep 11, 2024
a328c5e
use uchar for index type
SamoZ256 Sep 11, 2024
4cce369
put query object into a separate file
SamoZ256 Sep 12, 2024
eb7c10e
implement occlusion queries
SamoZ256 Sep 13, 2024
fd16488
improve command buffer sync
SamoZ256 Sep 13, 2024
934b1f8
handle occlusion queries with no draws
SamoZ256 Sep 13, 2024
e89efed
request soon commit instead of committing directly
SamoZ256 Sep 13, 2024
e5dcd93
Merge pull request #4 from SamoZ256/metal-occlusion-queries
SamoZ256 Sep 14, 2024
b5954d8
release command buffers properly
SamoZ256 Sep 14, 2024
25a1d7e
Merge branch 'cemu-project:main' into metal
SamoZ256 Sep 14, 2024
02254d4
set pixel format view usage for textures
SamoZ256 Sep 14, 2024
008c11c
allocate special buffers only when needed
SamoZ256 Sep 14, 2024
2961151
correct comment
SamoZ256 Sep 14, 2024
358567a
make a workaround for streamout with no fbo
SamoZ256 Sep 14, 2024
8ac90ce
use correct stages for buffer sync
SamoZ256 Sep 16, 2024
535107e
fix: invalid surface copy depth
SamoZ256 Sep 16, 2024
aa81070
refactor pixel formats
SamoZ256 Sep 16, 2024
f03c0a2
only set buffer offset if needed
SamoZ256 Sep 19, 2024
a460a5d
do vertex restride on the CPU
SamoZ256 Sep 21, 2024
7ad57f5
add the option to disable accurate barriers
SamoZ256 Sep 21, 2024
4b548f9
never disable accurate barriers for certain shaders
SamoZ256 Sep 21, 2024
8b68df0
use correct texture decoders
SamoZ256 Sep 22, 2024
3cf831d
do vertex restride on the GPU & don't over-sync
SamoZ256 Sep 23, 2024
3de2b03
only enable metal on apple platforms by default
SamoZ256 Sep 28, 2024
ed5f72a
include texture decoder in pixel format info
SamoZ256 Sep 28, 2024
f9e71e9
implement ABGR4 texture decoders
SamoZ256 Sep 28, 2024
810168b
use alternative texture decoders for unsupported formats
SamoZ256 Sep 28, 2024
8c30bc8
use RG8Unorm format if ABGR4Unorm is not available
SamoZ256 Sep 28, 2024
6c8cd5c
use private storage mode when possible
SamoZ256 Sep 30, 2024
b088ddc
implement output shaders
SamoZ256 Sep 30, 2024
28e553e
cache output shaders
SamoZ256 Sep 30, 2024
07cb8b8
set output shader uniforms
SamoZ256 Sep 30, 2024
778037f
remove useless print statement
SamoZ256 Sep 30, 2024
c65123b
use logging instead of printing
SamoZ256 Sep 30, 2024
5d01c77
rename hybrid pipeline to void vertex pipeline
SamoZ256 Sep 30, 2024
a3bfde8
remove old present pipelines
SamoZ256 Sep 30, 2024
94e8ed5
fetch vertices manually if needed
SamoZ256 Oct 1, 2024
f2096de
remove vertex restriding code
SamoZ256 Oct 1, 2024
50175fc
report manual vertex fetch draws per frame
SamoZ256 Oct 1, 2024
6dc8f9a
Merge pull request #5 from SamoZ256/metal-no-vertex-restride
SamoZ256 Oct 1, 2024
2fb4d83
don't set verticesPerInstance twice
SamoZ256 Oct 1, 2024
6ba6157
properly implement ABGR4 decoder
SamoZ256 Oct 1, 2024
ebcb62a
add: an option to turn fast math off
SamoZ256 Oct 2, 2024
6b784ad
log relevant settings
SamoZ256 Oct 2, 2024
756470f
remove CMake hacks
SamoZ256 Oct 9, 2024
bfd9059
remove the -g flag on debug builds
SamoZ256 Oct 9, 2024
bdfac96
implement async shader compilation
SamoZ256 Oct 10, 2024
03bc647
prepone compilation for rect geometry shaders
SamoZ256 Oct 10, 2024
641ef71
count compiled shaders properly
SamoZ256 Oct 10, 2024
fabaf16
Merge pull request #6 from SamoZ256/metal-async
SamoZ256 Oct 11, 2024
8b783e6
refactor pipeline cache
SamoZ256 Oct 13, 2024
e9e510d
add: base for pipeline caching
SamoZ256 Oct 14, 2024
6b47d4f
implement pipeline cache serializing
SamoZ256 Oct 15, 2024
cd21d95
refactor fbos
SamoZ256 Oct 15, 2024
944cc8b
store loaded pipelines
SamoZ256 Oct 15, 2024
79f5586
report pipeline compilation count
SamoZ256 Oct 15, 2024
d1c69e9
set shader just before compiling
SamoZ256 Oct 15, 2024
cbde7f9
force compile shaders if needed
SamoZ256 Oct 15, 2024
4dcb858
check if pipeline is eligible for serializing
SamoZ256 Oct 15, 2024
7d9194a
don't overshadow error
SamoZ256 Oct 15, 2024
8f2385a
use lcr instead of contextNew
SamoZ256 Oct 16, 2024
15eb6bb
fix: pipeline cache with mesh shaders
SamoZ256 Oct 19, 2024
295a6ed
only add pipeline to cache if compilation was attempted
SamoZ256 Oct 19, 2024
1750715
retrieve ps input table without using global variable
SamoZ256 Oct 19, 2024
b8021b6
fix: incorrect texture usages
SamoZ256 Oct 24, 2024
665eb23
fix: incorrect pipeline compilation time report
SamoZ256 Oct 28, 2024
bca32c4
refactor the way pipelines are stored
SamoZ256 Oct 28, 2024
4e3f94e
compile pipelines async
SamoZ256 Oct 28, 2024
7906733
don't compile certain pipelines async
SamoZ256 Oct 28, 2024
228fd34
Merge pull request #9 from SamoZ256/metal-async-pipelines
SamoZ256 Oct 29, 2024
85db0dc
cache all pipelines
SamoZ256 Oct 29, 2024
a61d0f0
Merge pull request #7 from SamoZ256/metal-pipeline-cache
SamoZ256 Oct 29, 2024
00256e5
only set blend color when changed
SamoZ256 Oct 29, 2024
e7ac199
make occlusion queries accumulate with draws
SamoZ256 Oct 30, 2024
7eb8508
fix: CI when metal is disabled
SamoZ256 Oct 31, 2024
927c52e
implement calculate texture lod
SamoZ256 Nov 1, 2024
6eb46ed
remove 'Vulkan' from cache loading message
SamoZ256 Nov 1, 2024
61de5a3
only log todo for vulkan output shader
SamoZ256 Nov 1, 2024
ed48fbf
Merge branch 'main' into metal
SamoZ256 Nov 1, 2024
ab41de4
use host memory instead of buffer cache when possible
SamoZ256 Nov 3, 2024
03d4e86
add an option to use the host memory instead of buffer cache
SamoZ256 Nov 3, 2024
b38ca6a
add an option to choose buffer cache type
SamoZ256 Nov 3, 2024
b008ace
Merge pull request #10 from SamoZ256/metal-host-buffer
SamoZ256 Nov 3, 2024
31c10bd
move fast math option to game profile
SamoZ256 Nov 3, 2024
e00d244
remove unused code
SamoZ256 Nov 3, 2024
fbea328
correctly report memory usage for host buffer cache
SamoZ256 Nov 3, 2024
a00d409
improve host and device shared buffer cache modes
SamoZ256 Nov 4, 2024
2e93b08
rename buffer cache type to buffer cache mode
SamoZ256 Nov 4, 2024
2790200
separate metal shader cache
SamoZ256 Nov 4, 2024
07c62d5
release texture upload buffer
SamoZ256 Nov 5, 2024
ea0a30b
handle special state 8 and 5
SamoZ256 Nov 5, 2024
552c490
implement occlusion query flush
SamoZ256 Nov 5, 2024
c46c821
refactor command buffers
SamoZ256 Nov 5, 2024
fc1580f
remove outdated todo notices
SamoZ256 Nov 5, 2024
798da38
Merge branch 'main' into metal
SamoZ256 Nov 6, 2024
763e50c
Merge branch 'main' into metal
SamoZ256 Nov 7, 2024
2c9f627
remove MetalLogging log type
SamoZ256 Nov 8, 2024
f264957
implement screenshots
SamoZ256 Nov 8, 2024
6897abd
fix: a typo in the attribute decoder
SamoZ256 Nov 8, 2024
f1db7d5
don't log texture allocation warnings
SamoZ256 Nov 9, 2024
02c36d9
add: blit todo notice
SamoZ256 Nov 9, 2024
97be576
cast texture slice to uint
SamoZ256 Nov 10, 2024
ed32feb
rint texture slice & explicitly specify texture lod
SamoZ256 Nov 10, 2024
a52095b
sync between command buffers
SamoZ256 Nov 10, 2024
326d344
cleanup the buffer allocator
SamoZ256 Nov 11, 2024
4b58ac1
check if verticesPerInstance uniform is used
SamoZ256 Nov 11, 2024
a1b9164
don't mask ps inputs with 0x7F
SamoZ256 Nov 11, 2024
6ea6ad3
use verticesPerInstance for rect vertex shaders
SamoZ256 Nov 11, 2024
5b7a0fa
warn about invalid primitive mode
SamoZ256 Nov 14, 2024
c9b18ef
don't transform depth to 0...1
SamoZ256 Nov 14, 2024
d732416
invalidate index buffers when a command buffer finishes
SamoZ256 Nov 14, 2024
38cbd3e
handle halfZ in the shader decompiler
SamoZ256 Nov 15, 2024
a721364
log instead of printf
SamoZ256 Nov 15, 2024
a81ee79
maximize concurrent shader compilation when loading shader cache
SamoZ256 Nov 15, 2024
2890819
fix: triangle fan index count
SamoZ256 Nov 16, 2024
8a80373
rework the command buffer system
SamoZ256 Nov 16, 2024
b486568
limit height to 1 for 1D textures
SamoZ256 Nov 19, 2024
b140984
fix: texture sample gradient errors
SamoZ256 Nov 19, 2024
c5bef60
save pipeline even if compilation failed
SamoZ256 Nov 19, 2024
7f5040a
don't release autoreleased objects
SamoZ256 Nov 19, 2024
cabf568
Merge branch 'main' into metal
SamoZ256 Nov 19, 2024
fda2f40
fix: output shader issues
SamoZ256 Nov 19, 2024
9095035
only set array length for texture arrays
SamoZ256 Nov 20, 2024
d9f857b
only set array length for texture view arrays
SamoZ256 Nov 20, 2024
732e3be
explicitly make integer constants int
SamoZ256 Nov 20, 2024
66ad59d
implement state 5 through a draw call
SamoZ256 Nov 22, 2024
00857b2
support arbitrary pixel formats for state 5
SamoZ256 Nov 22, 2024
14258cd
Revert "only set array length for texture view arrays"
SamoZ256 Nov 23, 2024
06f43b4
Merge branch 'metal' into metal-state5
SamoZ256 Nov 23, 2024
009dab8
remove useless includes
SamoZ256 Nov 23, 2024
4c54f4b
Merge pull request #12 from SamoZ256/metal-state5
SamoZ256 Nov 23, 2024
05b603d
fix: incorrect texture read coord type
SamoZ256 Nov 23, 2024
18bd24b
only set concurrent compilation if Metal 3 is available
SamoZ256 Nov 30, 2024
f710463
Merge pull request #13 from SamoZ256/test
SamoZ256 Nov 30, 2024
1752126
flush file streamer after each write
SamoZ256 Dec 1, 2024
ac96a1d
output all fragment inputs in the vertex shader
SamoZ256 Dec 8, 2024
434ffc9
clamp texture mip count
SamoZ256 Dec 10, 2024
74a1162
only set array length for texture view arrays
SamoZ256 Dec 10, 2024
137becb
do safety checks before clearing
SamoZ256 Dec 14, 2024
ba9a937
make clear message more descriptive
SamoZ256 Dec 14, 2024
d64e0c9
rework gpu selection
SamoZ256 Dec 15, 2024
c6e8b5c
fix: depth stencil state issues
SamoZ256 Dec 16, 2024
e8c7e9d
Merge branch 'main' into metal
SamoZ256 Dec 16, 2024
4281f6e
implement shader uint min max instructions
SamoZ256 Dec 16, 2024
fa004a3
add an option to preserve position invariance
SamoZ256 Dec 18, 2024
770d6cf
Revert "rework gpu selection"
SamoZ256 Dec 18, 2024
79290ea
display gpu name
SamoZ256 Dec 18, 2024
08ea28f
add an option to select gpu for metal
SamoZ256 Dec 18, 2024
967a740
implement point coord properly
SamoZ256 Dec 21, 2024
55d9436
fix typo in texture decoder
SamoZ256 Dec 22, 2024
40dab1e
fallback to transparent black for custom sampler border color
SamoZ256 Jan 4, 2025
1d8806c
add an option to capture GPU frame
SamoZ256 Jan 4, 2025
9a61e81
support saving GPU captures to a file
SamoZ256 Jan 4, 2025
813c52c
add gpu capture environment notice
SamoZ256 Jan 4, 2025
337ec6b
fix: GPU capture button not working
SamoZ256 Jan 4, 2025
6247b65
Merge pull request #14 from SamoZ256/metal-gpu-capture
SamoZ256 Jan 5, 2025
68aa405
Merge branch 'main' into metal
SamoZ256 Jan 7, 2025
e406885
implement framebuffer fetch
SamoZ256 Jan 8, 2025
68d328b
mask out color attachments
SamoZ256 Jan 8, 2025
27a31fe
clean up accurate barriers
SamoZ256 Jan 8, 2025
6d1d739
limit framebuffer fetch to 2D textures
SamoZ256 Jan 9, 2025
03ec231
don't bind textures when framebuffer fetched
SamoZ256 Jan 9, 2025
3fae686
remove the accurate barriers option
SamoZ256 Jan 9, 2025
a0239cb
check for framebuffer fetch support
SamoZ256 Jan 9, 2025
d9bf99c
fix: a typo in texture index
SamoZ256 Jan 9, 2025
4479584
support texture lod query for framebuffer fetch
SamoZ256 Jan 10, 2025
ca3fe96
fix: component indexing for framebuffer fetch
SamoZ256 Jan 10, 2025
217e2ed
check if pixel formats match for framebuffer fetch
SamoZ256 Jan 11, 2025
f4985c4
add an option to disable framebuffer fetch
SamoZ256 Jan 11, 2025
f686a7b
use as_type instead of bitCast in shaders
SamoZ256 Jan 12, 2025
159a10e
Merge pull request #16 from SamoZ256/metal-accurate-barriers
SamoZ256 Jan 12, 2025
0b15ad4
Merge branch 'metal' into metal-shader-compilation
SamoZ256 Jan 12, 2025
82c2d14
fix: device preference not getting loaded
SamoZ256 Jan 12, 2025
a28f9d6
fix: incorrect texture view pointers
SamoZ256 Jan 12, 2025
adf0b5c
Merge pull request #17 from SamoZ256/metal-shader-compilation
SamoZ256 Jan 12, 2025
388bbe2
release command buffers
SamoZ256 Jan 12, 2025
aec7a18
remove useless texture member variables
SamoZ256 Jan 12, 2025
21bc5f2
add texture copy block size workaround
SamoZ256 Jan 12, 2025
1e3a3ef
prepare for AIR cache
SamoZ256 Jan 15, 2025
3d84b78
support multiple components in type conversion
SamoZ256 Jan 15, 2025
5f339e4
Merge branch 'metal' into metal-air-cache
SamoZ256 Jan 15, 2025
a8da524
use precompiled shaders
SamoZ256 Jan 15, 2025
371c089
fix: AIR shaders not getting loaded properly
SamoZ256 Jan 15, 2025
f5eb184
check if shaders compiled successfully
SamoZ256 Jan 15, 2025
5af904b
delete unused metallibs
SamoZ256 Jan 15, 2025
0b1932c
compile shaders to AIR at runtime
SamoZ256 Jan 16, 2025
4026430
use the ENABLE_METAL macro
SamoZ256 Jan 16, 2025
ec24102
support metal shaders in graphics packs
SamoZ256 Jan 16, 2025
0bf245b
remove printf
SamoZ256 Jan 16, 2025
4160034
Merge branch 'metal' into metal-air-cache
SamoZ256 Jan 17, 2025
7700635
add AIR cache to cache paths
SamoZ256 Jan 17, 2025
21e7466
Merge pull request #18 from SamoZ256/metal-air-cache
SamoZ256 Jan 17, 2025
2a218d4
use depth mask when writing to depth
SamoZ256 Jan 17, 2025
6cf7f43
set AIR cache thread priority
SamoZ256 Jan 17, 2025
9934213
disable AIR cache
SamoZ256 Jan 17, 2025
800aae4
Merge branch 'main' into metal
SamoZ256 Jan 17, 2025
24ff85b
implement new index cache
SamoZ256 Jan 18, 2025
97b806f
rework buffer allocators
SamoZ256 Jan 18, 2025
d086eb3
fix: index buffer crashes
SamoZ256 Jan 18, 2025
6d6c04a
update debug overlay
SamoZ256 Jan 18, 2025
bf93f90
flush uploaded buffers
SamoZ256 Jan 18, 2025
f0cf614
Merge pull request #19 from SamoZ256/metal-allocator-rework
SamoZ256 Jan 18, 2025
86f3648
fix: sample compare component count
SamoZ256 Jan 18, 2025
225d11c
fix dfdx and dfdy component count
SamoZ256 Jan 18, 2025
9e94451
check for texture and color tile mode
SamoZ256 Jan 19, 2025
db8c7de
speed up framebuffer fetch lookup
SamoZ256 Jan 19, 2025
29cd698
skip bindings for framebuffer fetched textures
SamoZ256 Jan 19, 2025
709f24b
fix typo in framebuffer fetch
SamoZ256 Jan 19, 2025
29a4835
fix incorrect ABGR4 texture decoder
SamoZ256 Jan 20, 2025
7ad4d48
filter framebuffer fetch more
SamoZ256 Jan 20, 2025
bc6fb81
add auto option for position invariance
SamoZ256 Jan 23, 2025
d912837
add auto option for buffer cache mode
SamoZ256 Jan 23, 2025
3c3f254
fix auto buffer cache mode not showing
SamoZ256 Jan 23, 2025
9db0e92
change descriptions for some expert options
SamoZ256 Jan 23, 2025
2523c80
Merge pull request #20 from SamoZ256/metal-auto-options
SamoZ256 Jan 23, 2025
3d76e7c
Merge branch 'main' into metal
SamoZ256 Jan 23, 2025
a74c6ee
apply position invariance in Wonderful 101
SamoZ256 Jan 23, 2025
28bcaf0
fix some memory leaks
SamoZ256 Jan 23, 2025
4f0bc72
fix memory leaks
SamoZ256 Jan 24, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
4 changes: 4 additions & 0 deletions .gitmodules
Original file line number Diff line number Diff line change
Expand Up @@ -18,3 +18,7 @@
path = dependencies/imgui
url = https://github.com/ocornut/imgui
shallow = true
[submodule "dependencies/metal-cpp"]
path = dependencies/metal-cpp
url = https://github.com/bkaradzic/metal-cpp.git
shallow = true
20 changes: 18 additions & 2 deletions CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -25,7 +25,7 @@ if (ENABLE_VCPKG)
OUTPUT_VARIABLE is_vcpkg_shallow
OUTPUT_STRIP_TRAILING_WHITESPACE
)

if(is_vcpkg_shallow STREQUAL "true")
message(STATUS "vcpkg is shallow. Unshallowing it now...")
execute_process(
Expand Down Expand Up @@ -102,10 +102,20 @@ if (UNIX AND NOT APPLE)
option(ENABLE_BLUEZ "Build with Bluez support" ON)
endif()

if (APPLE)
set(ENABLE_METAL_DEFAULT ON)
else()
set(ENABLE_METAL_DEFAULT OFF)
endif()

option(ENABLE_OPENGL "Enables the OpenGL backend" ON)
option(ENABLE_VULKAN "Enables the Vulkan backend" ON)
option(ENABLE_METAL "Enables the Metal backend" ${ENABLE_METAL_DEFAULT})
option(ENABLE_DISCORD_RPC "Enables the Discord Rich Presence feature" ON)

if (ENABLE_METAL AND NOT APPLE)
message(FATAL_ERROR "Metal backend is only supported on Apple platforms")
endif()

# input backends
if (WIN32)
Expand Down Expand Up @@ -180,6 +190,12 @@ if (ENABLE_OPENGL)
find_package(OpenGL REQUIRED)
endif()

if (ENABLE_METAL)
include_directories(${CMAKE_SOURCE_DIR}/dependencies/metal-cpp)

add_definitions(-DENABLE_METAL=1)
endif()

if (ENABLE_DISCORD_RPC)
add_compile_definitions(ENABLE_DISCORD_RPC)
add_subdirectory(dependencies/discord-rpc EXCLUDE_FROM_ALL)
Expand All @@ -206,7 +222,7 @@ endif()

if (ENABLE_CUBEB)
if (NOT ENABLE_VCPKG)
find_package(cubeb)
find_package(cubeb)
endif()
if (NOT cubeb_FOUND)
option(BUILD_TESTS "" OFF)
Expand Down
1 change: 1 addition & 0 deletions dependencies/metal-cpp
Submodule metal-cpp added at a63bd1
65 changes: 64 additions & 1 deletion src/Cafe/CMakeLists.txt
Original file line number Diff line number Diff line change
Expand Up @@ -523,7 +523,70 @@ add_library(CemuCafe
)

if(APPLE)
target_sources(CemuCafe PRIVATE "HW/Latte/Renderer/Vulkan/CocoaSurface.mm")
target_sources(CemuCafe PRIVATE
HW/Latte/Renderer/Vulkan/CocoaSurface.mm
HW/Latte/Renderer/MetalView.mm
HW/Latte/Renderer/MetalView.h
)
endif()

if(ENABLE_METAL)
# TODO: sort alphabetically
target_sources(CemuCafe PRIVATE
HW/Latte/Renderer/Metal/MetalRenderer.cpp
HW/Latte/Renderer/Metal/MetalRenderer.h
HW/Latte/Renderer/Metal/MetalCommon.h
HW/Latte/Renderer/Metal/MetalCppImpl.cpp
HW/Latte/Renderer/Metal/MetalLayer.mm
HW/Latte/Renderer/Metal/MetalLayer.h
HW/Latte/Renderer/Metal/MetalLayerHandle.cpp
HW/Latte/Renderer/Metal/MetalLayerHandle.h
HW/Latte/Renderer/Metal/LatteToMtl.cpp
HW/Latte/Renderer/Metal/LatteToMtl.h
HW/Latte/Renderer/Metal/LatteTextureMtl.cpp
HW/Latte/Renderer/Metal/LatteTextureMtl.h
HW/Latte/Renderer/Metal/LatteTextureViewMtl.cpp
HW/Latte/Renderer/Metal/LatteTextureViewMtl.h
HW/Latte/Renderer/Metal/LatteTextureReadbackMtl.cpp
HW/Latte/Renderer/Metal/LatteTextureReadbackMtl.h
HW/Latte/Renderer/Metal/RendererShaderMtl.cpp
HW/Latte/Renderer/Metal/RendererShaderMtl.h
HW/Latte/Renderer/Metal/CachedFBOMtl.cpp
HW/Latte/Renderer/Metal/CachedFBOMtl.h
HW/Latte/Renderer/Metal/MetalAttachmentsInfo.cpp
HW/Latte/Renderer/Metal/MetalAttachmentsInfo.h
HW/Latte/Renderer/Metal/MetalBufferAllocator.cpp
HW/Latte/Renderer/Metal/MetalBufferAllocator.h
HW/Latte/Renderer/Metal/MetalMemoryManager.cpp
HW/Latte/Renderer/Metal/MetalMemoryManager.h
HW/Latte/Renderer/Metal/MetalOutputShaderCache.cpp
HW/Latte/Renderer/Metal/MetalOutputShaderCache.h
HW/Latte/Renderer/Metal/MetalPipelineCompiler.cpp
HW/Latte/Renderer/Metal/MetalPipelineCompiler.h
HW/Latte/Renderer/Metal/MetalPipelineCache.cpp
HW/Latte/Renderer/Metal/MetalPipelineCache.h
HW/Latte/Renderer/Metal/MetalDepthStencilCache.cpp
HW/Latte/Renderer/Metal/MetalDepthStencilCache.h
HW/Latte/Renderer/Metal/MetalSamplerCache.cpp
HW/Latte/Renderer/Metal/MetalSamplerCache.h
HW/Latte/Renderer/Metal/MetalVoidVertexPipeline.cpp
HW/Latte/Renderer/Metal/MetalVoidVertexPipeline.h
HW/Latte/Renderer/Metal/MetalQuery.cpp
HW/Latte/Renderer/Metal/MetalQuery.h
HW/Latte/Renderer/Metal/MetalPerformanceMonitor.h
HW/Latte/Renderer/Metal/UtilityShaderSource.h
)

target_sources(CemuCafe PRIVATE
HW/Latte/LegacyShaderDecompiler/LatteDecompilerEmitMSLAttrDecoder.cpp
HW/Latte/LegacyShaderDecompiler/LatteDecompilerEmitMSL.cpp
HW/Latte/LegacyShaderDecompiler/LatteDecompilerEmitMSLHeader.hpp
)

#target_link_libraries(CemuCafe PRIVATE
# "-framework Metal"
# "-framework QuartzCore"
#)
endif()

set_property(TARGET CemuCafe PROPERTY MSVC_RUNTIME_LIBRARY "MultiThreaded$<$<CONFIG:Debug>:Debug>")
Expand Down
13 changes: 11 additions & 2 deletions src/Cafe/CafeSystem.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -252,7 +252,16 @@ void InfoLog_PrintActiveSettings()
if (ActiveSettings::GetGraphicsAPI() == GraphicAPI::kVulkan)
{
cemuLog_log(LogType::Force, "Async compile: {}", GetConfig().async_compile.GetValue() ? "true" : "false");
if(!GetConfig().vk_accurate_barriers.GetValue())
if (!GetConfig().vk_accurate_barriers.GetValue())
cemuLog_log(LogType::Force, "Accurate barriers are disabled!");
}
else if (ActiveSettings::GetGraphicsAPI() == GraphicAPI::kMetal)
{
cemuLog_log(LogType::Force, "Async compile: {}", GetConfig().async_compile.GetValue() ? "true" : "false");
cemuLog_log(LogType::Force, "Fast math: {}", g_current_game_profile->GetFastMath() ? "true" : "false");
cemuLog_log(LogType::Force, "Buffer cache type: {}", g_current_game_profile->GetBufferCacheMode());
cemuLog_log(LogType::Force, "Position invariance: {}", g_current_game_profile->GetPositionInvariance());
if (!GetConfig().vk_accurate_barriers.GetValue())
cemuLog_log(LogType::Force, "Accurate barriers are disabled!");
}
cemuLog_log(LogType::Force, "Console language: {}", stdx::to_underlying(config.console_language.GetValue()));
Expand Down Expand Up @@ -1014,7 +1023,7 @@ namespace CafeSystem
{
// starting with Cemu 1.27.0 /vol/storage_mlc01/ is virtualized, meaning that it doesn't point to one singular host os folder anymore
// instead it now uses a more complex solution to source titles with various formats (folder, wud, wua) from the game paths and host mlc path

// todo - mount /vol/storage_mlc01/ with base priority to the host mlc?

// since mounting titles is an expensive operation we have to avoid mounting all titles at once
Expand Down
27 changes: 19 additions & 8 deletions src/Cafe/GameProfile/GameProfile.cpp
Original file line number Diff line number Diff line change
Expand Up @@ -127,7 +127,7 @@ bool gameProfile_loadIntegerOption(IniParser& iniParser, const char* optionName,
{
cemuLog_log(LogType::Force, "Value '{}' is out of range for option '{}' in game profile", *option_value, optionName);
return false;
}
}
}

template<typename T>
Expand Down Expand Up @@ -224,8 +224,11 @@ bool GameProfile::Load(uint64_t title_id)
gameProfile_loadIntegerOption(&iniParser, "graphics_api", &graphicsApi, -1, 0, 1);
if (graphicsApi.value != -1)
m_graphics_api = (GraphicAPI)graphicsApi.value;

gameProfile_loadEnumOption(iniParser, "accurateShaderMul", m_accurateShaderMul);
gameProfile_loadBooleanOption2(iniParser, "fastMath", m_fastMath);
gameProfile_loadEnumOption(iniParser, "bufferCacheMode2", m_bufferCacheMode);
gameProfile_loadEnumOption(iniParser, "positionInvariance2", m_positionInvariance);

// legacy support
auto option_precompiledShaders = iniParser.FindOption("precompiledShaders");
Expand Down Expand Up @@ -277,7 +280,7 @@ bool GameProfile::Load(uint64_t title_id)
void GameProfile::Save(uint64_t title_id)
{
auto gameProfileDir = ActiveSettings::GetConfigPath("gameProfiles");
if (std::error_code ex_ec; !fs::exists(gameProfileDir, ex_ec))
if (std::error_code ex_ec; !fs::exists(gameProfileDir, ex_ec))
fs::create_directories(gameProfileDir, ex_ec);
auto gameProfilePath = gameProfileDir / fmt::format("{:016x}.ini", title_id);
FileStream* fs = FileStream::createFile2(gameProfilePath);
Expand All @@ -292,22 +295,23 @@ void GameProfile::Save(uint64_t title_id)

#define WRITE_OPTIONAL_ENTRY(__NAME) if (m_##__NAME) fs->writeLine(fmt::format("{} = {}", #__NAME, m_##__NAME.value()).c_str());
#define WRITE_ENTRY(__NAME) fs->writeLine(fmt::format("{} = {}", #__NAME, m_##__NAME).c_str());
#define WRITE_ENTRY_NUMBERED(__NAME, __NUM) fs->writeLine(fmt::format("{} = {}", #__NAME #__NUM, m_##__NAME).c_str());

fs->writeLine("[General]");
WRITE_OPTIONAL_ENTRY(loadSharedLibraries);
WRITE_ENTRY(startWithPadView);

fs->writeLine("");


fs->writeLine("[CPU]");
WRITE_OPTIONAL_ENTRY(cpuMode);
WRITE_ENTRY(threadQuantum);

fs->writeLine("");

fs->writeLine("[Graphics]");
WRITE_ENTRY(accurateShaderMul);
WRITE_ENTRY(fastMath);
WRITE_ENTRY_NUMBERED(bufferCacheMode, 2);
WRITE_ENTRY_NUMBERED(positionInvariance, 2);
WRITE_OPTIONAL_ENTRY(precompiledShaders);
WRITE_OPTIONAL_ENTRY(graphics_api);
fs->writeLine("");
Expand All @@ -323,6 +327,7 @@ void GameProfile::Save(uint64_t title_id)

#undef WRITE_OPTIONAL_ENTRY
#undef WRITE_ENTRY
#undef WRITE_ENTRY_NUMBERED

delete fs;
}
Expand All @@ -337,6 +342,9 @@ void GameProfile::ResetOptional()

// graphic settings
m_accurateShaderMul = AccurateShaderMulOption::True;
m_fastMath = true;
m_bufferCacheMode = BufferCacheMode::Auto;
m_positionInvariance = PositionInvariance::Auto;
// cpu settings
m_threadQuantum = kThreadQuantumDefault;
m_cpuMode.reset(); // CPUModeOption::kSingleCoreRecompiler;
Expand All @@ -354,9 +362,12 @@ void GameProfile::Reset()
// general settings
m_loadSharedLibraries = true;
m_startWithPadView = false;

// graphic settings
m_accurateShaderMul = AccurateShaderMulOption::True;
m_fastMath = true;
m_bufferCacheMode = BufferCacheMode::Auto;
m_positionInvariance = PositionInvariance::Auto;
m_precompiledShaders = PrecompiledShaderOption::Auto;
// cpu settings
m_threadQuantum = kThreadQuantumDefault;
Expand All @@ -366,4 +377,4 @@ void GameProfile::Reset()
// controller settings
for (auto& profile : m_controllerProfile)
profile.reset();
}
}
6 changes: 6 additions & 0 deletions src/Cafe/GameProfile/GameProfile.h
Original file line number Diff line number Diff line change
Expand Up @@ -31,6 +31,9 @@ class GameProfile

[[nodiscard]] const std::optional<GraphicAPI>& GetGraphicsAPI() const { return m_graphics_api; }
[[nodiscard]] const AccurateShaderMulOption& GetAccurateShaderMul() const { return m_accurateShaderMul; }
[[nodiscard]] bool GetFastMath() const { return m_fastMath; }
[[nodiscard]] BufferCacheMode GetBufferCacheMode() const { return m_bufferCacheMode; }
[[nodiscard]] PositionInvariance GetPositionInvariance() const { return m_positionInvariance; }
[[nodiscard]] const std::optional<PrecompiledShaderOption>& GetPrecompiledShadersState() const { return m_precompiledShaders; }

[[nodiscard]] uint32 GetThreadQuantum() const { return m_threadQuantum; }
Expand All @@ -54,6 +57,9 @@ class GameProfile
// graphic settings
std::optional<GraphicAPI> m_graphics_api{};
AccurateShaderMulOption m_accurateShaderMul = AccurateShaderMulOption::True;
bool m_fastMath = true;
BufferCacheMode m_bufferCacheMode = BufferCacheMode::Auto;
PositionInvariance m_positionInvariance = PositionInvariance::Auto;
std::optional<PrecompiledShaderOption> m_precompiledShaders{};
// cpu settings
uint32 m_threadQuantum = kThreadQuantumDefault; // values: 20000 45000 60000 80000 100000
Expand Down
Loading