stuff/suyu - qilk git

stuff/suyu

mirror of https://git.suyu.dev/suyu/suyu.git synced 2024-11-28 06:16:23 -05:00

Author	SHA1	Message	Date
Fernando Sahmkow	f1e5314f1a	Add Swizzle Parameters to the DMA engine	2019-04-23 11:21:00 -04:00
Fernando Sahmkow	e140e2ebc6	Add Documentation Headers to all the GPU Engines	2019-04-23 08:44:52 -04:00
Fernando Sahmkow	021d28c9b8	Corrections and styling	2019-04-23 08:02:24 -04:00
bunnei	4fad91ca45	Merge pull request #2383 from ReinUsesLisp/aoffi-test gl_shader_decompiler: Disable variable AOFFI on unsupported devices	2019-04-22 22:14:02 -04:00
bunnei	9cab042674	Merge pull request #2420 from lioncash/audctl service/audctl: Implement GetTargetVolumeMin() and GetTargetVolumeMax()	2019-04-22 22:12:48 -04:00
Fernando Sahmkow	701ce1c9d0	Implement Maxwell3D Data Upload	2019-04-22 19:27:36 -04:00
Fernando Sahmkow	e4ff140b99	Introduce skeleton of the GPU Compute Engine.	2019-04-22 19:05:43 -04:00
Fernando Sahmkow	a91d3fc639	Revamp Kepler Memory to use a subegine to manage uploads	2019-04-22 18:50:56 -04:00
bunnei	b5889cbd6f	Merge pull request #2403 from FernandoS27/compressed-linear Support compressed formats on linear textures.	2019-04-22 17:09:42 -04:00
bunnei	68b707711a	Merge pull request #2411 from FernandoS27/unsafe-gpu GPU Manager: Implement ReadBlockUnsafe and WriteBlockUnsafe	2019-04-22 17:09:00 -04:00
bunnei	01100f8afd	Merge pull request #2400 from FernandoS27/corret-kepler-mem Implement Kepler Memory on both Linear and BlockLinear.	2019-04-22 16:47:05 -04:00
Fernando Sahmkow	4c36b78567	Rasterizer Cache: Use a temporal storage for Surfaces loading/flushing. This PR should heavily reduce memory usage since temporal buffers are no longer stored per Surface but instead managed by the Rasterizer Cache.	2019-04-21 11:42:07 -04:00
Fernando Sahmkow	623b2e4b8f	Corrections Half Float operations on const buffers and implement saturation.	2019-04-20 21:11:33 -04:00
bunnei	da0c3bc658	Merge pull request #2407 from FernandoS27/f2f Do some corrections in conversion shader instructions.	2019-04-20 00:42:34 -04:00
Fernando Sahmkow	788497fd9d	Allow picking a Compatibility Profile for OpenGL. This option allows picking the compatibility profile since a lot of bugs are fixed in it. We devs will use this option to easierly debug current problems in our Core implementation.:wq	2019-04-20 00:05:24 -04:00
bunnei	650d9b1044	Merge pull request #2409 from ReinUsesLisp/half-floats shader_ir/decode: Miscellaneous fixes to half-float decompilation	2019-04-19 21:31:52 -04:00
Fernando Sahmkow	08cdcc2871	Apply Position Y Direction	2019-04-19 20:49:00 -04:00
Fernando Sahmkow	a3eb91ed8c	RasterizerCache Redesign: Flush flushing is now responsability of children caches instead of the cache object. This change will allow the specific cache to pass extra parameters on flushing and will allow more flexibility.	2019-04-19 20:44:56 -04:00
Fernando Sahmkow	db4b2bc798	make ReadBlockunsafe and WriteBlockunsafe, ignore invalid pages.	2019-04-19 20:35:54 -04:00
bunnei	62c2404d3c	Merge pull request #2415 from lioncash/const kernel/wait_object: Make GetHighestPriorityReadyThread() a const member function	2019-04-19 19:12:02 -04:00
bunnei	cd38eadcc1	Merge pull request #2414 from lioncash/reorder yuzu/bootmanager: Resolve constructor initializer list warnings	2019-04-19 19:11:47 -04:00
bunnei	b6faea0dd2	Merge pull request #2421 from lioncash/svc-call kernel/svc: Name supervisor call 0x36	2019-04-19 19:10:20 -04:00
bunnei	40dc893c37	Merge pull request #2374 from lioncash/pagetable core: Reorganize boot order	2019-04-19 19:09:20 -04:00
ReinUsesLisp	d74cb16535	gl_state: Fix samplers memory corruption It was possible for "samplers" to be read without being written. This addresses that.	2019-04-19 17:07:56 -03:00
Lioncash	f8be3f55da	kernel/svc: Name supervisor call 0x36 This call was added to the SVC handlers in the 8.0.0 kernel, so we can finally give it a name.	2019-04-19 14:34:56 -04:00
Lioncash	19f8f86bdb	service/audctl: Implement GetTargetVolumeMin() and GetTargetVolumeMax() These two service functions are literally hardcoded to always return these values without any other error checking.	2019-04-18 16:39:54 -04:00
ReinUsesLisp	fbe8d1ceaa	video_core: Silent -Wswitch warnings	2019-04-18 15:54:39 -03:00
bunnei	83b830eb2f	Merge pull request #2397 from lioncash/thread-unused kernel/thread: Remove unused guest_handle member variable	2019-04-17 21:46:46 -04:00
bunnei	4294062516	Merge pull request #2318 from ReinUsesLisp/sampler-cache gl_sampler_cache: Port sampler cache to OpenGL	2019-04-17 21:45:56 -04:00
bunnei	5bd5140bde	Merge pull request #2348 from FernandoS27/guest-bindless Implement Bindless Textures on Shader Decompiler and GL backend	2019-04-17 20:59:49 -04:00
Zach Hilman	2adb226b26	web_browser: Make OpenPage non-const	2019-04-17 11:35:24 -04:00
Zach Hilman	8f8049e846	main: Add GMainWindow hooks for Error display	2019-04-17 11:35:24 -04:00
Zach Hilman	a04d36c5a4	main: Switch to AppletManager for frontend	2019-04-17 11:35:24 -04:00
Zach Hilman	76452cd5b3	qt: Add dialog implementation of Error applet	2019-04-17 11:35:24 -04:00
Zach Hilman	f6e2295055	general_backend: Move StubApplet and add backend PhotoViewer	2019-04-17 11:35:24 -04:00
Zach Hilman	80c9e4d3ab	general_frontend: Add frontend scaffold for PhotoViewer applet	2019-04-17 11:35:24 -04:00
Zach Hilman	d9f6715d45	frontend: Add frontend receiver for Error applet	2019-04-17 11:35:24 -04:00
Zach Hilman	de3cfb1d37	applets: Add Error applet Responsible for displaying error codes and messages	2019-04-17 11:35:24 -04:00
Zach Hilman	d273bec68f	applets: Port current applets to take frontend in constructor As opposed to using Core::System::GetInstance()	2019-04-17 11:35:24 -04:00
Zach Hilman	f7540157e4	web_browser: Make OpenPage const	2019-04-17 11:35:24 -04:00
Zach Hilman	ec0bc3061e	core: Remove specific applets in favor of AppletManager	2019-04-17 11:35:24 -04:00
Zach Hilman	6cea62b756	am: Delegate applet creation to AppletManager	2019-04-17 11:35:24 -04:00
Zach Hilman	e51d33f0ce	applets: Add AppletManager class to control lifetime	2019-04-17 11:35:24 -04:00
Lioncash	c268ffd831	kernel/thread: Unify wait synchronization types This is a holdover from Citra, where the 3DS has both WaitSynchronization1 and WaitSynchronizationN. The switch only has one form of wait synchronizing (literally WaitSynchonization). This allows us to throw out code that doesn't apply at all to the Switch kernel. Because of this unnecessary dichotomy within the wait synchronization utilities, we were also neglecting to properly handle waiting on multiple objects. While we're at it, we can also scrub out any lingering references to WaitSynchronization1/WaitSynchronizationN in comments, and change them to WaitSynchronization (or remove them if the mention no longer applies).	2019-04-17 09:30:56 -04:00
Lioncash	433b59c112	kernel/svc: Migrate svcCancelSynchronization behavior to a thread function The actual behavior of this function is slightly more complex than what we're currently doing within the supervisor call. To avoid dumping most of this behavior in the supervisor call itself, we can migrate this to another function.	2019-04-17 09:30:56 -04:00
Lioncash	6b2bece81f	kernel/wait_object: Make GetHighestPriorityReadyThread() a const member function This doesn't actually modify internal state of a wait object, so it can be const qualified.	2019-04-17 06:44:34 -04:00
Lioncash	54e9f9b6ed	yuzu/bootmanager: Replace unnnecessary constructor initializer list member of GGLContext The default constructor will always run, even when not specified, so this is redundant. However, the context member can indeed be initialized in the constructor initializer list.	2019-04-17 00:04:10 -04:00
Lioncash	52e43734c4	yuzu/bootmanager: Remove unnecessary includes This include isn't used anymore so it can be removed.	2019-04-16 23:52:57 -04:00
Lioncash	fbfc347351	yuzu/bootmanager: Resolve constructor initializer list warnings Resolves -Wreorder warnings. These will automatically be initialized to nullptr anyways, so these were redundant.	2019-04-16 23:49:26 -04:00
bunnei	0cfbd3325b	Merge pull request #2315 from ReinUsesLisp/severity-decompiler shader_ir/decode: Reduce the severity of common assertions	2019-04-16 22:21:19 -04:00
bunnei	21d498bc06	Merge pull request #2384 from ReinUsesLisp/gl-state-clear gl_rasterizer: Apply just the needed state on Clear	2019-04-16 22:19:03 -04:00
bunnei	be6b9e2d2b	Merge pull request #2405 from lioncash/qt CMakeLists: Define QT_USE_QSTRINGBUILDER for the Qt target	2019-04-16 22:17:09 -04:00
bunnei	1b83f255c2	Merge pull request #2092 from ReinUsesLisp/stg shader/memory: Implement STG and global memory flushing	2019-04-16 22:15:17 -04:00
bunnei	2654eb659e	Merge pull request #2376 from lioncash/const yuzu/configure_hotkey: Minor changes	2019-04-16 22:13:12 -04:00
bunnei	382fbbb198	Merge pull request #2401 from lioncash/guard common/{lz4_compression, zstd_compression}: Add missing header guards	2019-04-16 22:11:04 -04:00
Lioncash	819c21d99e	CMakeLists: Ensure we specify Unicode as the codepage on Windows Previously we were building with MBCS, which is pretty undesirable. We want the application to be Unicode-aware in general. Currently, we make the command line variant of yuzu use ANSI variants of the non-standard getopt functions that we link in for Windows, given we only have an ANSI option-set. We should really replace getopt with a library that we make all build types of yuzu link in, but this will have to do for the time being.	2019-04-16 21:23:34 -04:00
Lioncash	b6a87b422e	kernel/vm_manager: Remove usages of global system accessors Makes the dependency on the system instance explicit within VMManager's interface.	2019-04-16 20:02:50 -04:00
Fernando Sahmkow	d0082de82a	Implement IsBlockContinous This detects when a GPU Memory Block is not continous within host cpu memory.	2019-04-16 18:49:35 -04:00
Fernando Sahmkow	da91e6e4b6	Apply Const correctness to SwizzleKepler and replace u32 for size_t on iterators.	2019-04-16 12:00:46 -04:00
Fernando Sahmkow	13d626fc21	Use ReadBlockUnsafe for fetyching DMA CommandLists	2019-04-16 11:22:34 -04:00
Fernando Sahmkow	06d1c5a991	Document unsafe versions and add BlockCopyUnsafe	2019-04-16 10:11:35 -04:00
Fernando Sahmkow	6fc562a9aa	Use ReadBlockUnsafe for Shader Cache	2019-04-15 23:34:03 -04:00
Fernando Sahmkow	ef381e6924	Use ReadBlockUnsafe on TIC and TSC reading Use ReadBlockUnsafe on TIC and TSC reading as memory is never flushed from host GPU there.	2019-04-15 23:10:24 -04:00
Fernando Sahmkow	367704aa82	GPU MemoryManager: Implement ReadBlockUnsafe and WriteBlockUnsafe	2019-04-15 23:01:35 -04:00
Fernando Sahmkow	3e96c367bd	Use WriteBlock and ReadBlock.	2019-04-15 22:42:34 -04:00
bunnei	9186f76b07	Merge pull request #2382 from lioncash/table service: Update service function tables	2019-04-15 21:46:15 -04:00
bunnei	fc64156533	Merge pull request #2393 from lioncash/svc kernel/svc: Implement svcMapProcessCodeMemory/svcUnmapProcessCodeMemory	2019-04-15 21:43:56 -04:00
bunnei	a7c3275b8b	Merge pull request #2398 from lioncash/boost kernel/thread: Remove BoostPriority()	2019-04-15 21:42:16 -04:00
Fernando Sahmkow	bec28d692d	Implement Block Linear copies in Kepler Memory.	2019-04-15 21:22:16 -04:00
ReinUsesLisp	ef8245bed2	vk_shader_decompiler: Add missing operations	2019-04-15 21:32:57 -03:00
ReinUsesLisp	f43995ec53	shader_ir/decode: Fix half float pre-operations and remove MetaHalfArithmetic Operations done before the main half float operation (like HAdd) were managing a packed value instead of the unpacked one. Adding an unpacked operation allows us to drop the per-operand MetaHalfArithmetic entry, simplifying the code overall.	2019-04-15 21:16:10 -03:00
ReinUsesLisp	abcbcb1b2a	gl_shader_decompiler: Fix MrgH0 decompilation GLSL decompilation for HMergeH0 was wrong. This addresses that issue.	2019-04-15 21:16:10 -03:00
ReinUsesLisp	64613db605	shader_ir/decode: Implement half float saturation	2019-04-15 21:16:10 -03:00
ReinUsesLisp	90cbf89303	shader_ir/decode: Reduce severity of unimplemented half-float FTZ	2019-04-15 21:16:09 -03:00
ReinUsesLisp	acf618afbc	renderer_opengl: Implement half float NaN comparisons	2019-04-15 21:13:26 -03:00
ReinUsesLisp	ae46ad48ed	shader_ir: Avoid using static on heap-allocated objects Using static here might be faster at runtime, but it adds a heap allocation called before main.	2019-04-15 21:12:43 -03:00
Fernando Sahmkow	aa471274d9	Do some corrections in conversion shader instructions. Corrects encodings for I2F, F2F, I2I and F2I Implements Immediate variants of all four conversion types. Add assertions to unimplemented stuffs.	2019-04-15 19:16:27 -04:00
Lioncash	d28bb56c91	CMakeLists: Define QT_USE_QSTRINGBUILDER for the Qt target This is a compile definition introduced in Qt 4.8 for reducing the total potential number of strings created when performing string concatenation. This allows for less memory churn. This can be read about here: https://blog.qt.io/blog/2011/06/13/string-concatenation-with-qstringbuilder/ For a change that isn't source-compatible, we only had one occurrence that actually need to have its type clarified, which is pretty good, as far as transitioning goes.	2019-04-15 17:59:41 -04:00
Lioncash	3283aa1e20	svc: Specify handle value in thread's name Allows the handle to be seen alongside the entry point.	2019-04-15 15:56:18 -04:00
Fernando Sahmkow	8a099ac99f	Correct Kepler Memory on Linear Pushes.	2019-04-15 14:51:36 -04:00
Fernando Sahmkow	773d955dfa	Support compressed formats on linear textures.	2019-04-15 13:56:09 -04:00
Lioncash	4620ed47a3	common/{lz4_compression, zstd_compression}: Add missing header guards These two files were missing the #pragma once directive.	2019-04-15 13:00:08 -04:00
Fernando Sahmkow	bf561e4340	Correct Pitch in Fermi2D	2019-04-15 12:24:29 -04:00
Lioncash	e3566e6c1d	kernel/thread: Remove BoostPriority() This is a holdover from Citra that currently remains unused, so it can be removed from the Thread interface.	2019-04-15 06:59:19 -04:00
Lioncash	09caf8a756	kernel/thread: Remove unused guest_handle member variable This member variable is entirely unused. It was only set but never actually utilized. Given that, we can remove it to get rid of noise in the thread interface.	2019-04-14 06:06:06 -04:00
ReinUsesLisp	f15c59a164	gl_shader_decompiler: Use variable AOFFI on supported hardware	2019-04-14 05:13:19 -03:00
ReinUsesLisp	5c280e6ff0	shader_ir: Implement STG, keep track of global memory usage and flush	2019-04-14 00:25:32 -03:00
bunnei	1f4dfb3998	Merge pull request #2378 from lioncash/ro ldr: Minor amendments to IPC-related parameters	2019-04-13 22:16:10 -04:00
bunnei	c9454c8422	Merge pull request #2373 from FernandoS27/z32 Set Pixel Format to Z32 if its R32F and depth compare enabled, and Implement format ZF32_X24S8	2019-04-13 22:14:51 -04:00
bunnei	6088898b02	Merge pull request #2357 from zarroboogs/force-30fps-mode Add a toggle to force 30FPS mode	2019-04-13 22:14:04 -04:00
bunnei	a788c861bd	Merge pull request #2381 from lioncash/fs fsp_srv: Minor cleanup related changes	2019-04-13 22:09:58 -04:00
bunnei	ee2206a1b7	Merge pull request #2386 from ReinUsesLisp/shader-manager gl_shader_manager: Move code to source file and minor clean up	2019-04-13 22:09:27 -04:00
bunnei	065f83c6c3	Merge pull request #2017 from jroweboy/glwidget Frontend: Migrate to QOpenGLWindow and support shared contexts	2019-04-13 22:08:40 -04:00
bunnei	ee3f576495	Merge pull request #2389 from FreddyFunk/rename-gamedir ui_settings: Rename game directory variables	2019-04-13 22:06:51 -04:00
Lioncash	4d293bb5cb	kernel/svc: Implement svcUnmapProcessCodeMemory Essentially performs the inverse of svcMapProcessCodeMemory. This unmaps the aliasing region first, then restores the general traits of the aliased memory. What this entails, is: - Restoring Read/Write permissions to the VMA. - Restoring its memory state to reflect it as a general heap memory region. - Clearing the memory attributes on the region.	2019-04-12 21:56:03 -04:00
Lioncash	76a2465655	kernel/svc: Implement svcMapProcessCodeMemory This is utilized for mapping code modules into memory. Notably, the ldr service would call this in order to map objects into memory.	2019-04-12 21:55:50 -04:00
bunnei	b42595fa6b	Merge pull request #2391 from lioncash/scope common/scope_exit: Replace std::move with std::forward in ScopeExit()	2019-04-12 21:52:35 -04:00
bunnei	0faf7b17a1	Merge pull request #2392 from lioncash/swap common/swap: Minor cleanup and improvements to byte swapping functions	2019-04-12 21:52:16 -04:00
FreddyFunk	382722b9c4	Fix Clang Format	2019-04-12 16:40:35 +02:00
Lioncash	0d8ef2d3b9	common/swap: Improve codegen of the default swap fallbacks Uses arithmetic that can be identified more trivially by compilers for optimizations. e.g. Rather than shifting the halves of the value and then swapping and combining them, we can swap them in place. e.g. for the original swap32 code on x86-64, clang 8.0 would generate: mov ecx, edi rol cx, 8 shl ecx, 16 shr edi, 16 rol di, 8 movzx eax, di or eax, ecx ret while GCC 8.3 would generate the ideal: mov eax, edi bswap eax ret now both generate the same optimal output. MSVC used to generate the following with the old code: mov eax, ecx rol cx, 8 shr eax, 16 rol ax, 8 movzx ecx, cx movzx eax, ax shl ecx, 16 or eax, ecx ret 0 Now MSVC also generates a similar, but equally optimal result as clang/GCC: bswap ecx mov eax, ecx ret 0 ==== In the swap64 case, for the original code, clang 8.0 would generate: mov eax, edi bswap eax shl rax, 32 shr rdi, 32 bswap edi or rax, rdi ret (almost there, but still missing the mark) while, again, GCC 8.3 would generate the more ideal: mov rax, rdi bswap rax ret now clang also generates the optimal sequence for this fallback as well. This is a case where MSVC unfortunately falls short, despite the new code, this one still generates a doozy of an output. mov r8, rcx mov r9, rcx mov rax, 71776119061217280 mov rdx, r8 and r9, rax and edx, 65280 mov rax, rcx shr rax, 16 or r9, rax mov rax, rcx shr r9, 16 mov rcx, 280375465082880 and rax, rcx mov rcx, 1095216660480 or r9, rax mov rax, r8 and rax, rcx shr r9, 16 or r9, rax mov rcx, r8 mov rax, r8 shr r9, 8 shl rax, 16 and ecx, 16711680 or rdx, rax mov eax, -16777216 and rax, r8 shl rdx, 16 or rdx, rcx shl rdx, 16 or rax, rdx shl rax, 8 or rax, r9 ret 0 which is pretty unfortunate.	2019-04-12 00:07:39 -04:00

1 2 3 4 5 ...

9996 commits