1 | Mesa 23.1.0 Release Notes / 2023-05-10
|
---|
2 | ======================================
|
---|
3 |
|
---|
4 | Mesa 23.1.0 is a new development release. People who are concerned
|
---|
5 | with stability and reliability should stick with a previous release or
|
---|
6 | wait for Mesa 23.1.1.
|
---|
7 |
|
---|
8 | Mesa 23.1.0 implements the OpenGL 4.6 API, but the version reported by
|
---|
9 | glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
|
---|
10 | glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
|
---|
11 | Some drivers don't support all the features required in OpenGL 4.6. OpenGL
|
---|
12 | 4.6 is **only** available if requested at context creation.
|
---|
13 | Compatibility contexts may report a lower version depending on each driver.
|
---|
14 |
|
---|
15 | Mesa 23.1.0 implements the Vulkan 1.3 API, but the version reported by
|
---|
16 | the apiVersion property of the VkPhysicalDeviceProperties struct
|
---|
17 | depends on the particular driver being used.
|
---|
18 |
|
---|
19 | SHA256 checksum
|
---|
20 | ---------------
|
---|
21 |
|
---|
22 | ::
|
---|
23 |
|
---|
24 | a9dde3c76571c4806245a05bda1cceee347c3267127e9e549e4f4e225d92e992 mesa-23.1.0.tar.xz
|
---|
25 |
|
---|
26 |
|
---|
27 | New features
|
---|
28 | ------------
|
---|
29 |
|
---|
30 | - VK_EXT_pipeline_library_group_handles on RADV
|
---|
31 | - VK_EXT_image_sliced_view_of_3d on RADV/GFX10+
|
---|
32 | - VK_KHR_map_memory2 on ANV and RADV
|
---|
33 | - fullyCoveredFragmentShaderInputVariable on RADV/GFX9+
|
---|
34 | - VK_EXT_discard_rectangles version 2 on RADV
|
---|
35 | - VK_EXT_graphics_pipeline_library on RADV
|
---|
36 | - extendedDynamicState3ColorBlendEquation on RADV
|
---|
37 | - primitiveUnderestimation on RADV/GFX9+
|
---|
38 | - VK_KHR_fragment_shading_rate on RADV/GFX11
|
---|
39 | - VK_EXT_mesh_shader on RADV/GFX11
|
---|
40 | - RGP support on RADV/GFX11
|
---|
41 | - GL_NV_alpha_to_coverage_dither_control on r600/evergreen+
|
---|
42 |
|
---|
43 |
|
---|
44 | Bug fixes
|
---|
45 | ---------
|
---|
46 |
|
---|
47 | - [radeonsi] flickering debug chunk border lines in Minecraft
|
---|
48 | - radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues
|
---|
49 | - [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@[email protected] (23.1.0-rc4)
|
---|
50 | - vulkan/device_select: no way to select between GPUs of the same model due to bugs
|
---|
51 | - Intel/anv: Modifier problems running gamescope embedded
|
---|
52 | - radv: 7900 XTX hair flickering/rendering issues in VaM
|
---|
53 | - radv: cache crashing
|
---|
54 | - nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches
|
---|
55 | - radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream
|
---|
56 | - [regression] iris: unable to use driver as secondary GPU (primary AMD GPU)
|
---|
57 | - iris: steam doesn't render on dg2
|
---|
58 | - [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist
|
---|
59 | - Vulkancts clipping / tesselation tests trigger gpu hang on DG2
|
---|
60 | - Swaped fields in picture in vlc and mythtv if hw accel is on
|
---|
61 | - WGL: Assert assigns dwThreadId variable
|
---|
62 | - nine regression with r600 (bisected)
|
---|
63 | - [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround)
|
---|
64 | - radv: Jedi Fallen Order flickering & blocky plants
|
---|
65 | - nouveau: NV50 (NVAC) broken in latest master
|
---|
66 | - rusticl failed to build with rust-bindgen 0.65.0
|
---|
67 | - Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation
|
---|
68 | - vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830
|
---|
69 | - Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton)
|
---|
70 | - radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24
|
---|
71 | - radv: Gotham Knights GPU hang with Navi 24
|
---|
72 | - aco: s_load_dword with negative soffset cause GPU hang
|
---|
73 | - piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down
|
---|
74 | - overlay layer: unable to launch titles on steam
|
---|
75 | - radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer
|
---|
76 | - VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?)
|
---|
77 | - radv: possibly not setting state dirty bits correctly
|
---|
78 | - RADV: VRS attachment not working in specific scenario
|
---|
79 | - rusticl: invalid SPIR-V kernel causes panic
|
---|
80 | - [RADV] The Last Of Us Part 1: artifacting in the menu (with workaround)
|
---|
81 | - AMD va-api outputs corrupt encoding
|
---|
82 | - !20673 regressed \`dEQP-VK.wsi.xlib.surface.query_formats`
|
---|
83 | - aco: missing dependency on generated header
|
---|
84 | - zink: spirv validation errors with spirv 1.6
|
---|
85 | - freedreno/a6xx: Assertion \`view->rsc_seqno == rsc->seqno' failed.
|
---|
86 | - iris regression in map stride after import with gen9 parts
|
---|
87 | - anv: zink ADL failures
|
---|
88 | - Vulkan loader \`vk_common_GetPhysicalDeviceFormatProperties` fails to sanitize properties bits.
|
---|
89 | - Loading a model in PrusaSlicer 2.6.0-alpha5 crashes GNOME on radeonsi
|
---|
90 | - [glx][bisected][regression]Intel HD 3000 failing to create context on applications like Unity
|
---|
91 | - v3d: dEQP-GLES3.functional.texture.specification.teximage2d_pbo.rg32f_cube and similar fails when run together with other tests
|
---|
92 | - standalone glsl compiler not getting built with mingw
|
---|
93 | - !22191 broke test-docs-mr
|
---|
94 | - mesa: index buffer leaking
|
---|
95 | - RadeonSI: null dereference in amdgpu_cs_add_buffer, potential refcount mismatch, running BeyondAllReason
|
---|
96 | - NIR can't unroll any loop from nine
|
---|
97 | - Steel Division 2 - radv/gpu hang - bisected
|
---|
98 | - Turnip: VKD3D can't run due to lack of memory property flag
|
---|
99 | - eglCreateImageKHR, error: EGL_BAD_ALLOC (0x3003), message: "createImageFromDmaBufs failed" on AMD multi-gpu with explicit format modifiers
|
---|
100 | - radv: In the game Quake II RTX appeared artifacts at fresh mesa builds
|
---|
101 | - radv: Vampire: The Masquerade - Bloodline (Unofficial Patch) regression
|
---|
102 | - radeonsi broken for gcn1 card
|
---|
103 | - libgrl.a installed but not used?
|
---|
104 | - radv: crash compiling UE5 lumen hardware RT shader
|
---|
105 | - spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
|
---|
106 | - Panfrost T860 - broken system with latest mesa on gnome wayland jammy
|
---|
107 | - aco: unused vtmp_in_loop
|
---|
108 | - FTBFS: src/amd/llvm/ac_llvm_util.c:248:4: error: implicit declaration of function ‘LLVMAddIPSCCPPass’ (LLVM C interface removed upstream)
|
---|
109 | - vulkan: new generated physical_device_feature missing meson dependency
|
---|
110 | - Build broken on old-ish Python versions
|
---|
111 | - radv: Support fullyCoveredFragmentShaderInputVariable from VK_EXT_conservative_rasterization on RDNA2+
|
---|
112 | - radv,nir: dEQP-VK.ray_query.builtin.rayqueryterminate.* failures
|
---|
113 | - RFE: Use _mesa_is_foo(ctx) helpers more
|
---|
114 | - spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
|
---|
115 | - ci: infinite XDG_RUNTIME_DIR spam
|
---|
116 | - ci: XDG_RUNTIME_DIR spam
|
---|
117 | - [KBL] iris failures with dEQP-GLES3.functional.texture.compressed.astc.void_extent*
|
---|
118 | - glsl compiled error when the RHS of operator \`>>` is int64_t by enabling GL_ARB_gpu_shader_int64 extension
|
---|
119 | - turnip: inline uniforms regression
|
---|
120 | - QPainter fails to render multiple shapes with a brush set since Mesa 23.0
|
---|
121 | - eglSwapBuffers blocks in wayland when it's wl_surface_frame event is stolen.
|
---|
122 | - plasmashell sometimes hangs with mesa_glthread
|
---|
123 | - pps_device.h:23:11: error: ‘uint32_t’ does not name a type
|
---|
124 | - Build fails with llvm 17: llvm/ADT/Triple.h: No such file or directory
|
---|
125 | - nir: i2f32(i2i32(x@8)) isn't being collapsed to i2f32(x)
|
---|
126 | - zink-lvp no longer running tests
|
---|
127 | - radv: Immortals Fenyx Rising: Grass Flicker on R9 380X and Steam Deck
|
---|
128 | - radv: A Plague Tale: Requiem black "flash" on 7900XTX
|
---|
129 | - 7900 XTX: Graphical corruption / artifacts in Cyberpunk
|
---|
130 | - radeonsi draws spurious values to depth buffer
|
---|
131 | - Commit ccaaf8fe04c956d9f16f98b7f7fa69a2526283bc causes GPU ring timeouts on BONAIRE
|
---|
132 | - radv: CmdCopyQueryPoolResults broken for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT with queryCount > 1
|
---|
133 | - rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
|
---|
134 | - rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
|
---|
135 | - ci: Remove LAVA Gitlab section handler workaround
|
---|
136 | - System freeze when playing some h264 videos with VA-API on Rembrandt
|
---|
137 | - OpenGL crashes in X-Plane 11
|
---|
138 | - r600/TURKS: NIR Shader related errors on CLI with the game "A Hat in Time" and Gallium Nine
|
---|
139 | - agx: shifts aren't sound
|
---|
140 | - ci: build logs hidden
|
---|
141 | - fatal error: intel/dev/intel_wa.h: No such file or directory
|
---|
142 | - [Bisected] Regression: Project Zomboid renders black
|
---|
143 | - hasvk: Black pixels with 8xMSAA and fast clears on Intel(R) HD Graphics 4400 (HSW GT2)
|
---|
144 | - radv: GTA IV graphical artifacts on 7900XTX
|
---|
145 | - radv: Resident Evil Revelations 2 artifacts on 7900XTX with DCC
|
---|
146 | - radv: Prototype 2 black textures on RDNA 3 when DCC is enabled
|
---|
147 | - Mesa 23.0.0 crashes immediately with indirect rendering
|
---|
148 | - virpipe-on-gl: arb_enhanced_layouts\@matching_fp64_types crashes
|
---|
149 | - [RADV] Returnal - pistol muzzle flash fills whole screen (graphical artifact)
|
---|
150 | - ACO: dEQP-VK.binding_model.descriptor_buffer.multiple.graphics_geom_buffers1_sets3_imm_samplers hangs on NAVI10
|
---|
151 | - Build failures with recent lld
|
---|
152 | - r600,regression: Glitches on terrain with the NIR backend on Transport Fever 2
|
---|
153 | - [radeonsi] Regression with MSAA fix for Unreal / Unreal Tournament 99
|
---|
154 | - spirv: Switch Vulkan drivers to use \`deref_buffer_array_length`
|
---|
155 | - r600/TURKS: Crash of the game "A Hat in Time" with Gallium Nine and NIR path (third report)
|
---|
156 | - [gen9atom] Vulkan tests cause gpu hang: dEQP-VK.memory_model.*
|
---|
157 | - GL_SHADER_BINARY_FORMAT_SPIR_V is not added to the list of GL_SHADER_BINARY_FORMATS even if GL_ARB_gl_spirv is supported.
|
---|
158 | - mesa: "Fragmented" dynamic lights in IronWail with \`r_fsaamode 1` on
|
---|
159 | - [ANV/DG2] Vertex explosion in nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_gltf
|
---|
160 | - CUEtools FLACCL hit assert in rusticl
|
---|
161 | - Assertion Failed on Intel HD 5500 with Linux / Mesa 22.3.1 / OpenGL
|
---|
162 | - Rise of the Tomb Raider's Ambient Occlusion pass misrenders (swimming shadows)
|
---|
163 | - vk_enum_to_str: missing VkPipelineCreateFlags
|
---|
164 | - [glsl] [spirv] ssbo unsizied array not supported ?
|
---|
165 | - Creating a vulkan physical device on an AMD GPU causes following calls to drmModeAddFB to fail with ENOENT
|
---|
166 | - Minecraft: spec related compile errors
|
---|
167 | - mesa: _mesa_glthread_upload crash
|
---|
168 | - glthread: OpenGL submission blocks while swapping buffers
|
---|
169 | - glthread: Loading a shader cache in yuzu slows down with mesa_glthread=true
|
---|
170 | - Commit ""radeonsi: enable glthread by default"" (d6fabe49cd72fb) causes a regression in gstreamer gtkglsink element
|
---|
171 | - llvmpipe: linear rasterizer / depth bug
|
---|
172 | - radv: (Using mesh shader) NIR validation failed after nir_lower_io_to_scalar_early
|
---|
173 | - panfrost Mali-G31 glamor regression
|
---|
174 | - allwinner a64: DRM_IOCTL_MODE_CREATE_DUMB failed: Cannot allocate memory after some time of apps usage
|
---|
175 | - turnip: dEQP-VK.ubo.random.all_shared_buffer.48 slow
|
---|
176 | - wine + dxvk + Rise of the Tomb Raider crashes in Soviet Installation 20% with VK_ERROR_DEVICE_LOST
|
---|
177 | - Sometimes VLC player process gets stuck in memory after closure if video output used is Auto or OpenGL
|
---|
178 | - ci: Remove LAVA Gitlab section handler workaround
|
---|
179 | - kwin_wayland crashes involving dri2_create_drawable when Plasma starts and the llvmpipe driver from Mesa 23.0-rc3 and 23.0-rc4 is used
|
---|
180 | - turnip: no ubwc fast clear for depth on a618
|
---|
181 | - anv: VK_ACCESS_2_SHADER_READ_BIT doesn't seem to be handled correctly
|
---|
182 | - Vulkan WSI flags leak into NIR, breaking build on BSDs
|
---|
183 | - Iris corruptions in zoom calls
|
---|
184 | - Sampling with aux enabled with ISL_AUX_STATE_PASS_THROUGH seems broken on Tigerlake+
|
---|
185 | - anv: incorrect task shader payload
|
---|
186 | - radv: Hi-Fi Rush incorrectly rendering face shadows with DCC on 7900 XTX
|
---|
187 | - [iris] isl_calc_min_row_pitch seems incorrect on a750
|
---|
188 | - DG2: incorrect rendering in Sascha Willems raytracing callable demo
|
---|
189 | - turnip: conditional load/store hurts some workloads
|
---|
190 | - Some blackouts / rendering issues with RADV_PERFTEST=gpl in Battlefield 1 (DX11)
|
---|
191 | - radv/zink: ACO assert with DOOM2016
|
---|
192 | - Registered special XGE not unregistered
|
---|
193 | - draw_llvm.c:788:7: error: implicit declaration of function ‘LLVMContextSetOpaquePointers’
|
---|
194 | - asahi: Optimize lower_resinfo for cube maps
|
---|
195 | - Metro Exodus hits nir validation with a driver supporting raytracing.
|
---|
196 | - ANV Gen 9.5 swapchain corruption when using newer \`VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL` layout
|
---|
197 | - turnip: optimal bin layout
|
---|
198 | - piglit.spec.arb_shader_texture_image_samples.compiler.fs-image-samples_frag regression
|
---|
199 | - lavapipe assert fails on Windows
|
---|
200 | - zink: itoral-gl-terrain-demo rendering failure
|
---|
201 | - asahi: Implement a shader disk cache
|
---|
202 | - [ICL] Trine 4 trace causing GPU HANG
|
---|
203 | - radv: Segfault during createAccelerationStructure when the backing buffer is not bound to memory
|
---|
204 | - 7900 XTX stuck at 'compiling shaders' in Monster Hunter Rise
|
---|
205 | - radv: slow GPL (fast) link times
|
---|
206 | - libEGL warning: failed to get driver name for fd -1
|
---|
207 | - iris: Context incorrectly marked as guilty
|
---|
208 | - simple_mtx.h:34:12: fatal error: valgrind.h: No such file or directory
|
---|
209 | - [ANV] Commit 4ceaed78 causes misrendering on Cyberpunk 2077
|
---|
210 | - adding eglGetMscRateANGLE support for multiple monitors with different refresh rates
|
---|
211 | - Performance regression in Chromium WebGL when implement ANGLE_sync_control_rate with egl/x11
|
---|
212 | - ci: Ensure that the Intel/Freedreno trace pipelines only show up in relevant MR's
|
---|
213 | - anv: Performance issue with Vulkan on Wayland KWin
|
---|
214 | - Incorrect format conversion on big endian
|
---|
215 | - radv: State of Decay 2 character rendering regression
|
---|
216 | - aco_tests assembler.gfx11.vop12c_v128/gfx11 failure
|
---|
217 | - r600,regression: Loading of DOOM stuck at 0% with the NIR backend
|
---|
218 | - RADV: enabling TC-compat HTILE in GENERAL for compute queues is likely broken
|
---|
219 | - Confidential issue #8065
|
---|
220 | - VAAPI HEVC encode broken since 22.3
|
---|
221 | - GPU HANG: ecode 12:1:859ffffb (Resetting rcs0 for stopped heartbeat on rcs0) - reproducible
|
---|
222 | - zink: src/gallium/auxiliary/pipebuffer/pb_slab.c:138: Assertion failed: \`heap < slabs->num_heaps`
|
---|
223 | - [zink] Assertion \`heap < slabs->num_heaps' failed on Pascal (bisected)
|
---|
224 | - [RADV] Incorrect copies to/from compressed textures with mipmaps
|
---|
225 | - mesa_glthread=true and probably ANY id Tech 3 engine games, offroad...
|
---|
226 | - radeonsi: VRAM Leak/abnormally high usage in Minecraft mod pack
|
---|
227 | - nir/lower_blend: Bogus assert
|
---|
228 | - anv-tgl-vk: fails a multiple jobs after changing sharding
|
---|
229 | - radv CTS crashes since ebec42d799b22b7b3d06acd710f5687252446a06
|
---|
230 | - llvmpipe: dEQP-EGL programs.link failures.
|
---|
231 | - libmesa_util depends on gallium
|
---|
232 | - EGL report EGL_EXT_create_context_robustness with kms_dri drvier while can't create context with EGL_LOSE_CONTEXT_ON_RESET_EXT attribute.
|
---|
233 | - v3d: missing drm format modifier support on Raspberry Pi 4 required for mpv
|
---|
234 | - Return To Monkey Island black screen
|
---|
235 | - Return To Monkey Island black screen
|
---|
236 | - navi22 amdgpu: bo 000000002843d677 va 0x0800000400-0x08000005ff conflict with 0x0800000400-0x0800000600
|
---|
237 | - Ryzen 6800H laptop amdgpu: bo 00000000b1eb583a va 0x0800000200-0x08000003ff conflict with 0x0800000200-0x0800000400
|
---|
238 | - [RADV] [MISSED PERFORMANCE POTENTIAL] Vulkan not working when Color Depth is set to "16", but Vulkan works when Color Depth is set to "24"
|
---|
239 | - v3dv: f2f16_rtz lowering could be improved
|
---|
240 | - debug build compilation failed: inlining failed in call to ‘always_inline’ ‘src_is_ssa’: indirect function call with a yet undetermined callee
|
---|
241 | - radv: regression: broken UI rendering in Elden Ring
|
---|
242 | - radv: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR
|
---|
243 |
|
---|
244 |
|
---|
245 | Changes
|
---|
246 | -------
|
---|
247 |
|
---|
248 | Adam Jackson (22):
|
---|
249 |
|
---|
250 | - glx/dri3: Simplify protocol version tracking
|
---|
251 | - glx: Remove glx_context::screen
|
---|
252 | - glx: Remove a can't-happen NULL check
|
---|
253 | - glx: Remove support for glXGetDriverConfig for old drivers
|
---|
254 | - glx: Clean up some funny business from context bind/unbind
|
---|
255 | - glx: Reflow MakeContextCurrent a little
|
---|
256 | - glx: Check for initial "glX" first in glXGetProcAddress
|
---|
257 | - glx: Move 1.2 GLXPixmap code into glx_pbuffer.c
|
---|
258 | - glx: Inline a few single-use constant strings into their user
|
---|
259 | - glx: Fix drawable type inference in visual/fbconfig setup
|
---|
260 | - glx: Harmonize glXCreateGLXPixmap with glXCreatePixmap
|
---|
261 | - mesa: Fix extension table formatting
|
---|
262 | - mesa: Trivially advertise NV_generate_mipmap_sRGB
|
---|
263 | - wsi/x11: Make get_sorted_vk_formats handle varying channel widths
|
---|
264 | - wsi/x11: Infer the default surface format from the root window's visual
|
---|
265 | - wsi/x11: Support depth 16 visuals
|
---|
266 | - glx/dri: Use X/GLX error codes for our create_context_attribs
|
---|
267 | - dri: Validate more of the context version in validate_context_version
|
---|
268 | - glx/dri: Fix error generation for invalid GLX_RENDER_TYPE
|
---|
269 | - glx: Disable the indirect fallback in CreateContextAttribs
|
---|
270 | - glx: Fix error handling yet again in CreateContextAttribs
|
---|
271 | - mesa: Enable NV_texture_barrier in GLES2+
|
---|
272 |
|
---|
273 | Adam Stylinski (2):
|
---|
274 |
|
---|
275 | - glx: fix a macro being invoked with the wrong parameter name
|
---|
276 | - mesa: fix out of bounds stack access on big endian
|
---|
277 |
|
---|
278 | Alan Coopersmith (1):
|
---|
279 |
|
---|
280 | - util/disk_cache: Handle OS'es without d_type in struct dirent
|
---|
281 |
|
---|
282 | Alejandro Piñeiro (17):
|
---|
283 |
|
---|
284 | - vulkan/wsi: check if image info was already freed
|
---|
285 | - v3dv/format: remove unused v3dv_get_tex_return_size
|
---|
286 | - v3dv/pipeline: rename lower_tex_src_to_offset to lower_tex_src
|
---|
287 | - v3dv: pass alignment to v3dv_buffer_init
|
---|
288 | - v3dv/image: use 64-byte alingment for linear images if needed
|
---|
289 | - v3dv: skip two ycbcr tests
|
---|
290 | - broadcom/compiler: v3d_nir_lower_txf_ms doesn't need v3d_compile
|
---|
291 | - broadcom/compiler: treat PIPE_FORMAT_NONE as 32-bit formats for output type
|
---|
292 | - v3dv: enable shaderStorageImageReadWithoutFormat
|
---|
293 | - broadcom/compiler: fix indentation at v3d_nir_lower_image_load_store
|
---|
294 | - nir: track if var copies lowering was called
|
---|
295 | - radv: use shader_info->var_copies_lowered
|
---|
296 | - anv: use shader_info->var_copies_lowered
|
---|
297 | - v3d/v3dv: use shader_info->var_copies_lowered
|
---|
298 | - v3dv: handle ASPECT_MEMORY_PLANE aspect flags when getting plane number
|
---|
299 | - v3dv/debug: add debug option to disable TFU codepaths
|
---|
300 | - v3dv/pipeline: use pipeline depth bias enabled to fill up CFG packet
|
---|
301 |
|
---|
302 | Alexandros Frantzis (2):
|
---|
303 |
|
---|
304 | - egl/wayland: Fix destruction of event queue with proxies still attached.
|
---|
305 | - vulkan/wsi/wayland: Fix destruction of event queue with proxies still attached.
|
---|
306 |
|
---|
307 | Alyssa Rosenzweig (351):
|
---|
308 |
|
---|
309 | - nir/peephole_select: Allow load_preamble
|
---|
310 | - agx: Peephole select after opt_preamble
|
---|
311 | - asahi: Handle sampler->compare_mode
|
---|
312 | - panfrost: Don't use AFBC of sRGB luminance-alpha
|
---|
313 | - pan/bi: Fix incorrect compilation of fsat(reg.yx)
|
---|
314 | - pan/bi: Add a unit test for fsat(reg.yx)
|
---|
315 | - panfrost: Enable NV_primitive_restart on Valhall
|
---|
316 | - panfrost: Fix logic ops on Bifrost
|
---|
317 | - panfrost: Stop testing CAP_INT16
|
---|
318 | - panfrost: Remove PAN_MESA_DEBUG=deqp
|
---|
319 | - panfrost: Remove unused debug parameter
|
---|
320 | - panfrost: Fix clears with conditional rendering
|
---|
321 | - panfrost: Document render_condition_check contract
|
---|
322 | - nir: Add Midgard-specific fsin/fcos ops
|
---|
323 | - nir: Optimize vendored sin/cos the same way
|
---|
324 | - pan/mdg: Use special NIR ops for trig scaling
|
---|
325 | - pan/mdg: Scalarize LUT instructions in NIR
|
---|
326 | - pan/mdg: Remove MSGS debug
|
---|
327 | - mesa: Set info.separate_shader for ARB programs
|
---|
328 | - nir/lower_blend: Fix alpha=1 for RGBX format
|
---|
329 | - nir/lower_blend: Clamp blend factors
|
---|
330 | - nir/lower_blend: Fix SNORM logic ops
|
---|
331 | - nir/lower_blend: Avoid useless iand with logic ops
|
---|
332 | - nir/lower_blend: Don't do logic ops on pure float
|
---|
333 | - nir/lower_blend: Handle undefs in stores
|
---|
334 | - nir/lower_blend: No-op nir_color_mask if no mask
|
---|
335 | - asahi: Omit extra call to clock_gettime
|
---|
336 | - nir/opt_preamble: Treat \*size as an input
|
---|
337 | - nir/opt_preamble: Consider load_preamble as movable
|
---|
338 | - agx: Lower system values in NIR in the driver
|
---|
339 | - agx: Bump preamble_storage_size to 512
|
---|
340 | - agx: Centralize texture lowering
|
---|
341 | - asahi: Use non-UAPI specific BO create flags
|
---|
342 | - nir: Add a late texcoord replacement pass
|
---|
343 | - asahi: Run nir_lower_fragcolor during preprocessing
|
---|
344 | - asahi: Lower texcoords late
|
---|
345 | - panfrost: Implement GL_EXT_render_snorm on Bifrost+
|
---|
346 | - ail: Add layout->mipmapped_z input
|
---|
347 | - ail: Test mipmapped_z behaviour
|
---|
348 | - ail: Test 63x63 cube map
|
---|
349 | - asahi: Set layout->mipmapped_z for 3D textures
|
---|
350 | - asahi: Fix encoding of uniform size
|
---|
351 | - asahi: Strengthen agx_usc_uniform contract
|
---|
352 | - asahi/nir_lower_sysvals: Split large ranges
|
---|
353 | - asahi: Correct alignment for USC Uniform packets
|
---|
354 | - agx: Support uniform registers as LODs
|
---|
355 | - asahi: Use writeback when it looks beneficial
|
---|
356 | - asahi: Make STAGING resources linear
|
---|
357 | - asahi: Prefer blit-based texture transfer
|
---|
358 | - asahi: Implement nontrivial rasterizer discard
|
---|
359 | - asahi: DRY dirty tracking conditions
|
---|
360 | - asahi: Remove redundant tri merge disable bit
|
---|
361 | - asahi: Merge fragment control XML
|
---|
362 | - agx: Keep varyings forwarded to texture as fp32
|
---|
363 | - asahi: Don't use 16-bit inputs to 32-bit st_tile
|
---|
364 | - docs/asahi: Document clip distance varyings
|
---|
365 | - agx: Fix storing to varying arrays
|
---|
366 | - agx: Handle constant-offset in address matching
|
---|
367 | - asahi: Add XML for custom border colours
|
---|
368 | - agx/decode: Add a data parameter to stateful
|
---|
369 | - agx/decode: Handle extended samplers
|
---|
370 | - asahi: Implement custom border colours
|
---|
371 | - asahi: Fix delete_vs_state implementation
|
---|
372 | - asahi: Add compute kernel scaffolding
|
---|
373 | - asahi: Don't leak shader NIR
|
---|
374 | - asahi: Add hooks for SSBO and images
|
---|
375 | - asahi: Fake more caps for dEQP-GLES31
|
---|
376 | - asahi: Advertise seamless cube maps
|
---|
377 | - asahi: Stub out MSAA for dEQP
|
---|
378 | - asahi: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS
|
---|
379 | - asahi: Add compute batches
|
---|
380 | - asahi: Implement load_ssbo_address/get_ssbo_size
|
---|
381 | - asahi: Identify more compute-related XML
|
---|
382 | - agx: Implement compute ID intrinsics
|
---|
383 | - agx: Implement barriers
|
---|
384 | - nir/print: Extract get_location_str
|
---|
385 | - nir/print: Pretty-print I/O semantic locations
|
---|
386 | - nir/print: Pretty-print color0/1_interp
|
---|
387 | - agx: Allow uniform sources on phis
|
---|
388 | - agx: Run DCE twice
|
---|
389 | - agx: Lower uniform sources with a dedicated pass
|
---|
390 | - agx: Don't scalarize preambles in NIR
|
---|
391 | - nir/lower_clip: Only emit 1 discard
|
---|
392 | - tu,vulkan: Add common Get*OpaqueCaptureDescriptorDataEXT
|
---|
393 | - radv: Use common Get*OpaqueCaptureDescriptorDataEXT
|
---|
394 | - agx: Remove unused AGX_MAX_VARYINGS
|
---|
395 | - agx: Respect component in frag load_input
|
---|
396 | - agx: Fix AGX_MAX_CF_BINDINGS
|
---|
397 | - agx: Remove bogus gl_Position assertion
|
---|
398 | - agx: Implement load_helper_invocation
|
---|
399 | - agx: Write sample mask even with no colour output
|
---|
400 | - asahi: Submit batches that don't touch RTs
|
---|
401 | - asahi: Add XML for indirect dispatch
|
---|
402 | - asahi: Add XML for indirect draws
|
---|
403 | - asahi: Add XML for VDM memory barriers
|
---|
404 | - panvk: Take lock when tracing
|
---|
405 | - panvk: Fix varying linking
|
---|
406 | - panvk: Disable SNORM rendering
|
---|
407 | - asahi: Remove default=true on index list values
|
---|
408 | - asahi: Refactor index buffer upload for indirect
|
---|
409 | - asahi: Implement indirect draws
|
---|
410 | - panfrost: Fix some fields in v10.xml
|
---|
411 | - pan/decode: Add support for decoding CSF
|
---|
412 | - asahi: Vectorize background colour load
|
---|
413 | - panfrost: Disable CRC by default
|
---|
414 | - panfrost: Fix prim restart XML on Valhall
|
---|
415 | - nir: Augment raw_output_pan with IO_SEMANTICS+BASE
|
---|
416 | - pan/lower_framebuffer: Operate on lowered I/O
|
---|
417 | - nir/lower_blend: Don't touch store->dest
|
---|
418 | - nir/lower_blend: Don't handle gl_FragColor
|
---|
419 | - nir/lower_blend,agx,panfrost: Use lowered I/O
|
---|
420 | - asahi: Lower clip distances late
|
---|
421 | - asahi: Move agx_preprocess_nir to CSO create
|
---|
422 | - agx: Don't treat clip distances specially
|
---|
423 | - agx: Do more work in agx_preprocess_nir
|
---|
424 | - asahi: Fix rendering into mipmapped framebuffers
|
---|
425 | - agx: Lower offsets in NIR
|
---|
426 | - agx: Model and pack gathers
|
---|
427 | - agx: Implement gathers (nir_texop_tg4)
|
---|
428 | - docs/features: Sync Asahi with reality
|
---|
429 | - asahi: Advertise ARB_derivative_control
|
---|
430 | - asahi: Advertise ARB_texture_barrier
|
---|
431 | - agx: Model atomic instructions
|
---|
432 | - agx: Model local loads/stores
|
---|
433 | - agx: Disallow immediate bases to device_load
|
---|
434 | - agx: Pack global atomics
|
---|
435 | - agx: Pack local load/store instructions
|
---|
436 | - agx: Translate NIR atomics
|
---|
437 | - agx: Translate load/store_shared
|
---|
438 | - agx: Lower shared memory offsets to 16-bit
|
---|
439 | - agx: Pack local atomics
|
---|
440 | - agx: Implement b2b32
|
---|
441 | - agx: Handle group_memory_barrier
|
---|
442 | - agx: Add and use agx_nir_ssa_index helper
|
---|
443 | - agx: Handle ssa_undef as zero
|
---|
444 | - agx: Add agx_internal_format_supports_mask helper
|
---|
445 | - asahi: Implement color masks with masked stores
|
---|
446 | - asahi: Make shader-db work again
|
---|
447 | - panfrost: Use proper locations in blend shaders
|
---|
448 | - nir/lower_blend: Consume dual stores
|
---|
449 | - nir: Add nir_texop_lod_bias_agx
|
---|
450 | - asahi: Lower lod_bias_agx to uniform registers
|
---|
451 | - agx: Lower sampler LOD bias
|
---|
452 | - nir/lower_blend: Don't dereference null
|
---|
453 | - docs/feature: Mark ARB_sync as done on Asahi
|
---|
454 | - asahi/decode: Handle VDM barriers
|
---|
455 | - nir: Add nir_lower_helper_writes pass
|
---|
456 | - pan/mdg: Use nir_lower_helper_writes
|
---|
457 | - asahi: Advertise dual-source blending
|
---|
458 | - agx: Mask shifts in the backend
|
---|
459 | - agx: Fix 2D MSAA array texture register allocation
|
---|
460 | - asahi: Mark PIPE_FORMAT_NONE "supported"
|
---|
461 | - agx: Don't write sample mask from preambles
|
---|
462 | - agx: Add AGX_MESA_DEBUG=nopreamble option
|
---|
463 | - agx: Clean up after lowering address arithmetic
|
---|
464 | - agx: Factor out allows_16bit_immediate check
|
---|
465 | - agx: Inline 16-bit load/store offsets
|
---|
466 | - agx: Constify agx_print
|
---|
467 | - agx: Refactor vector creation
|
---|
468 | - agx: Use agx_emit_collect for st_tile
|
---|
469 | - agx: Don't print pre-optimization shader
|
---|
470 | - agx: Only lower int64 late
|
---|
471 | - asahi: Bump shader buffers
|
---|
472 | - asahi/meta: Use lowered I/O
|
---|
473 | - agx: Disable tri merging with side effects
|
---|
474 | - agx: Handle fragment shader side effects
|
---|
475 | - asahi: Rework system value lowering
|
---|
476 | - asahi: Wire up compute kernels
|
---|
477 | - nir/lower_tex: Add lower_index_to_offset
|
---|
478 | - pan/bi: Use lower_index_to_offset
|
---|
479 | - ir3: Use lower_index_to_offset
|
---|
480 | - nir/opt_barrier: Generalize to control barriers
|
---|
481 | - glsl/nir: Use scoped_barrier for control barrier
|
---|
482 | - pan/bi: Drop control_barrier handling
|
---|
483 | - pan/mdg: Drop control_barrier handling
|
---|
484 | - ir3: Drop non-scoped barrier handling
|
---|
485 | - gallivm: Drop non-scoped barrier handling
|
---|
486 | - agx/lower_address: Break on match
|
---|
487 | - agx/lower_address: Optimize "shift + constant"
|
---|
488 | - agx/lower_address: Handle large shifts
|
---|
489 | - agx/lower_address: Handle 8-bit load/store
|
---|
490 | - agx/lower_address: Fix handling of 64-bit immediates
|
---|
491 | - agx/lower_address: Handle 16-bit offsets
|
---|
492 | - agx: Assert that memory index is 32-bit reg
|
---|
493 | - agx: Fix clang-formatting
|
---|
494 | - agx: Pack indirect texture/sampler handles
|
---|
495 | - agx: Handle indirect texture/samplers
|
---|
496 | - asahi: Don't allow linear depth/stencil buffers
|
---|
497 | - asahi, agx: Implement dummy samplers
|
---|
498 | - asahi,agx: Implement buffer textures with gnarly NIR
|
---|
499 | - panfrost: Remove some unused definitions
|
---|
500 | - docs/panfrost: Move description of instancing
|
---|
501 | - panfrost: Don't use DECODE_FIXED16 for sample position
|
---|
502 | - panfrost: Handle fixed-point packing in GenXML
|
---|
503 | - panfrost: Add XML for framebuffer pointers
|
---|
504 | - panfrost: Use framebuffer pointer XML
|
---|
505 | - panfrost: Remove FBD tag enum from XML
|
---|
506 | - panfrost: Inline the last MALI_POSITIVE use
|
---|
507 | - panfrost: Remove MALI_POSITIVE macro
|
---|
508 | - pan/mdg: Remove reference to removed macro
|
---|
509 | - agx: Don't set lower_pack_split
|
---|
510 | - agx: Make partial DCE optional
|
---|
511 | - agx: Fix subdivision coalescing
|
---|
512 | - agx: Implement extract_[ui]16
|
---|
513 | - agx: Use nir_lower_mem_access_bit_sizes
|
---|
514 | - agx: Switch to scoped_barrier
|
---|
515 | - nir/lower_point_size: Use shader_instructions_pass
|
---|
516 | - ail: Restructure generated tests
|
---|
517 | - agx: Lower discard late
|
---|
518 | - util/prim_convert: Don't set index_bounds_valid
|
---|
519 | - pan/bi: Ignore signedness in vertex fetch
|
---|
520 | - panfrost: Identify "Base vertex offset" signedness
|
---|
521 | - panfrost: Assert that we don't see unsupported vertex formats
|
---|
522 | - panfrost: Defeature 24-bit textures
|
---|
523 | - panfrost: Handle null textures robustly
|
---|
524 | - panfrost/ci: Skip draw_buffers_indexed.random.* on Midgard
|
---|
525 | - panfrost/ci: Identify some Piglit flakes
|
---|
526 | - panfrost/ci: Add some Piglit skips
|
---|
527 | - panfrost/ci: Remove fbo-mrt-new-bind fail+flake
|
---|
528 | - panfrost: Note glDrawRangeElements underflow
|
---|
529 | - asahi: Fix occlusion query lifetime
|
---|
530 | - panfrost: Don't round up Midgard polygon list BOs
|
---|
531 | - panvk: Use vk_get_physical_device_features
|
---|
532 | - asahi: Use a dynarray for writers
|
---|
533 | - ci: Add clang-format to the amd64 container
|
---|
534 | - ci: Enforce clang-format for asahi
|
---|
535 | - gallium: Fix u_stream_outputs_for_vertices with QUADS
|
---|
536 | - nir/builder: Add nir_umod_imm helper
|
---|
537 | - blorp,anv,hasvk: Use umod_imm
|
---|
538 | - v3d,v3dv: Use udiv_imm/umod_imm
|
---|
539 | - radv: Use umod_imm
|
---|
540 | - ir3: Use umod_imm
|
---|
541 | - nir: Add Panfrost intrinsics to lower sample mask
|
---|
542 | - nir: Add Mali load_output taking converison
|
---|
543 | - panfrost: Use 0/~0 boolean for MSAA sysval
|
---|
544 | - pan/bi: Don't duplicate texture op cases
|
---|
545 | - pan/bi: Lower sample mask writes in NIR
|
---|
546 | - pan/bi: Lower load_output to make sysval explicit
|
---|
547 | - pan/bi: Allow specializing bifrost_nir_options by arch
|
---|
548 | - pan/bi: Lower gl_VertexID in NIR
|
---|
549 | - pan/bi: Remove bi_load_sysval
|
---|
550 | - pan/mdg: Use I/O semantics for MRT blend stores
|
---|
551 | - panfrost: Remove inputs->blend.rt
|
---|
552 | - panfrost: Remove unused inputs.nr_cbufs
|
---|
553 | - pan/bi: Only lower once
|
---|
554 | - pan/mdg: Only lower once
|
---|
555 | - pan/bi: Split out early preprocessing from late
|
---|
556 | - pan/mdg: Split out early preprocessing from late
|
---|
557 | - pan/lower_framebuffer: Only call for FS
|
---|
558 | - pan/lower_framebuffer: Use nir_shader_instructions_pass
|
---|
559 | - pan/blit: Lower load_sampler_lod_parameters_pan
|
---|
560 | - panfrost: Preprocess shaders in the driver
|
---|
561 | - pan/lower_framebuffer: Lower MSAA blend shaders
|
---|
562 | - panfrost: Lower clip_fs late
|
---|
563 | - panfrost: Lower texcoords late
|
---|
564 | - panfrost: Effectively lower gl_FragColor late
|
---|
565 | - panfrost: Preprocess shaders at CSO create time
|
---|
566 | - panfrost: Remove stale TODO
|
---|
567 | - panvk: Lower sysvals in NIR
|
---|
568 | - panvk: Don't use vec4 for vertex_instance_offsets
|
---|
569 | - panvk: Inline blend constants as syvals
|
---|
570 | - panfrost: Add NIR-based sysval lowering pass
|
---|
571 | - panfrost: Lower sysvals in GL
|
---|
572 | - panfrost: Move sysvals to GL driver struct
|
---|
573 | - panvk: Remove unused function
|
---|
574 | - panfrost: Move panfrost_sysvals to GL driver
|
---|
575 | - pan/bi: Export bifrost_nir_lower_load_output
|
---|
576 | - pan/bi: Call pan_nir_lower_zs_store late
|
---|
577 | - panvk: Lower blending late
|
---|
578 | - panfrost: Remove Midgard RSD fields from Bifrost
|
---|
579 | - asahi: Convert to SPDX headers
|
---|
580 | - mesa/st: Only set seamless for GLES3
|
---|
581 | - mesa/st: Normalize wrap modes for seamless cubes
|
---|
582 | - asahi: Don't lie about seamless cube maps
|
---|
583 | - panfrost: Print perf debug when flushing everything
|
---|
584 | - panfrost: Print perf debug on seqnum overflow
|
---|
585 | - panfrost: Don't redundantly call emit_const_buf
|
---|
586 | - panfrost: Mark packs as ALWAYS_INLINE
|
---|
587 | - panfrost: Don't update access with a single batch
|
---|
588 | - panfrost: Add a v9 fast path for no images
|
---|
589 | - panfrost: Clean up tiler calculations
|
---|
590 | - panfrost: Estimate vertex count for hier mask
|
---|
591 | - panfrost: Choose hierarchy masks by vertex count
|
---|
592 | - docs: Remove docs about macOS hardware drivers
|
---|
593 | - nv50,nvc0: Use u_pipe_screen_get_param_defaults
|
---|
594 | - panfrost: Always upload a workaround sampler
|
---|
595 | - pan/{mdg,bi}: Always use sampler 0 for txf
|
---|
596 | - panfrost: Unset TEXTURE_BUFFER_SAMPLERS
|
---|
597 | - gallium: Remove PIPE_CAP_TEXTURE_BUFFER_SAMPLER
|
---|
598 | - docs/gallium: Note samplers are not used for txf
|
---|
599 | - nir/print: Don't print sampler_index for txf
|
---|
600 | - asahi: Support more renderable formats
|
---|
601 | - agx: DCE even with noopt
|
---|
602 | - agx: Assert that we don't overflow registers
|
---|
603 | - agx: Constify agx_{read,write}_registers
|
---|
604 | - agx: Don't allow uniform source to local_atomic
|
---|
605 | - agx: Don't destroy usub_sat with constant
|
---|
606 | - asahi: Add perf debug for generate_mipmap
|
---|
607 | - asahi: Add perf debug for shader variants
|
---|
608 | - agx: Set loads_varying accurately
|
---|
609 | - agx: Add helper for calculating occupancy
|
---|
610 | - asahi/decode: Remove agxdecode_dump_bo
|
---|
611 | - asahi/decode: Print VDM barriers
|
---|
612 | - asahi: Set PIPE_CAP_LOAD_CONSTBUF
|
---|
613 | - agx: Coalesce more collects
|
---|
614 | - agx: Don't overallocate registers
|
---|
615 | - asahi: Honour sampler count
|
---|
616 | - asahi: Implement null textures
|
---|
617 | - asahi: Lower 1D to 2D
|
---|
618 | - asahi: Dirty track depth bias uploads
|
---|
619 | - asahi: Clamp texture buffer sizes
|
---|
620 | - agx: Tease apart some sample_mask packing magic
|
---|
621 | - agx: Rename writeout to wait_pix
|
---|
622 | - agx: Make signal_pix instructions explicit
|
---|
623 | - vulkan: Add common features2_to_features
|
---|
624 | - radv: Use vk_features2_to_features
|
---|
625 | - v3dv: Use vk_features2_to_features
|
---|
626 | - lavapipe: Use vk_features2_to_features
|
---|
627 | - pvr: Use vk_features2_to_features
|
---|
628 | - anv,hasvk: Use vk_features2_to_features
|
---|
629 | - tu: Use vk_features2_to_features
|
---|
630 | - nir: Combine if_uses with instruction uses
|
---|
631 | - nir/opt_ray_queries: Don't use list_length
|
---|
632 | - nir/opt_loop_unroll: Avoid list_length
|
---|
633 | - nir: Remove 2nd argument from nir_before_src
|
---|
634 | - nir/validate: Don't treat if-uses specially
|
---|
635 | - dxil: Avoid list_length
|
---|
636 | - nir: Reduce indirection
|
---|
637 | - nir: Factor out nir_src_rewrite_ssa helper
|
---|
638 | - nir: Use nir_src_rewrite_ssa
|
---|
639 | - dxil: Use nir_src_rewrite_ssa
|
---|
640 | - nir: Remove nir_if_rewrite_condition_ssa
|
---|
641 | - nir/repair_ssa: Refactor some use handling
|
---|
642 | - nir/validate: Only walk uses once
|
---|
643 | - mailmap: Update my e-mail
|
---|
644 | - panfrost: Symlink gallium .clang-format to common
|
---|
645 | - panfrost/winsys: Add .clang-format for winsys folder
|
---|
646 | - panfrost/winsys: Clang-format
|
---|
647 | - pan/decode: Move comment out of designated initializer
|
---|
648 | - panfrost: Re-run clang-format
|
---|
649 | - panvk: Clang-format
|
---|
650 | - ci: Run clang-format on panfrost
|
---|
651 | - mesa/st: Set uses_sample_shading when forcing per-sample
|
---|
652 | - nir/lower_blend: Set uses_fbfetch_output conservatively
|
---|
653 | - nir/lower_blend: Enable per-sample shading
|
---|
654 | - pan/bi: Lower swizzles for 8-bit CSEL
|
---|
655 | - pan/bi: Respect swizzles for more vector ops
|
---|
656 | - pan/bi: Use nir_lower_mem_access_bit_sizes
|
---|
657 | - panfrost: Allocate shared memory in OpenCL
|
---|
658 | - pan/decode: Print compute job payloads
|
---|
659 | - asahi: Fix disk cache disable with AGX_MESA_DEBUG
|
---|
660 |
|
---|
661 | Amber (15):
|
---|
662 |
|
---|
663 | - util/u_trace: pass utrace context to marker functions.
|
---|
664 | - freedreno: add support for markers.
|
---|
665 | - ir3, isaspec: add raw instruction to assembler/disassembler.
|
---|
666 | - ir3: support texture and sampler index with offsets
|
---|
667 | - nir: support lowering nir_intrinsic_image_samples to a constant load
|
---|
668 | - ir3: use lower_image_samples_to_one
|
---|
669 | - intel/compiler: use lower_image_samples_to_one
|
---|
670 | - freedreno: make sure depth/stencil layouts are always tiled
|
---|
671 | - freedreno: use A6XX_GRAS_SC_CNTL_SINGLE_PRIM_MODE with fb readback
|
---|
672 | - gallium: make BlendCoherent usable from gallium drivers
|
---|
673 | - freedreno: use blendcoherent to set FLUSH_PER_OVERLAP
|
---|
674 | - freedreno: check for conditional rendering in launch_grid
|
---|
675 | - nir: allow nir_lower_fb_read to support multiple render targets
|
---|
676 | - nir: Add memory coherency information to shaders.
|
---|
677 | - freedreno, nir, ir3: implement GL_EXT_shader_framebuffer_fetch
|
---|
678 |
|
---|
679 | Andres Calderon Jaramillo (1):
|
---|
680 |
|
---|
681 | - r600: Report multi-plane formats as unsupported
|
---|
682 |
|
---|
683 | André Almeida (2):
|
---|
684 |
|
---|
685 | - radv: Implement vk.check_status
|
---|
686 | - winsys/amdgpu: Fix amdgpu_cs_query_reset_state2 error log
|
---|
687 |
|
---|
688 | Antonio Gomes (11):
|
---|
689 |
|
---|
690 | - rusticl: Enabling reading/writing for images created from buffers
|
---|
691 | - rusticl: Enabling image fill for images created from buffers
|
---|
692 | - rusticl: Enable copy for images created from buffers
|
---|
693 | - rusticl: Enable mapImage for images created from buffers
|
---|
694 | - gallium, rusticl: Add tex2d_from_buf in image_view and sampler_view
|
---|
695 | - mesa/st, nine, nouveau: Fix uninitialized pipe_sampler_view structs
|
---|
696 | - lvmpipe/cs: Add support for 2d images created from buffers
|
---|
697 | - gallium: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
|
---|
698 | - rusticl: Implement spec for cl_khr_image2d_from_buffer
|
---|
699 | - llvmpipe: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
|
---|
700 | - iris: Add support for 2d images created from buffers
|
---|
701 |
|
---|
702 | Anuj Phogat (3):
|
---|
703 |
|
---|
704 | - anv: implement TES distribution mode WA 22012785325
|
---|
705 | - iris: implement TES distribution mode WA 22012785325
|
---|
706 | - intel/genxml/125: Add preferred SLM allocation size field
|
---|
707 |
|
---|
708 | Asahi Lina (43):
|
---|
709 |
|
---|
710 | - asahi: Split off common BO code into its own file
|
---|
711 | - asahi: Split off macOS support into its own file
|
---|
712 | - asahi: Refuse to transfer out-of-bounds mip levels
|
---|
713 | - meson: Fix Asahi build on macOS
|
---|
714 | - asahi: Fix shader key cloning overreads
|
---|
715 | - asahi: Do not use memctx for pools / meta cache
|
---|
716 | - asahi: Drop agx_device.memctx
|
---|
717 | - asahi: Only apply FS lowerings to fragment shaders
|
---|
718 | - asahi: Add BO_SHAREABLE flag
|
---|
719 | - asahi: Add readonly BO flag
|
---|
720 | - asahi: Identify USC cache invalidate
|
---|
721 | - asahi: Flush USC caches on the first draw
|
---|
722 | - asahi: Drop macOS backend
|
---|
723 | - asahi: Add nocluster,sync,stats debug flags
|
---|
724 | - asahi: Align device submission API with upcoming UAPI
|
---|
725 | - asahi: Implement Linux driver scaffolding, sans UAPI
|
---|
726 | - asahi: Add APIs for DMA-BUF sync file import/export
|
---|
727 | - asahi: Add agx_debug_fault() helper
|
---|
728 | - asahi: Add result buffer to context/batches
|
---|
729 | - asahi: Add agx_bo_mmap() calls to transfer path
|
---|
730 | - asahi: Pull device name from device struct
|
---|
731 | - asahi: Do not overread user index buffers
|
---|
732 | - asahi: Fix scissor culling check when out of bounds for FB/viewport
|
---|
733 | - asahi: Fix device fd leak in agx_close_device
|
---|
734 | - asahi: Destroy the renderonly context on screen destroy
|
---|
735 | - asahi: clang-format the world again
|
---|
736 | - asahi: Assert on TIB strides > 64
|
---|
737 | - asahi: Support importing sync objects on BO export
|
---|
738 | - asahi: Make agx_flush_resource reallocate non-shareable resources
|
---|
739 | - asahi: Extend batch tracking for explicit sync
|
---|
740 | - Revert "asahi: Advertise dual-source blending"
|
---|
741 | - asahi: Make agx_alloc_staging() take a screen instead of a context
|
---|
742 | - asahi: Enable glthread
|
---|
743 | - asahi: Locate low VA BOs correctly
|
---|
744 | - asahi: Fix style nits
|
---|
745 | - asahi: Implement valid buffer range tracking
|
---|
746 | - asahi: Make BO import path failures more robust
|
---|
747 | - asahi: Add a helper macro for debug/error messages
|
---|
748 | - asahi: Add resource debugging
|
---|
749 | - asahi: Print reasons why compression is disabled
|
---|
750 | - asahi: Fix compressed ZS support
|
---|
751 | - asahi: Flip kmsro around to allocate on the GPU
|
---|
752 | - asahi: Allow explicit non-LINEAR modifiers for scanout
|
---|
753 |
|
---|
754 | Axel Davy (1):
|
---|
755 |
|
---|
756 | - frontend/nine: Fix num_textures count
|
---|
757 |
|
---|
758 | Bas Nieuwenhuizen (26):
|
---|
759 |
|
---|
760 | - aco: Pass correct number of coords to Vega 1D LOD instruction.
|
---|
761 | - radv: Strictly limit alignment needed within a descriptor set.
|
---|
762 | - radv: Reduce descriptor pool allocation for alignment.
|
---|
763 | - radv: Set FDCC_CONTROL SAMPLE_MASK_TRACKER_WATERMARK
|
---|
764 | - radv: Shift left the tile swizzle more on GFX11.
|
---|
765 | - nir: Apply a maximum stack depth to avoid stack overflows.
|
---|
766 | - radv: Add helper to hash stages.
|
---|
767 | - radv: Hash group handles as part of RT pipeline key.
|
---|
768 | - radv: Use provided handles for switch cases in RT shaders.
|
---|
769 | - radv: Use group handles based on shader hashes.
|
---|
770 | - radv: Implement & expose VK_EXT_pipeline_library_group_handles.
|
---|
771 | - Update my mailmap aliases
|
---|
772 | - ac/surface,radv: Avoid pitch weirdness if image not used for rendertarget.
|
---|
773 | - ac/surface: Only allow stencil pitch adjustment for mipmaps.
|
---|
774 | - ac/surface,radv: Opt out of stencil adjust.
|
---|
775 | - util: Add aligned int64_t types for x86(non 64).
|
---|
776 | - util/disk_cache: Align atomic size.
|
---|
777 | - radv: Align atomic values.
|
---|
778 | - radv: Reserve space in framebuffer emission.
|
---|
779 | - radv: Reserve space in various streamout functions.
|
---|
780 | - radv: Reserve space in conditional rendering functions.
|
---|
781 | - radv: Reserve space in si_cs_emit_cache_flush.
|
---|
782 | - radv: Reserve space for updating DCC metadata.
|
---|
783 | - radv: Reserve space for fast clear related writes.
|
---|
784 | - radv: Reserve space for indirect descriptor set address writes.
|
---|
785 | - radv: Move all the dirty flags from TES binding to TCS binding.
|
---|
786 |
|
---|
787 | Benjamin Cheng (1):
|
---|
788 |
|
---|
789 | - radv: initialize cmd_buffer upload list earlier
|
---|
790 |
|
---|
791 | Boyuan Zhang (6):
|
---|
792 |
|
---|
793 | - radeonsi/vcn: check fence before destroying dpb
|
---|
794 | - radeonsi/vcn: check fence before destroying decoder
|
---|
795 | - radeonsi/vcn: validate fence handle before using it
|
---|
796 | - virgl/video: disable decoder fence
|
---|
797 | - virgl: add more formats to conv table
|
---|
798 | - frontends/va: check decoder in va surface call
|
---|
799 |
|
---|
800 | Brian Paul (9):
|
---|
801 |
|
---|
802 | - anv: add a third memory type for LLC configuration
|
---|
803 | - llvmpipe: do additional checks in lp_state_fs_analysis.c for linear shaders
|
---|
804 | - llvmpipe: remove debug printf spam in lp_setup_wait_empty_scene()
|
---|
805 | - gallium/xlib: call fence_finish() in XMesaSwapBuffers()
|
---|
806 | - llvmpipe: fix ps invocations query bug
|
---|
807 | - llvmpipe: rename some vars related to occlusion query and ps invocations
|
---|
808 | - llvmpipe: s/tabs/spaces/
|
---|
809 | - llvmpipe: s/unsigned/enum pipe_query_type/
|
---|
810 | - llvmpipe: clean-up llvmpipe_get_query_result()
|
---|
811 |
|
---|
812 | Błażej Szczygieł (1):
|
---|
813 |
|
---|
814 | - glx: Fix glXGetFBConfigFromVisualSGIX
|
---|
815 |
|
---|
816 | Caio Oliveira (26):
|
---|
817 |
|
---|
818 | - glsl: Account for unsized arrays in NIR linker
|
---|
819 | - hasvk: Update driver name in debug information
|
---|
820 | - intel: Add extra zeros at the end of debug identifiers
|
---|
821 | - iris, crocus: Align workaround address to 32B
|
---|
822 | - anv, hasvk: Align workaround address to 32B
|
---|
823 | - nir: Add nir_intrinsic_rotate
|
---|
824 | - nir/lower_subgroups: Add option lower_rotate_to_shuffle
|
---|
825 | - spirv: Implement SPV_KHR_subgroup_rotate
|
---|
826 | - nir: Support use_scoped_barrier in nir_lower_atomics_to_ssbo
|
---|
827 | - microsoft/compiler: Handle scoped barrier in Tess splitting
|
---|
828 | - gallivm: Fix handling of nir_intrinsic_scoped_barrier
|
---|
829 | - glsl: Implement use_scoped_barrier option for lowering memory barriers
|
---|
830 | - intel/compiler: Mark various memory barriers intrinsics unreachable
|
---|
831 | - pan/compiler: Fix handling of nir_intrinsic_scoped_barrier
|
---|
832 | - pan/midgard: Handle nir_intrinsic_scoped_barrier in Midgard compiler
|
---|
833 | - panfrost: Use NIR scoped barriers instead of memory barriers
|
---|
834 | - spirv: Don't specify nir_var_uniform or nir_var_mem_ubo in barriers
|
---|
835 | - spirv/tests: Subclass spirv_test helper to namespace the tests
|
---|
836 | - spirv/tests: Add script to generate C array from SPIR-V source
|
---|
837 | - spirv/tests: Parametrize stage in get_nir() helper
|
---|
838 | - spirv/tests: Add some basic control flow tests
|
---|
839 | - spirv: Add skip_os_break_in_debug_build option to use in unit tests
|
---|
840 | - intel/fs: Handle scoped barriers with execution scope
|
---|
841 | - intel/vec4: Handle scoped barriers with execution scope
|
---|
842 | - intel/compiler: Drop brw_nir_lower_scoped_barriers
|
---|
843 | - intel/compiler: Drop non-scoped barrier handling
|
---|
844 |
|
---|
845 | Caleb Cornett (6):
|
---|
846 |
|
---|
847 | - d3d12: Lower minimum supported Shader Model to 6.0
|
---|
848 | - futex: Change INT_MAX to INT32_MAX.
|
---|
849 | - util: Add #ifdefs for Xbox GDK support.
|
---|
850 | - dxil_validator: Add support for Xbox GDK.
|
---|
851 | - wgl: Add support for Xbox GDK.
|
---|
852 | - d3d12: Add support for Xbox GDK.
|
---|
853 |
|
---|
854 | Charlie Birks (1):
|
---|
855 |
|
---|
856 | - docs: add a few vulkan extensions supported by multiple drivers
|
---|
857 |
|
---|
858 | Charmaine Lee (5):
|
---|
859 |
|
---|
860 | - svga: fix resource_get_handle from resource created without SHARED bind flag
|
---|
861 | - svga: fix compatible formats for shareable surfaces
|
---|
862 | - svga: use upload buffer if texture has pending changes
|
---|
863 | - translate: do not clamp element index in generic_run
|
---|
864 | - svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device
|
---|
865 |
|
---|
866 | Chia-I Wu (38):
|
---|
867 |
|
---|
868 | - turnip: replace TU_DEBUG_DONT_CARE_AS_LOAD by a bool
|
---|
869 | - turnip: make debug_flags a global variable
|
---|
870 | - freedreno: add has_implicit_modifier helper
|
---|
871 | - freedreno: support UBWC scanout
|
---|
872 | - turnip: add a comment to tu_format_for_aspect
|
---|
873 | - turnip: move a comment about FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8
|
---|
874 | - turnip: remove tu_native_format::tile_mode
|
---|
875 | - turnip: make tu6_format_*_supported static
|
---|
876 | - turnip: let tu6_format_vtx* take pipe format
|
---|
877 | - turnip: add blit_format_texture
|
---|
878 | - turnip: add blit_format_color and blit_base_format
|
---|
879 | - turnip: handle ubwc in blit_base_format
|
---|
880 | - turnip: reorder tu6_format_*
|
---|
881 | - freedreno/registers: correct WFM bit in CP_REG_TEST
|
---|
882 | - turnip: add a comment to tu_render_pass_cond_config
|
---|
883 | - turnip: skip unnecessary CP_REG_TEST for cond load/store
|
---|
884 | - freedreno/registers: document more bits of CP_REG_TEST
|
---|
885 | - freedreno: avoid conditional ib in fd6_emit_tile
|
---|
886 | - radv: fix a hang with binning on CHIP_RENOIR
|
---|
887 | - turnip: fix a major leak with GPL LTO
|
---|
888 | - turnip: fix a null descriptor set dereference
|
---|
889 | - turnip: avoid FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8 for event blits
|
---|
890 | - radv: add a size check in radv_create_buffer for Android
|
---|
891 | - util/log: refactor mesa_log
|
---|
892 | - util/log: allow multiple loggers
|
---|
893 | - util/log: improve logger_file newline handling
|
---|
894 | - util/log: improve logger_android
|
---|
895 | - util/log: add logger_syslog
|
---|
896 | - util/log: add support for MESA_LOG_FILE
|
---|
897 | - util/log: add logger_windbg
|
---|
898 | - mesa: add missing newlines for _mesa_debug/_mesa_log callers
|
---|
899 | - mesa: use mesa_log from output_if_debug
|
---|
900 | - anv: process utrace payloads on queue submission
|
---|
901 | - ci/radv: remove dEQP-VK.image.sample_texture.* fails/flakes
|
---|
902 | - radv: set RADEON_FLAG_GTT_WC for external mem on vram
|
---|
903 | - radv: rework radv_layout_fmask_compressed
|
---|
904 | - radv: add RADV_FMASK_COMPRESSION_PARTIAL
|
---|
905 | - radv: disable tc_compatible_cmask on GFX9 in some cases
|
---|
906 |
|
---|
907 | Christian Gmeiner (1):
|
---|
908 |
|
---|
909 | - etnaviv: nir: use lower_fround_even
|
---|
910 |
|
---|
911 | Collabora's Gfx CI Team (3):
|
---|
912 |
|
---|
913 | - Uprev Piglit to 60e7f0586bac0cfcfcb5871046e31ca2057a5117
|
---|
914 | - Uprev Piglit to 2391a83d1639a7ab7bbea02853b922878687b0e5
|
---|
915 | - Uprev Piglit to 355ad6bcb2cb3d9e030b7c6eef2b076b0dfb4d63
|
---|
916 |
|
---|
917 | Connor Abbott (12):
|
---|
918 |
|
---|
919 | - freedreno/a6xx: Rename CP_CSQ_IB*_STAT
|
---|
920 | - freedreno/a6xx: Add CP_ROQ_*_STAT
|
---|
921 | - freedreno/a6xx: Fix CP_ROQ_THRESHOLDS_1
|
---|
922 | - freedreno/a6xx: Fill in ROQ status registers
|
---|
923 | - freedreno/crashdec: Fix apparent off-by-one with ROQ size
|
---|
924 | - freedreno/crashdec: Add prefetch test
|
---|
925 | - tu: Fix tile_align_h on a650
|
---|
926 | - freedreno: Fix or/and'ing two BitmaskEnums
|
---|
927 | - tu: Use vk_pipeline_get_renderpass_flags()
|
---|
928 | - vk/render_pass: Support VK_EXT_fragment_density_map
|
---|
929 | - nir, spirv: Add support for VK_EXT_fragment_density_map
|
---|
930 | - tu: Don't override depth for GMEM
|
---|
931 |
|
---|
932 | Constantine Shablya (12):
|
---|
933 |
|
---|
934 | - anv: handle ATTACHMENT_OPTIMAL layout
|
---|
935 | - anv: use Vulkan runtime's robust buffer access
|
---|
936 | - hasvk: use Vulkan runtime's robust buffer access
|
---|
937 | - anv,hasvk: flush what UNIFORM_READ flushes on SHADER_READ
|
---|
938 | - vulkan: relocate rmv to its correct home
|
---|
939 | - vulkan: tidy up vk_physical_device_features
|
---|
940 | - vulkan: delete trailing namespace
|
---|
941 | - vulkan: add hepler for vkGetPhysicalDeviceFeatures2
|
---|
942 | - vulkan: use vk_features for vk_device::enabled_features
|
---|
943 | - anv: use vk_get_physical_device_features
|
---|
944 | - vulkan: fix building with python3.8
|
---|
945 | - vulkan: depend idep_vulkan_runtime_headers on vk_physical_device_features.h
|
---|
946 |
|
---|
947 | Corentin Noël (12):
|
---|
948 |
|
---|
949 | - ci/venus: Remove failure now passing
|
---|
950 | - kopper: Do not free the given screen in initScreen implementation
|
---|
951 | - ci: uprev virglrenderer
|
---|
952 | - ci/venus: Skip tests risking out of memory issues
|
---|
953 | - ci: uprev virglrenderer and crosvm
|
---|
954 | - ci: Setup XDG_RUNTIME_DIR in crosvm-init
|
---|
955 | - ci: Allow to use crosvm-runner before deqp-runner
|
---|
956 | - ci: Uprev crosvm and virglrenderer
|
---|
957 | - venus/ci: Only run one crosvm instance
|
---|
958 | - mesa: OpenGL ES 3.0 requires EXT_instanced_arrays
|
---|
959 | - glapi: Make EXT_draw_instanced functions available for GLES 2.0
|
---|
960 | - mesa: Add EXT_instanced_arrays support
|
---|
961 |
|
---|
962 | Daniel Schürmann (82):
|
---|
963 |
|
---|
964 | - radv: CSE ray_launch_{size|id}
|
---|
965 | - radv: rename shader_info->cs.uses_sbt -> shader_info->cs.is_rt_shader
|
---|
966 | - radv: unconditionally enable scratch for RT shaders
|
---|
967 | - radv/rt: introduce and set rt_pipeline->stack_size
|
---|
968 | - radv/rt: use dynamic_callable_stack_base also for static stack_sizes
|
---|
969 | - radv/rt: don't hash maxPipelineRayRecursionDepth
|
---|
970 | - nir: add Continue Construct to nir_loop
|
---|
971 | - nir: add assertions that loops don't have a Continue Construct
|
---|
972 | - nir: create nir_push_continue() and related helpers
|
---|
973 | - nir: add lowering for Loop Continue Constructs
|
---|
974 | - spirv: use Loop Continue Construct to emit SPIR-V loops and lower after parsing
|
---|
975 | - nir/lower_continue_constructs: special-case Continue Constructs with zero or one predecessors
|
---|
976 | - nir/lower_continue_targets: only repair SSA when necessary
|
---|
977 | - nir: simplify nir_block_cf_tree_{next|prev}
|
---|
978 | - radv/rt: rename library_pipeline->groups to library_pipeline->group_infos
|
---|
979 | - radv/rt: defer library_pipeline allocation
|
---|
980 | - radv/rt: introduce struct radv_ray_tracing_module
|
---|
981 | - radv/rt: move stack_sizes into radv_ray_tracing_module
|
---|
982 | - radv/rt: only reserve stack_sizes after rt_case insertion
|
---|
983 | - radv: expose radv_postprocess_nir()
|
---|
984 | - radv: expose radv_pipeline_capture_shaders()
|
---|
985 | - radv/rt: introduce and use radv_rt_pipeline_compile()
|
---|
986 | - radv: remove unused parameters from radv_compute_pipeline_compile()
|
---|
987 | - radv/rt: move radv_pipeline_key from rt_variables to traversal_data
|
---|
988 | - nir/gather_info: allow terminate() in non-PS
|
---|
989 | - aco: fix NIR infinite loops
|
---|
990 | - radv/rt: use terminate() when returning from raygen shaders
|
---|
991 | - aco/dominance: set immediate dominator for any BB without predecessors
|
---|
992 | - aco/value_numbering: clear hashmap between disconnected CFGs
|
---|
993 | - aco/dead_code_analysis: don't add artificial uses to p_startpgm
|
---|
994 | - aco/insert_exec_mask: allow for disconnected CFG
|
---|
995 | - aco/spill: allow for disconnected CFG
|
---|
996 | - radv/rt: place any-hit scratch vars after intersection scratch vars
|
---|
997 | - radv/rt: Fix any_hit scratch variables.
|
---|
998 | - mesa: add gl_shader_stage_is_rt()
|
---|
999 | - radv: add RT shader args
|
---|
1000 | - radv: handle RT stages in radv_nir_shader_info_pass()
|
---|
1001 | - radv: add RT stages to radv_get_shader_name()
|
---|
1002 | - radv: add RT shader handling to radv_postprocess_config
|
---|
1003 | - aco: add RT stage enums
|
---|
1004 | - aco: don't set private_segment_buffer/scratch_offset on GFX9+
|
---|
1005 | - aco: move rt_dynamic_callable_stack_base_amd to VGPR
|
---|
1006 | - aco: implement load_ray_launch_{id|size}
|
---|
1007 | - aco: create hw_init_scratch() function for p_init_scratch lowering
|
---|
1008 | - aco: implement select_rt_prolog()
|
---|
1009 | - radv: add radv_create_rt_prolog()
|
---|
1010 | - radv: compile rt_prolog
|
---|
1011 | - radv/rt: use prolog for raytracing shaders
|
---|
1012 | - aco: remove aco::rt_stack variable
|
---|
1013 | - radv: remove unused parameter from radv_open_rtld_binary()
|
---|
1014 | - radv: separate radv_postprocess_binary_config() from radv_shader_create()
|
---|
1015 | - radv: remove unnecessary copy of binary->config
|
---|
1016 | - radv: inline radv_postprocess_config()
|
---|
1017 | - radv: separate radv_capture_shader_executable_info() from radv_shader_create()
|
---|
1018 | - radv: move gl_shader_stage from radv_binary to radv_shader_info
|
---|
1019 | - radv: remove radv_create_gs_copy_shader()
|
---|
1020 | - radv: refactor shader_compile()
|
---|
1021 | - radv: skip pipeline caching with RADV_DEBUG=shaders
|
---|
1022 | - radv: fix radv_shader_binary member fields to 32 bit.
|
---|
1023 | - radv/rt: Fix VK_KHR_pipeline_executable_properties
|
---|
1024 | - aco: split ps_epilog args before exporting them
|
---|
1025 | - aco/ra: adjust_max_used_regs() for fixed Operands
|
---|
1026 | - aco: don't use shared VGPRs for shaders consisting of multiple binaries
|
---|
1027 | - radv: update PS num_vgprs in case of epilogs rather than overallocating VGPRs
|
---|
1028 | - vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object
|
---|
1029 | - vulkan/pipeline_cache: Don't re-insert disk-cache hits into disk-cache
|
---|
1030 | - vulkan/pipeline_cache: implement vk_pipeline_cache_create_and_insert_object()
|
---|
1031 | - vulkan/pipeline_cache: use vk_pipeline_cache_create_and_insert_object() during vk_pipeline_cache_load()
|
---|
1032 | - vulkan/pipeline_cache: add cache parameter to deserialize() function
|
---|
1033 | - vulkan/pipeline_cache: move vk_log on failed deserialization to vk_pipeline_cache_load()
|
---|
1034 | - radv: derive struct radv_shader from vk_pipeline_cache_object
|
---|
1035 | - radv: unconditionally store the binary code in radv_shader
|
---|
1036 | - radv: add radv_shader_serialize() and radv_shader_deserialize() functions
|
---|
1037 | - radv: add struct radv_pipeline_cache_object
|
---|
1038 | - radv: implement radv_shader_create_cached()
|
---|
1039 | - radv: use vk_pipeline_cache
|
---|
1040 | - radv: clean up pipeline-cache interface
|
---|
1041 | - radv/ci: add 2 more Flakes for Navi21
|
---|
1042 | - radv/rt: fix total stack size computation
|
---|
1043 | - radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error
|
---|
1044 | - vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects
|
---|
1045 | - radv: add padding to radv_shader_binary_legacy
|
---|
1046 |
|
---|
1047 | Daniel Stone (18):
|
---|
1048 |
|
---|
1049 | - ci/fdno: Only run full tests on a limited subset of machines
|
---|
1050 | - ci/radv: Skip vkCreateInstance memory-fail test
|
---|
1051 | - ci/anv: Temporarily halve TGL testing load
|
---|
1052 | - intel/isl: Don't scream FINISHME into logs for 3D vs. CCS
|
---|
1053 | - ci/radv: Drop raven quick_shader load
|
---|
1054 | - ci/fdno: Add a618 Vulkan flakes
|
---|
1055 | - ci/zink: Add flake seen in the wild
|
---|
1056 | - ci/radv: Lower stoney CTS load
|
---|
1057 | - ci/android: Use a more aggressive timeout for the job
|
---|
1058 | - ci: Actually run Piglit on LAVA
|
---|
1059 | - ci: Disable Collabora LAVA farm
|
---|
1060 | - Revert "ci: Disable Collabora LAVA farm"
|
---|
1061 | - CI: Disable Windows runners
|
---|
1062 | - CI: Disable mingw job
|
---|
1063 | - ci/panfrost: Add texturesize flake seen in the wild
|
---|
1064 | - CI: Disable freedreno
|
---|
1065 | - ci/radeonsi: sort and dedup stoney skips
|
---|
1066 | - ci/radeonsi: Skip really slow tests on stoney
|
---|
1067 |
|
---|
1068 | Danylo Piliaiev (52):
|
---|
1069 |
|
---|
1070 | - tu/kgsl: do not use kgsl_command_object::offset
|
---|
1071 | - tu: Prevent using stale value of RB_UNKNOWN_88D0 on BLIT
|
---|
1072 | - tu: Prevent using stale value of GRAS_SC_CNTL in sysmem clear
|
---|
1073 | - freedreno: Document A6XX_GRAS_SC_CNTL::rotation field
|
---|
1074 | - turnip: Ensure that there is no renderpass rotation in binning
|
---|
1075 | - turnip: Disable draw states after dyn renderpass in all cases
|
---|
1076 | - ir3: Consider dst type in ubo_vec4 to ldc lowering
|
---|
1077 | - tu: Don't expose KHR_present_id,KHR_present_wait without KHR_swapchain
|
---|
1078 | - turnip: Add debug option to find usage of stale reg values
|
---|
1079 | - docs/freedreno: Add info about stale reg stomper dbg option
|
---|
1080 | - ci/tu: Add 1/200 pass to test for stale reg usage
|
---|
1081 | - ir3: Add cat5/cat7 cache related instructions
|
---|
1082 | - ir3: Add cat7 sleep instruction
|
---|
1083 | - freedreno/register: Define chip enum values
|
---|
1084 | - util/perf: C++-proof util/perf
|
---|
1085 | - util/format: Make format_table compatible with C++
|
---|
1086 | - spirv: sort spirv_supported_capabilities
|
---|
1087 | - vk/vk_extension_gen: Make table struct initializable in C++ on older gcc
|
---|
1088 | - vk/wsi: C++-proof wsi_common_drm.h
|
---|
1089 | - vk/util: remove (void \*) casts from vk_foreach_multi_draw macros
|
---|
1090 | - vk/util: Generate defines to help casting structs with vk_find_struct
|
---|
1091 | - freedreno/common: C++-proof freedreno_uuid.h
|
---|
1092 | - ir3: C++-proofing
|
---|
1093 | - tu: C++-proofing: fix offsetof with dynamic array index
|
---|
1094 | - tu: C++-proofing: fix struct initializers
|
---|
1095 | - tu: C++-proofing: various enum fixes
|
---|
1096 | - tu: C++-proof: do not goto over variables initialization
|
---|
1097 | - tu: C++-proofing: fix designator initializer order
|
---|
1098 | - tu: C++-proofing: fix extension table initialization
|
---|
1099 | - tu: C++-proofing: Initialize tu_reg_value in-order by pack funcs
|
---|
1100 | - tu: C++-proofing: fix casting from void * fpermissive warnings
|
---|
1101 | - tu: C++-proofing: ease access to global bo struct
|
---|
1102 | - tu: C++-proofing: prevent taking address from rvalue
|
---|
1103 | - tu: C++-proofing: cast result when extracting field from reg value
|
---|
1104 | - tu: C++-proofing: misc fixes
|
---|
1105 | - freedreno/msm: Rename drm_msm_gem_submit_reloc::or in C++ code
|
---|
1106 | - tu: compile as C++
|
---|
1107 | - vk/entry_points: Add option to generate template entrypoints
|
---|
1108 | - freedreno/regs: Include assert.h in generated headers
|
---|
1109 | - tu: Generate entrypoints for each gen
|
---|
1110 | - turnip: add cached and cached-coherent memory types
|
---|
1111 | - tu/drm: Support cached non-coherent memory
|
---|
1112 | - freedreno/registers: Document new CP_EVENT_WRITE::SEQNO
|
---|
1113 | - freedreno/registers: More a7xx regs
|
---|
1114 | - freedreno/computerator: C++ proofing
|
---|
1115 | - freedreno: C++ fixes for computerator to compile
|
---|
1116 | - freedreno/computerator: Convert to C++
|
---|
1117 | - freedreno: Move fd6_pack.h to common code accessible by computerator
|
---|
1118 | - freedreno: Add dummy a730/a740 definition
|
---|
1119 | - freedreno/computerator: Templatize a6xx backend
|
---|
1120 | - freedreno/computerator: Add support for a7xx
|
---|
1121 | - vulkan: Sanitize pSampleMask in CmdSetSampleMaskEXT
|
---|
1122 |
|
---|
1123 | Dave Airlie (37):
|
---|
1124 |
|
---|
1125 | - ci: bump vk cts to 1.3.3.1 + and a crash fix.
|
---|
1126 | - vulkan/video: add common h264/h265 parameter set management code.
|
---|
1127 | - vulkan/format: add a 10-bit video format
|
---|
1128 | - radv: remove the status query mark it unsupported.
|
---|
1129 | - radv: add new upload alloc aligned api
|
---|
1130 | - ac: add name to codec info struct
|
---|
1131 | - radv: adding video decode queue support
|
---|
1132 | - radv: add video decoder register setup.
|
---|
1133 | - radv/video: add initial frameworking.
|
---|
1134 | - radv/video: add initial h264 decoder for VCN
|
---|
1135 | - radv/video: add h264 support for uvd
|
---|
1136 | - radv: add vcn h265 decode.
|
---|
1137 | - radv/video: add h265 decode UVD support
|
---|
1138 | - radv/vcn: enable dynamic dpb tier 2 for h264/h265 on navi21+
|
---|
1139 | - anv: add video engine support in various places
|
---|
1140 | - anv: set Y/4 tiling for video decode images
|
---|
1141 | - anv: add video format features for the one supported video output format
|
---|
1142 | - anv/format: handle video extensions structs by ignoring them
|
---|
1143 | - intel/genxml: align some of the fields with the media driver
|
---|
1144 | - intel/genxml: add missing power well control bits
|
---|
1145 | - anv/image: allocate some memory for mv storage after video images.
|
---|
1146 | - anv: add initial video decode support for h264.
|
---|
1147 | - anv/query: add query status report
|
---|
1148 | - anv: enable video decode extensions.
|
---|
1149 | - anv/video: fix video memory bindings.
|
---|
1150 | - crocus: disable Y tiling for render targets properly.
|
---|
1151 | - crocus: switch gen4/5 tiling flags to follow suggestions.
|
---|
1152 | - llvmpipe: fix compute address bits to return native pointer size.
|
---|
1153 | - anv: always pick graphics queue to execute prime blits on.
|
---|
1154 | - radv: add video format support to format probing.
|
---|
1155 | - anv/video: fix chroma qp to be a integer value.
|
---|
1156 | - anv/video: disable picture id reampping.
|
---|
1157 | - anv: fix image height for field pictures.
|
---|
1158 | - radv/video: fix h264 frame heights when field images are in use
|
---|
1159 | - radv/video: fix used for reference flags.
|
---|
1160 | - radv/video: fix h265 decoding sizes.
|
---|
1161 | - radv/trace: don't attempt to emit trace on non-graphics/compute queues
|
---|
1162 |
|
---|
1163 | David (Ming Qiang) Wu (1):
|
---|
1164 |
|
---|
1165 | - radeonsi/vcn: add an exception of field case for h264 decoding
|
---|
1166 |
|
---|
1167 | David Heidelberg (73):
|
---|
1168 |
|
---|
1169 | - ci/zink: Penumbra is now fixed.
|
---|
1170 | - freedreno/ci: Switch also performance a630 job to manual
|
---|
1171 | - ci/anv: add multiple fails uncovered by change of sharding
|
---|
1172 | - ci/intel: fully utilize asus-cx9400-volteer
|
---|
1173 | - ci/piglit: explicitely define we want GLX tests
|
---|
1174 | - ci: migrate from wget to curl
|
---|
1175 | - ci/piglit: 2023-01-19 uprev
|
---|
1176 | - ci: bump ci-fairy with session support (robust downloads)
|
---|
1177 | - ci: Sir trace has small invisible change in rendering
|
---|
1178 | - ci: bump Mold to the 1.10.0
|
---|
1179 | - ci: uprev piglit (etag md5 checksumming support)
|
---|
1180 | - ci/lavapipe: use dxvk for the traces
|
---|
1181 | - ci: revert download of git cache to the wget
|
---|
1182 | - ci/llvmpipe: add flake timeout for rusticl program@execute@[email protected]
|
---|
1183 | - util/process_test: make the error variable static
|
---|
1184 | - intel: enable -mfpmath=sse on x86
|
---|
1185 | - intel: use c_see2_arg instead of explicit -msse2
|
---|
1186 | - ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_dispatch
|
---|
1187 | - meson: print c_cpp_args
|
---|
1188 | - intel/vulkan: add missing dependency on generated headers
|
---|
1189 | - ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_flush
|
---|
1190 | - ci/alpine: keep the curl inside the image
|
---|
1191 | - ci: alpine: install bash and coreutils for date -d
|
---|
1192 | - ci: implement unified sections
|
---|
1193 | - ci: make meson build and test uncollapsed
|
---|
1194 | - ci: deqp-runner: drop already unused function
|
---|
1195 | - ci: Retry, retry, retry... No one likes to trigger Marge more than once.
|
---|
1196 | - ci/zink: add skip for the Single-GL46.enhanced_layouts.ssb_member_align_non_power_of_2
|
---|
1197 | - ci/lavapipe: add recent occasional flake
|
---|
1198 | - ci/freedreno: rare flake KHR-GL45.sample_variables.mask.rgba8i.samples_4.mask_3
|
---|
1199 | - crocus/meson: add dependency on libintel_dev also for versioned static libraries
|
---|
1200 | - ci/ci_run_n_monitor: while we usually disable many jobs, print them inline
|
---|
1201 | - ci: do not exit when an error happens inside the section
|
---|
1202 | - ci/lavapipe: fixes typo
|
---|
1203 | - ci/zink: fixup the zink-lvp job
|
---|
1204 | - ci: disable mesa-swrast runner jobs
|
---|
1205 | - ci/lava: implement the priority
|
---|
1206 | - ci/weston: before testing, verify that XWayland is really running
|
---|
1207 | - ci/weston: add background PID
|
---|
1208 | - ci: add and utilize dalboz devices
|
---|
1209 | - ci/amd: move skqp and va jobs on raven from XOrg to the XWayland
|
---|
1210 | - ci/panfrost: correct the job name, as it runs on gles2
|
---|
1211 | - ci/lava: every LAVA job doesn't want to run gles2 deqp, drop it
|
---|
1212 | - ci: build Wayland support for the amd64
|
---|
1213 | - ci/iris: update apl and glk expectations, after enabling Wayland support
|
---|
1214 | - ci/clover: disable the jobs
|
---|
1215 | - ci/traces: disable nheko trace with zink since it flakes
|
---|
1216 | - ci/freedreno: add recent occasional flakes
|
---|
1217 | - ci/traces: add two skips due to flakes
|
---|
1218 | - ci/intel: add dEQP-EGL.functional.wide_color.window_fp16_default_colorspace flake
|
---|
1219 | - ci: distribute XDG_RUNTIME_DIR with setup-test-env script
|
---|
1220 | - ci: disable weston session timeout for llvmpipe
|
---|
1221 | - meson: implement quirk for the compilation under armv7 GCC with LTO
|
---|
1222 | - aco: drop leftover variable
|
---|
1223 | - ci: bump Alpine to 3.17 (again)
|
---|
1224 | - ci/freedreno: do not build tools executables without explicitly enabling them
|
---|
1225 | - freedreno/decode: fix possible overflow
|
---|
1226 | - ci: rename .lava-test to .lava-test-deqp to describe it correctly
|
---|
1227 | - ci: create lava-test without deqp HWCI_TEST_SCRIPT
|
---|
1228 | - ci: remove deqp from lava piglit and traces runs
|
---|
1229 | - ci/freedreno: split deqp from other jobs
|
---|
1230 | - ci/freedreno: define Google farm specific includes
|
---|
1231 | - ci/freedreno: Make traces work on LAVA caching proxy
|
---|
1232 | - ci/broadcom: test occasionally fails, but typically passes
|
---|
1233 | - ci: disable lima farm, currently out-of-space, needs to be fixed
|
---|
1234 | - ci: implement sections for cuttlefish
|
---|
1235 | - ci/v3d: add flaking spec@ext_framebuffer_blit@fbo-sys-blit
|
---|
1236 | - Revert "mesa: Enable NV_texture_barrier in GLES2+"
|
---|
1237 | - ci/amd: update device status
|
---|
1238 | - ci/amd: raven is currently downgraded to 2 machines only, adapt
|
---|
1239 | - ci/amd: add draw.dynamic_rendering flake
|
---|
1240 | - ci/freedreno: fix the a530_piglit job and switch to Weston
|
---|
1241 | - panvk: clear dangling pointers
|
---|
1242 |
|
---|
1243 | David Redondo (1):
|
---|
1244 |
|
---|
1245 | - egl/wayland: fix oob buffer access during buffer_fds clean up
|
---|
1246 |
|
---|
1247 | David Rosca (2):
|
---|
1248 |
|
---|
1249 | - frontents/va: Use PIPE_USAGE_STAGING for coded buffer
|
---|
1250 | - frontends/va: Map VAEncCodedBufferType buffer as PIPE_MAP_READ
|
---|
1251 |
|
---|
1252 | Dmitry Baryshkov (2):
|
---|
1253 |
|
---|
1254 | - freedreno/a5xx: reorder GPMU registers
|
---|
1255 | - freedreno/a5xx: add SP clock control register
|
---|
1256 |
|
---|
1257 | Dmitry Osipenko (6):
|
---|
1258 |
|
---|
1259 | - util/cache_test: Unset env vars left after Cache.List test
|
---|
1260 | - util/mesa-db: Don't account header size
|
---|
1261 | - util/mesa-db: Support removal of cache entries
|
---|
1262 | - util/cache_test: Remove dummy cache entry added by cache_exists()
|
---|
1263 | - util/mesa-db: Introduce multipart mesa-db cache
|
---|
1264 | - util/disk_cache: Switch to multipart mesa-db cache
|
---|
1265 |
|
---|
1266 | Dylan Baker (38):
|
---|
1267 |
|
---|
1268 | - VERSION: bump to 23.1.0-devel for further development
|
---|
1269 | - docs: reset new_features.txt
|
---|
1270 | - meson: bump minimum required version to meson 0.59
|
---|
1271 | - meson: replace has_exe_wrapper with can_run_host_binaries
|
---|
1272 | - meson: replace uses of ExternalProgram.path with .full_path
|
---|
1273 | - meson: drop meson < 0.54 workaround
|
---|
1274 | - meson: use a feature option for dri3
|
---|
1275 | - meson: use a feature option for gallium-vdpau
|
---|
1276 | - meson: use a feature option for gallium-va
|
---|
1277 | - meson: use a feature option for gallium-xa
|
---|
1278 | - meson: use a feature option for shader_cache
|
---|
1279 | - meson: use a feature option for shared-glapi
|
---|
1280 | - meson: use a feature option for gles1
|
---|
1281 | - meson: use a feature option for gles2
|
---|
1282 | - meson: use a feature option for gbm
|
---|
1283 | - meson: use a feature option for llvm
|
---|
1284 | - meson: use a feature option for valgrind
|
---|
1285 | - meson: use a feature option for libunwind
|
---|
1286 | - meson: use a feature option for lmsensors
|
---|
1287 | - meson: use a feature option for power8
|
---|
1288 | - meson: use a feature option for xlib-lease
|
---|
1289 | - meson: use a feature option for zstd
|
---|
1290 | - meson: use a feature option for egl
|
---|
1291 | - meson: use a feature option for shared-llvm
|
---|
1292 | - meson: Use feature option methods for xmlconfig
|
---|
1293 | - meson: remove version checks for < 0.59
|
---|
1294 | - meson: use builtin support for reading version from a file
|
---|
1295 | - meson: use [] instead of 'lib for !windows name_prefix
|
---|
1296 | - meson: use the same workaround for setting 'lib' on windows
|
---|
1297 | - meson: combine checks for linker --gc-sections support
|
---|
1298 | - util: rzalloc and free hash_table_u64
|
---|
1299 | - iris: consider bufmgr creation to have failed if \`dup`ing of the fd fails
|
---|
1300 | - intel/mi: use 64bit constant for bitshift
|
---|
1301 | - intel/dev: create a helper dependency for libintel_dev
|
---|
1302 | - docs: Add calendar entries for 23.0 release.
|
---|
1303 | - docs: add release notes for 23.0.0
|
---|
1304 | - docs: Add sha256 sum for 23.0.0
|
---|
1305 | - docs/relnotes: add 23.0.0 to relnotes.rst
|
---|
1306 |
|
---|
1307 | Ella Stanforth (1):
|
---|
1308 |
|
---|
1309 | - v3dv: add support for multi-planar formats, enable YCbCr
|
---|
1310 |
|
---|
1311 | Emma Anholt (211):
|
---|
1312 |
|
---|
1313 | - dri2: Fix exposing robustness with swkms.
|
---|
1314 | - ci/llvmpipe: Drop dEQP-EGL.functional.sharing.*.link.7 flakes.
|
---|
1315 | - ci/iris: Add known flakes for skqp.
|
---|
1316 | - ci/iris: Generalize the 8888_pbuffer EGL known flakes and share with GLK.
|
---|
1317 | - ci/zink: Add more blit conversion xfails for a618.
|
---|
1318 | - freedreno: Skip CPU/GPU timestamp sync when not supported.
|
---|
1319 | - ci/freedreno: Add glx-swap-event-async as a flake.
|
---|
1320 | - freedreno/pps: Fix a signed/unsigned complaint.
|
---|
1321 | - ci: Enable building the testing drivers with perfetto.
|
---|
1322 | - ci: Add some new folks to the restricted-traces access list.
|
---|
1323 | - Revert "nouveau/ci: temporary disable gk20a-gles"
|
---|
1324 | - ci/virgl: Disable iris traces for now while it's unstable.
|
---|
1325 | - ci: Drop windowoverlap xfails, since it's always skipped.
|
---|
1326 | - ci/zink: Drop xfail for copy-sub-buffer.
|
---|
1327 | - ci/zink: Drop glx-swap-copy xfails.
|
---|
1328 | - ci/zink: Clear issue #7781 flakes.
|
---|
1329 | - ci/freedreno: Switch the piglit job to using a deqp-runner suite.
|
---|
1330 | - ci: Move PIGLIT_PLATFORM settings out of the .tomls.
|
---|
1331 | - ci/piglit: Add some common piglit skips for Mesa CI's testing of glx.
|
---|
1332 | - ci/piglit: Exclude swapbuffers front-readback tests with PIGLIT_PLATFORM=gbm.
|
---|
1333 | - zink: Fatal error if requesting validation and we fail to load the layer.
|
---|
1334 | - zink: Add missing Flat decorations on some inputs.
|
---|
1335 | - zink: Fix validation failure for maxLod < minLod.
|
---|
1336 | - zink: Fix up mismatches of memory model vs addressing model.
|
---|
1337 | - zink: Re-emit the SpvBuiltInSampleMask access chain each load.
|
---|
1338 | - ci/zink: Add coverage using the vulkan validation layer on lvp.
|
---|
1339 | - ci/zink: Update TGL full-run xfails.
|
---|
1340 | - ci/zink: Update radv xfails for the recent shadow fixes.
|
---|
1341 | - ci/freedreno: Mark max-texture-size as a flake.
|
---|
1342 | - ci: Move the performance jobs' allow_failure:true to the gl rules.
|
---|
1343 | - ci: Add manual rules variations to disable irrelevant driver jobs.
|
---|
1344 | - freedreno: Don't sync timestamps while perfetto isn't running.
|
---|
1345 | - ci/zink: Disable Amnesia trace until the linked issue gets fixed.
|
---|
1346 | - ci/zink: Move the zink-anv-tgl manual full run to custom manual deps.
|
---|
1347 | - ci: Run our manual jobs during the nightly scheduled run.
|
---|
1348 | - ci: Fix perf jobs blocking Marge pipelines.
|
---|
1349 | - ci: Fix perf job condition.
|
---|
1350 | - ci: Drop the itoral-gl-terrain demo from traces.
|
---|
1351 | - tu: Mark tiling impossible if we couldn't lay out gmem in the first place.
|
---|
1352 | - turnip: Optimize tile sizes to reduce the number of bins.
|
---|
1353 | - tu: Only emit the conditional gmem subpass resolves when gmem is possible.
|
---|
1354 | - turnip: Make the tiling-impossible case have an impossible tile layout.
|
---|
1355 | - gallivm: Optimize emit_read_invocation's first-invocation loop.
|
---|
1356 | - gallivm: Refactor out a shared "get the first active invocation" loop.
|
---|
1357 | - gallivm: Return 0 first_active_invocation when we know that up front.
|
---|
1358 | - gallivm: Use cttz instead of a loop for first_active_invocation().
|
---|
1359 | - gallivm: Use first active invocation in some image/ssbo accesses.
|
---|
1360 | - ci/lvp: Drop the subgroupbroadcast skips.
|
---|
1361 | - llvmpipe: Enable LP_DEBUG on normal builds.
|
---|
1362 | - gallivm: Enable GALLIVM_DEBUG (mostly) on non-DEBUG builds.
|
---|
1363 | - gallivm: Fix the type of array nir_registers.
|
---|
1364 | - gallivm: Fix codegen performance for constant-index register array stores.
|
---|
1365 | - gallivm: Do the same codegen improvement for constant-index array loads.
|
---|
1366 | - ci/swrast: Drop skips for tests whose perf had been fixed.
|
---|
1367 | - ci/llvmpipe: Drop skip of InteractionFunctionCalls2.
|
---|
1368 | - ci/freedreno: Don't forget to report flakes on a618, too.
|
---|
1369 | - u_trace: Add an interface for checking trace enablement outside a context.
|
---|
1370 | - zink: Add tracing of blit operations.
|
---|
1371 | - ci: Disable systems in my farm that haven't recovered.
|
---|
1372 | - ci/zink: Update TGL full-run xfails.
|
---|
1373 | - ci/freedreno: Disable the a306_piglit_gl job.
|
---|
1374 | - ci/freedreno: Update a530 manual-run xfails.
|
---|
1375 | - ci/freedreno: Add an xfail for a618 VK full run.
|
---|
1376 | - ci/freedreno: Update a3xx piglit_shader xfails.
|
---|
1377 | - ci/nouveau: Disable the gm20b jobs entirely.
|
---|
1378 | - ci/radv: Update navi21 llvm xfails.
|
---|
1379 | - ci/crocus: Update HSW expectations.
|
---|
1380 | - ci/freedreno: Update manual-run xfails for a530.
|
---|
1381 | - Revert "freedreno/a5xx: Fix clip_mask"
|
---|
1382 | - ci/radv: Add a skip for navi21-llvm for a test that consistently timeouts.
|
---|
1383 | - ci/etnaviv: Drop stale xfails from gc7000.
|
---|
1384 | - ci/etnaviv: Update deqp xfails for gc2000.
|
---|
1385 | - egl/kopper: Add assert for no kopper in dri2_copy_region.
|
---|
1386 | - egl: Add a note explaining the swapBuffers badness in dri2_x11_copy_buffers().
|
---|
1387 | - egl/kopper: Use the kopper private interface for swapBuffers.
|
---|
1388 | - egl/kopper: Pass ancillary invalidate flush flags down to gallium.
|
---|
1389 | - ci: Add a manual full and 1/10th hasvk CTS runs.
|
---|
1390 | - hasvk: Silence conformance warning in CI.
|
---|
1391 | - hasvk: Fix SPIR-V warning about TF unsupported on gen7.
|
---|
1392 | - anv: Fix gfx8/9 VB range > 32bits workaround detection.
|
---|
1393 | - hasvk: Fix gfx8/9 VB range > 32bits workaround detection.
|
---|
1394 | - glsl: Drop the (v.x + v.y + v.z + v.w) -> dot(v, 1.0) optimization.
|
---|
1395 | - ci/etnaviv: Drop one more gc7000 xfail.
|
---|
1396 | - ci/freedreno: Drop a530 piglit_gl coverage.
|
---|
1397 | - ci/turnip: Drop the #8219 xfail.
|
---|
1398 | - ci/zink+turnip: Disable flaky minetest trace.
|
---|
1399 | - ci/hasvk: Add a synchronization flake.
|
---|
1400 | - ci: Fix stage of etnaviv manual runs.
|
---|
1401 | - ci/zink: Add a glx flake on anv
|
---|
1402 | - ci/crocus: Add new tess xfails and a link to the regression bug report.
|
---|
1403 | - ci/crocus: Mark unvanquished as flaky.
|
---|
1404 | - anv: Skip the RT flush when doing depth-only rendering.
|
---|
1405 | - anv: Skip BTI RT flush if we're doing an op that doesn't use render targets.
|
---|
1406 | - glsl/opt_algebraic: Drop ~~x == x transformation.
|
---|
1407 | - glsl/opt_algebraic: Drop log(exp(x)) -> x and exp(log(x)) -> x optimisations.
|
---|
1408 | - glsl/opt_algebraic: Drop pow-recognizer.
|
---|
1409 | - glsl/opt_algebraic: Drop abs(-x) -> abs(x) and abs(abs(x)) -> abs(x).
|
---|
1410 | - glsl/opt_algebraic: Drop -(-x) -> x optimization.
|
---|
1411 | - glsl/opt_algebraic: Drop f2i(trunc(x)) -> f2i(x) optimization.
|
---|
1412 | - glsl/opt_algebraic: drop fsat(fadd(b2f(x), b2f(y))) -> b2f(ior(x, y)) opt.
|
---|
1413 | - glsl/opt_algebraic: Drop shifts of 0 optimizations.
|
---|
1414 | - glsl/opt_algebraic: Drop pow optimizations.
|
---|
1415 | - glsl/opt_algebraic: Drop rcp optimizations.
|
---|
1416 | - glsl/opt_algebraic: Drop and/or/xor optimizations.
|
---|
1417 | - glsl/opt_algebraic: Drop fdiv(1,x) -> frcp(x) and fdiv(x,1) -> x optimizations.
|
---|
1418 | - glsl/opt_algebraic: Drop add/sub with 0 optimizations.
|
---|
1419 | - glsl/opt_algebraic: Drop x + -x -> 0 optimization.
|
---|
1420 | - glsl/opt_algebraic: Drop csel(true/false, x, y) optimization.
|
---|
1421 | - nir: Add optimization for fdot(x, 0) -> 0.
|
---|
1422 | - glsl/opt_algebraic: Drop fdot 0-channel optimizations.
|
---|
1423 | - glsl/opt_algebraic: Drop scalar all_eq/any_neq -> eq/neq opt.
|
---|
1424 | - glsl/opt_algebraic: Drop the eq/neq add-removal optimization.
|
---|
1425 | - glsl/opt_algebraic: Drop no-op pack/unpack optimization.
|
---|
1426 | - glsl/opt_algebraic: Drop the flrp/ffma simplifiers.
|
---|
1427 | - glsl/opt_algebraic: Drop some fmul simplifications.
|
---|
1428 | - nir: Port a floor->truncate algebraic opt pattern from GLSL.
|
---|
1429 | - glsl/opt_algebraic: Drop the ftrunc pattern recognizer.
|
---|
1430 | - glsl/opt_algebraic: Drop the flrp recognizer.
|
---|
1431 | - glsl: Remove unused as_rvalue_to_saturate().
|
---|
1432 | - ci: Update traces expectations for gutting glsl opt_algebraic.
|
---|
1433 | - panfrost/midgard: Fix handling of csel with a vector constant condition.
|
---|
1434 | - panfrost/midgard: Drop redundant arg to emit_explicit_constant.
|
---|
1435 | - glsl: Move lower_vector_insert to GLSL-to-NIR.
|
---|
1436 | - nir/split_64bit_vec3_and_vec4: Handle 64-bit matrix types.
|
---|
1437 | - gallivm: Return 0 for first active invocation when no invocations are active.
|
---|
1438 | - gallivm: Use first_active_invocation for ubo/kernel memory loads.
|
---|
1439 | - gallivm: Use first_active_invocation for scalar SSBO loads.
|
---|
1440 | - gallivm: Add some notes about other invocation_0_must_be_active usages.
|
---|
1441 | - ci: Add some xfail updates from VKCTS 1.3.5.0 for the manual jobs.
|
---|
1442 | - ci/etnaviv: Drop the dEQP-GLES2.functional.uniform_api.random.94 xfail.
|
---|
1443 | - anv+hasvk: Use driconf to disable 16-bit for zink.
|
---|
1444 | - zink: Pass the cmdbuf to the end of the marker, too.
|
---|
1445 | - Revert "ci: disable mesa-swrast runner jobs"
|
---|
1446 | - ci: Re-enable some swrast testing using fd.o's shared runners for now.
|
---|
1447 | - glsl/nir: Include early glsl-to-nir output in NIR_DEBUG=print.
|
---|
1448 | - glsl_to_nir: Use a variable's constant_value if it wasn't const-propped out.
|
---|
1449 | - glsl: Delete constant propagation pass.
|
---|
1450 | - glsl: Delete constant folding pass.
|
---|
1451 | - glsl: Delete constant-variables pass.
|
---|
1452 | - ci: Update trace expectations for GLSL constant prop removal.
|
---|
1453 | - ci/zink: Update TGL xfails/flakes based on the last nightly pipelines.
|
---|
1454 | - ci/turnip: Extend a630 vk full timeout to 3 hours.
|
---|
1455 | - ci/iris: Add skips for slow tests on APL.
|
---|
1456 | - turnip: Don't push inline uniform buffer contents outside constlen.
|
---|
1457 | - ci/turnip: Clear out stale xfails.
|
---|
1458 | - ci/turnip: Disable dEQP-VK.image.queue_transfer.* for now.
|
---|
1459 | - ci/turnip: Move some more of the 1.3.5 new xfails under links.
|
---|
1460 | - glsl: Simplify vector constructors from scalars.
|
---|
1461 | - glsl/lower_precision: Add a unit test that I thought we might fail at.
|
---|
1462 | - glsl/lower_precision: Add a cut-down testcase for #8124
|
---|
1463 | - glsl: Set the precisions of builtin function arguments and returns.
|
---|
1464 | - glsl: Handle highp promotion of builtin function args in the builtins.
|
---|
1465 | - glsl: Set the precision of function return value temporaries.
|
---|
1466 | - glsl/lower_precision: Drop most special-casing of builtin arg precision.
|
---|
1467 | - glsl: Fix the precision of atomic counter builtin function args.
|
---|
1468 | - glsl/lower_precision: Add actual spec quotes for "check_parameters"
|
---|
1469 | - nir/lower_mediump: Fix assertion about copy_deref lowering matching.
|
---|
1470 | - ci/iris: Update more manual job xfails from the Wayland build change.
|
---|
1471 | - ci/crocus: Update expectations from VK CTS 1.3.5.0.
|
---|
1472 | - ci/hasvk: Update some xfails from the 8-sample fast clear disable.
|
---|
1473 | - ci/etnaviv: Get the gc2000_piglit manual job mostly working.
|
---|
1474 | - glsl/standalone: Pull program create/destroy out to a public function.
|
---|
1475 | - glsl/standalone: Pull out a helper function for adding GLSL source shaders.
|
---|
1476 | - glsl/standalone: Make all standalone contexts have NewProgram set.
|
---|
1477 | - glsl: Write a new test for GLSL and NIR mediump lowering.
|
---|
1478 | - ci/crocus: Fix 1.3.5.0 xfails.
|
---|
1479 | - ci/etnaviv: Polish the gc2000 xfails a bit.
|
---|
1480 | - ci/zink: Update the tgl manual run xfails.
|
---|
1481 | - gallivm: Skip loads/stores that are definitely outside of compact vars.
|
---|
1482 | - nir/lower_sysvals: Add support for un-lowered tess_level_inner/outer.
|
---|
1483 | - nir_to_tgsi: Handle stores to compact outputs.
|
---|
1484 | - glsl: Delete the lower_tess_level pass.
|
---|
1485 | - glsl: Remove the TessLevel lowering special case from xfb.
|
---|
1486 | - glsl: Drop dead prototype.
|
---|
1487 | - ci/freedreno: Flake KHR-GL45.shader_image_load_store.basic-allTargets-store
|
---|
1488 | - ci/broadcom: Skip another texelfetch case.
|
---|
1489 | - perfetto: Add a .clang-format for the directory.
|
---|
1490 | - intel/perfetto: Drop unused "pipelined" field.
|
---|
1491 | - perfetto: Make a MesaRenderpassDataSource with common setup/start/stop.
|
---|
1492 | - perfetto: Deduplicate clock sync packet emit from renderstage sources.
|
---|
1493 | - perfetto: Move intel's cmdbuf/queue annotation code to the shared util.
|
---|
1494 | - ci/zink: Drop validation exception for leaks at device destroy.
|
---|
1495 | - ci/zink: Disable godot-tps-gles3 on a630.
|
---|
1496 | - docs: Update Vulkan renderpass docs for !22191
|
---|
1497 | - ci: Add missing dependency on doxygen sources for docs-generation jobs.
|
---|
1498 | - docs: Claim less functionality for glsl_compiler.
|
---|
1499 | - glsl: Move ForceGLSLAbsSqrt handling to glsl-to-nir.
|
---|
1500 | - zink: Add mapping for nir_op_ldexp, but disable it for 64-bit's sake.
|
---|
1501 | - glsl: Retire ldexp lowering in favor of the nir lowering flag.
|
---|
1502 | - glsl/softfp64: GC the temp vars after we lower them to SSA.
|
---|
1503 | - glsl/softfp64: Add fisfinite lowering.
|
---|
1504 | - state_tracker: Lower frexp before lowering doubles.
|
---|
1505 | - intel: Always call nir_lower_frexp.
|
---|
1506 | - ir3: Move turnip's nir_lower_frexp to the shared compiler.
|
---|
1507 | - nouveau: Add missing nir_opt_algebraic_late.
|
---|
1508 | - nouveau: Enable frexp lowering in the backend.
|
---|
1509 | - zink: Enable nir_lower_frexp.
|
---|
1510 | - v3d: Lower frexp in the GL compiler like we do in Vulkan.
|
---|
1511 | - agx: Enable nir_lower_frexp.
|
---|
1512 | - panfrost/midgard: Enable nir_lower_frexp.
|
---|
1513 | - nir_to_tgsi: Always lower frexp_exp/sig.
|
---|
1514 | - glsl: Drop frontend lowering of 32-bit frexp.
|
---|
1515 | - glsl: Drop PIPE_SHADER_CAP_DFRACEXP_DLDEXP_SUPPORTED.
|
---|
1516 | - tgsi: Drop TGSI_OPCODE_DFRACEXP.
|
---|
1517 | - ci/zink: Disable a630 portal-2-v2 due to kernel OOMs.
|
---|
1518 | - etnaviv: Fix regression from if_uses change.
|
---|
1519 | - blob: Don't valgrind assert for defined memory if we aren't writing.
|
---|
1520 | - util/log: Fix log messages over 1024 characters.
|
---|
1521 | - vulkan: Handle alignment failure in the pipeline cache.
|
---|
1522 | - vulkan: Actually increment the count of objects in GetPipelineCacheData.
|
---|
1523 | - ci/radeonsi: Mark glx-make-current as flaky.
|
---|
1524 |
|
---|
1525 | EmperorPenguin18 (1):
|
---|
1526 |
|
---|
1527 | - v3d: expose more drm formats with SAND128 modifier
|
---|
1528 |
|
---|
1529 | Eric Engestrom (172):
|
---|
1530 |
|
---|
1531 | - bin/ci: add gitlab_gql.py.cache to the .gitignore
|
---|
1532 | - mesa/st: drop unused param
|
---|
1533 | - ci/bare-metal: add more timestamps to help debugging issues
|
---|
1534 | - ci: be explicit about the \`meson setup` subcommand
|
---|
1535 | - docs: add release notes for 22.3.4
|
---|
1536 | - docs/relnotes: add sha256sum for 22.3.4
|
---|
1537 | - docs: update calendar for 22.3.4
|
---|
1538 | - meson: turn android-libbacktrace into a feature option
|
---|
1539 | - v3dv: mark dEQP-VK.api.command_buffers.record_many_draws_secondary_2 as flaky
|
---|
1540 | - ci/android: move common config to common job
|
---|
1541 | - ci/android: move virgl-specific gpu_mode to virgl-defined variables
|
---|
1542 | - ci/android: move virgl-specific fails/flakes/skips lists to virgl-defined variables
|
---|
1543 | - ci/android: move virgl-specific deqp suite to virgl-defined variables
|
---|
1544 | - ci/android: move virgl-specific so lib name to virgl-defined variables
|
---|
1545 | - ci/android: add missing line terminator at the end of the file
|
---|
1546 | - docs: add release notes for 22.3.5
|
---|
1547 | - docs: update calendar for 22.3.5
|
---|
1548 | - panfrost: drop no-longer-needed libglsl
|
---|
1549 | - gallium/u_screen.h: add missing stdint.h include
|
---|
1550 | - util: avoid calling kcmp on Android
|
---|
1551 | - etnaviv: use simple_mtx to avoid breaking windows in the next commit
|
---|
1552 | - gallium: move etnaviv screen_lookup_or_create function to common code
|
---|
1553 | - freedreno: replace custom code with u_pipe_screen_lookup_or_create()
|
---|
1554 | - lima: replace custom code with u_pipe_screen_lookup_or_create()
|
---|
1555 | - v3d: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
|
---|
1556 | - vc4: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
|
---|
1557 | - panfrost: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
|
---|
1558 | - asahi: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
|
---|
1559 | - u_pipe_screen_lookup_or_create: avoid re-querying the fd to have a consistent hash key
|
---|
1560 | - broadcom/ci: mark test as flaky
|
---|
1561 | - vk/util: keep track of extension requirements
|
---|
1562 | - vk/runtime: keep track of supported instance extensions
|
---|
1563 | - vk/runtime: turn vk.xml extension requirements into asserts
|
---|
1564 | - meson: move float64_glsl_file one meson.build up
|
---|
1565 | - meson: only build mapi when needed
|
---|
1566 | - meson: only build the loader when needed
|
---|
1567 | - meson: only build libglsl_util when needed
|
---|
1568 | - meson: only build glsl when needed
|
---|
1569 | - meson: drop \`TODO: opengl`, it's done
|
---|
1570 | - ci: simplify adding & removing deqp patches
|
---|
1571 | - ci: remove no-op sed
|
---|
1572 | - ci: fix grouping of image tags
|
---|
1573 | - ci: bump tags of deqp images
|
---|
1574 | - docs: add 23.1 branchpoint & rc dates
|
---|
1575 | - meson: make GLX require OpenGL
|
---|
1576 | - meson/windows: only build libgl-gdi for desktop gl
|
---|
1577 | - meson: allow building GLES without GL
|
---|
1578 | - mesa: add _mesa_is_desktop_gl_compat() and _mesa_is_desktop_gl_core() helpers
|
---|
1579 | - mesa: make use of the new _mesa_is_desktop_gl_compat() helper
|
---|
1580 | - mesa: make use of the new _mesa_is_desktop_gl_core() helper
|
---|
1581 | - mesa: make more use of the existing _mesa_is_gles* helpers
|
---|
1582 | - mesa: add & use new _mesa_is_gles1() & _mesa_is_gles2() helpers
|
---|
1583 | - mesa: make more use of the new _mesa_is_gles1() helper
|
---|
1584 | - mesa: make more use of the new _mesa_is_gles2() helper
|
---|
1585 | - mesa: optimize out _mesa_is_desktop_gl*() and _mesa_is_gles*() calls when not built
|
---|
1586 | - ci: stop watching for changes in removed script
|
---|
1587 | - meson: improve formatting of options file
|
---|
1588 | - broadcom/ci: refactor a bit
|
---|
1589 | - broadcom/ci: fold .vc4-rpi3-piglit:armhf into its only user
|
---|
1590 | - broadcom/ci: use deqp-runner to run piglit tests
|
---|
1591 | - docs/release-calendar: drop the last 22.2.x, it won't happen
|
---|
1592 | - broadcom/ci: group x11 and wayland variant of the same test failing
|
---|
1593 | - broadcom/ci: use weston's xwayland instead of starting X as well
|
---|
1594 | - broadcom/ci: add x11- prefix to x11 EGL tests
|
---|
1595 | - broadcom/ci: drop create_pixmap_surface from the fails; it passes now
|
---|
1596 | - broadcom/ci: skip buffer_age.no_preserve and swap_buffers_with_damage on wayland
|
---|
1597 | - broadcom/ci: add two known failures
|
---|
1598 | - broadcom/ci: re-enable egl on wayland
|
---|
1599 | - docs: include explicit \`setup` in instructions
|
---|
1600 | - docs: add release notes for 22.3.6
|
---|
1601 | - docs/relnotes: add sha256sum for 22.3.6
|
---|
1602 | - docs: update calendar for 22.3.6
|
---|
1603 | - v3d: update supertuxkart reference after 1c028a4d5b623e73bdf5
|
---|
1604 | - docs: mention the meson summary
|
---|
1605 | - docs: mention \`meson configure` and drop broken workaround script
|
---|
1606 | - meson: reuse vulkan_wsi_list for defining vk_wsi_args
|
---|
1607 | - meson: replace vk_wsi_args with dependencies to let meson take care of transitivity
|
---|
1608 | - egl: include directly the useful vulkan header, instead of including everything
|
---|
1609 | - glx: include directly the useful vulkan header, instead of including everything
|
---|
1610 | - gbm: drop unnecessary vulkan dependency
|
---|
1611 | - radv: split linker script for android since it requires different symbols
|
---|
1612 | - glsl: align definition of _mesa_problem with the one in main/error.h
|
---|
1613 | - glapi/meson: drop duplicate line in deps
|
---|
1614 | - meson: allow checking for null pointers even if they're supposed to be non-null
|
---|
1615 | - panfrost/ci: add EGL tests
|
---|
1616 | - asahi/winsys: add .clang-format
|
---|
1617 | - vk: move radv's linker symbols scripts for use in all drivers
|
---|
1618 | - v3dv: add linker script to fix android symbols
|
---|
1619 | - tu: add linker script to fix android symbols
|
---|
1620 | - anv: add linker script to fix android symbols
|
---|
1621 | - vn: add linker script to fix android symbols
|
---|
1622 | - android/vk: drop unnecessary symbols
|
---|
1623 | - vk: be stricter about symbols check between android and other platforms
|
---|
1624 | - v3d/ci: add dEQP-GLES3.functional.texture.specification.teximage2d_pbo.*_cube flakes
|
---|
1625 | - osmesa: add exported symbols check
|
---|
1626 | - docs: add release notes for 22.3.7
|
---|
1627 | - docs/relnotes: add sha256sum for 22.3.7
|
---|
1628 | - docs: update calendar for 22.3.7
|
---|
1629 | - v3dv/ci: add a test to the known failures
|
---|
1630 | - meson: bump minimum version to 0.60
|
---|
1631 | - meson: allow feature options to take true/false to mean enabled/disabled
|
---|
1632 | - meson: inline gtest_test_protocol now that it's always 'gtest'
|
---|
1633 | - v3dv: split out broadcom_shader_stage_to_gl() calls to improve readability
|
---|
1634 | - ci: take valve farm offline
|
---|
1635 | - ci: disable weston session timeout
|
---|
1636 | - broadcom/ci: no need to skip the tests that swap buffers anymore
|
---|
1637 | - ci/broadcom: move rare failure to the flakes
|
---|
1638 | - ci: drop redundant .no_scheduled_pipelines-rules + .core-rules since the latter already includes it
|
---|
1639 | - ci/rustfmt: simplify getting all the rust files
|
---|
1640 | - ci/rustfmt: print which files are checked
|
---|
1641 | - ci: group RESULT logic in a single place
|
---|
1642 | - v3dv/ci: fix test name (\`,Fail` is not part of the test name)
|
---|
1643 | - asahi: replace copies of .clang-format with symlinks
|
---|
1644 | - asahi: fix a few typos
|
---|
1645 | - v3d: fix \`dirty` bitset being too small to accept V3D_DIRTY_SSBO
|
---|
1646 | - v3dv: use common GetPhysicalDeviceFeatures
|
---|
1647 | - v3dv: reorder features as 1.0, 1.1, 1.2, 1.3
|
---|
1648 | - v3dv: use vk_get_physical_device_features
|
---|
1649 | - v3d/ci: add another depthstencil-default_fb-drawpixels-* to the flakes
|
---|
1650 | - v3d/ci: group dEQP-GLES3.functional.texture.specification.teximage2d_pbo.* flakes and add another one
|
---|
1651 | - ci: centralize detection of ccache in link-werror wrapper
|
---|
1652 | - ci: add linker wrapper for clang
|
---|
1653 | - ci: always use the -Werror wrapper
|
---|
1654 | - ci: deduplicate compiler wrappers
|
---|
1655 | - ci/docs: start documenting ci_run_n_monitor.py
|
---|
1656 | - v3d: add link to issue investigating failure
|
---|
1657 | - asahi: change create_renderonly signature to uniformize it
|
---|
1658 | - etnaviv: change create_renderonly signature to uniformize it
|
---|
1659 | - freedreno: change create_renderonly signature to uniformize it
|
---|
1660 | - lima: change create_renderonly signature to uniformize it
|
---|
1661 | - panfrost: change create_renderonly signature to uniformize it
|
---|
1662 | - v3d: change create_renderonly signature to uniformize it
|
---|
1663 | - vc4: change create_renderonly signature to uniformize it
|
---|
1664 | - kmsro: uniformize renderonly creation
|
---|
1665 | - kmsro: sort drivers alphabetically
|
---|
1666 | - ci/broadcom: consolidate vc4-rpi3* jobs into a single vc4-rpi3-gl:armhf
|
---|
1667 | - ci/broadcom: consolidate v3d-rpi4* jobs into a single v3d-rpi4-gl:armhf
|
---|
1668 | - ci/broadcom: slightly increase coverage of vk tests
|
---|
1669 | - vc4/ci: add arm64 failure to flakes as it works on armhf
|
---|
1670 | - broadcom/ci: run gl jobs on arm64, just like vk
|
---|
1671 | - vc4/ci: add another sync flake
|
---|
1672 | - panfrost: assign the correct create_for_resource from the start
|
---|
1673 | - Revert "broadcom/ci: run gl jobs on arm64, just like vk"
|
---|
1674 | - v3dv/ci: mark known dEQP-VK.wsi.xlib.surface.query_formats failure
|
---|
1675 | - ci/rustfmt: make sure to only check each file once
|
---|
1676 | - v3d: disable GL_NV_conditional_render
|
---|
1677 | - VERSION: bump for 23.1.0-rc1
|
---|
1678 | - .pick_status.json: Update to 8ebc5cbe2b828f34b9bfb32c528d3514ead59798
|
---|
1679 | - v3dv/ci: drop fixed failure from fails.txt
|
---|
1680 | - .pick_status.json: Update to 0d7912d239dac5bf3c8b07f2a6ca467f760d6aa6
|
---|
1681 | - .pick_status.json: Update to 543b6ca7c4b00c4bfff5668ba0a0643d565db201
|
---|
1682 | - amd: fix buggy usage of unreachable()
|
---|
1683 | - compiler: fix buggy usage of unreachable()
|
---|
1684 | - pvr: fix buggy usage of unreachable()
|
---|
1685 | - vk/util: fix buggy usage of unreachable()
|
---|
1686 | - v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit
|
---|
1687 | - VERSION: bump for 23.1.0-rc2
|
---|
1688 | - .pick_status.json: Update to 3017d01c9ded9c9fd097b600081b1bbe86e90fb8
|
---|
1689 | - .pick_status.json: Update to a18a51a708a86f51e0a5ab031b379f65bc84fb49
|
---|
1690 | - .pick_status.json: Update to c060b649c5a866f42e5df73f41c6e2809cf30e99
|
---|
1691 | - ci: rework vulkan validation layer build script
|
---|
1692 | - .pick_status.json: Update to 3f14fd8578549e34db2f564396f300819b2ff10f
|
---|
1693 | - VERSION: bump for 23.1.0-rc3
|
---|
1694 | - .pick_status.json: Update to 040aeb5a23e5cc8a71a352e55282d514dd2ab64f
|
---|
1695 | - .pick_status.json: Update to 9f522ac0c65ceae11ad1a4e84ec9f32a9393a25c
|
---|
1696 | - .pick_status.json: Update to efc94390f716b70ac1d5b09c6f949f938aeadcac
|
---|
1697 | - VERSION: bump for 23.1.0-rc4
|
---|
1698 | - .pick_status.json: Update to 6d84b34359dcbad477209adb9f9d0592c5a71bb9
|
---|
1699 | - .pick_status.json: Update to cb4e4fc5de48886758a26ff19d322947b5abfcec
|
---|
1700 | - dzn: fix pointer type mismatch
|
---|
1701 | - .pick_status.json: Update to 57afa7c0b12d6d0c9013368853080dfea5b50d07
|
---|
1702 | - .pick_status.json: Update to 31e6d15801a9904089aa2913c8eb5a31b79c7dfc
|
---|
1703 |
|
---|
1704 | Erico Nunes (5):
|
---|
1705 |
|
---|
1706 | - lima/ci: Add more piglit unsupported tests to skip
|
---|
1707 | - Revert "CI: Lima farm is offline"
|
---|
1708 | - lima: don't use resource_from_handle while creating scanout
|
---|
1709 | - lima/ci: restore swap buffers egl tests
|
---|
1710 | - Revert "ci: disable lima farm, currently out-of-space, needs to be fixed"
|
---|
1711 |
|
---|
1712 | Erik Faye-Lund (54):
|
---|
1713 |
|
---|
1714 | - zink: whitespace fixup
|
---|
1715 | - zink: fix depth-clip disable cap
|
---|
1716 | - zink: remove depth_clip_control_missing workaround
|
---|
1717 | - radeonsi: respect smoothing_enabled
|
---|
1718 | - meson: remove dupliace add_devenv call
|
---|
1719 | - meson: remove deprecated osmesa-bits option
|
---|
1720 | - meson: remove deprecated dri-drivers option
|
---|
1721 | - meson: avoid using deprecated build_root() method
|
---|
1722 | - meson: use files() instead of joining paths
|
---|
1723 | - freedreno/meson: simplify script-path logic
|
---|
1724 | - meson: do not reconstruct ICD paths
|
---|
1725 | - anv, hasvk: remove stale TODO-files
|
---|
1726 | - zink: correct companies in requirements
|
---|
1727 | - zink: remove incorrect trailing comma
|
---|
1728 | - meson: remove unused USE_FOO_ASM defines
|
---|
1729 | - vulkan: prefer vulkan_core.h over vulkan.h
|
---|
1730 | - meson: don't pass vk wsi args where they don't belong
|
---|
1731 | - Revert "meson: Fix Asahi build on macOS"
|
---|
1732 | - zink: prefer vulkan_core.h over vulkan.h
|
---|
1733 | - zink: get rid of needless dependency
|
---|
1734 | - ci: correct typo in name of linkcheck job
|
---|
1735 | - docs: update link to intel optimization reference manual
|
---|
1736 | - nir: add a print_internal debug-flag
|
---|
1737 | - docs: implement new vk-feat role
|
---|
1738 | - docs/zink: use vk-feat role for features
|
---|
1739 | - docs/zink: remove some trailing spaces
|
---|
1740 | - docs/zink: fixup wording of the GL 4.6 requirements
|
---|
1741 | - meson: correct typo in comment
|
---|
1742 | - ci: move docs-stuff out of root .gitlab-ci.yml
|
---|
1743 | - docs: fixup broken envvar-role syntax
|
---|
1744 | - docs: escape a few more strings
|
---|
1745 | - docs: fixup broken indentation
|
---|
1746 | - docs/zink: mention vk1.2 mirror-clamp feature option
|
---|
1747 | - docs/zink: clean up requirements-language
|
---|
1748 | - docs: move developers article to main website
|
---|
1749 | - docs: remove old thanks-article
|
---|
1750 | - docs: prefer http-links over ftp
|
---|
1751 | - docs/freedreno: fix turnip-heading level
|
---|
1752 | - docs: drop reference to modindex
|
---|
1753 | - docs: move old relnotes to _extra directory
|
---|
1754 | - docs: use version-number as toctree-title for relnotes
|
---|
1755 | - zink: emit terminate for spir-v 1.6
|
---|
1756 | - zink: use demote from spir-v 1.6 when possible
|
---|
1757 | - zink: use spir-v 1.6 local-size when needed
|
---|
1758 | - zink: enable spir-v 1.6 for vulkan 1.3
|
---|
1759 | - docs: format code-block as ini
|
---|
1760 | - docs: format code-block as toml
|
---|
1761 | - docs: make code-block indents consistent
|
---|
1762 | - ci: move virgl-rules after intel-rules
|
---|
1763 | - virgl/ci: clean up manual rules for virgl
|
---|
1764 | - ci: remove unused rules
|
---|
1765 | - zink: do not use sampled-image for buffers
|
---|
1766 | - nir: fix constant-folding of 64-bit fpow
|
---|
1767 | - llvmpipe: fixup refactor copypasta
|
---|
1768 |
|
---|
1769 | Faith Ekstrand (99):
|
---|
1770 |
|
---|
1771 | - nir: Add more opcodes to nir_tex_instr_is_query()
|
---|
1772 | - nir/builder: Add some texture helpers
|
---|
1773 | - radv: Use the new NIR builder tex helpers for meta
|
---|
1774 | - anv: Refactor Android externalFormat handling in CreateYcbcrConversion
|
---|
1775 | - anv/android: Use VkFormat for externalFormat
|
---|
1776 | - util/format: YUYV and UYVY have 4 8-bit channels
|
---|
1777 | - vulkan/formats: Add YCbCr format information
|
---|
1778 | - vulkan: Add a common vk_ycbcr_conversion struct
|
---|
1779 | - anv: Use the common vk_ycbcr_conversion object
|
---|
1780 | - anv: Use the YCbCr format info from common code
|
---|
1781 | - nir: Add copyright and include guards to nir_vulkan.h
|
---|
1782 | - anv,nir: Move the ANV YCbCr lowering pass to common code
|
---|
1783 | - gallium,util: Pull u_indices and u_primconvert back into gallium
|
---|
1784 | - mailmap: Remap e-mail addresses for Faith Ekstrand
|
---|
1785 | - vtn: Set alignment on initial UBO/SSBO casts
|
---|
1786 | - anv: Let spirv_to_nir() set UBO/SSBO base cast alignments
|
---|
1787 | - hasvk: Let spirv_to_nir() set UBO/SSBO base cast alignments
|
---|
1788 | - intel/compiler: Document wm_prog_key::persample_interp
|
---|
1789 | - intel/nir: Lower barycentrics to per-sample in a dedicated pass
|
---|
1790 | - nir: Remove nir_lower_io_force_sample_interpolation
|
---|
1791 | - intel/compiler: Use SHADER_OPCODE_SEND for PI messages
|
---|
1792 | - intel/fs: Return early in a couple builtin setup helpers
|
---|
1793 | - intel/compiler: Convert brw_wm_aa_enable to brw_sometimes
|
---|
1794 | - intel/fs: Make per-sample and coarse dispatch tri-state
|
---|
1795 | - intel/compiler: Convert wm_prog_key::persample_interp to a tri-state
|
---|
1796 | - intel/compiler: Convert wm_prog_key::multisample_fbo to a tri-state
|
---|
1797 | - intel/fs/validate: Assert SEND [extended] descriptors are uniform
|
---|
1798 | - intel/fs: Break out yet another FB write helper
|
---|
1799 | - intel/fs: Rework dynamic coarse handling
|
---|
1800 | - nir/deref: Preserve alignments in opt_remove_cast_cast()
|
---|
1801 | - nir/from_ssa: Use more helpers in resolve_parallel_copies
|
---|
1802 | - nir/from_ssa: Only re-locate values that are destinations
|
---|
1803 | - nir/from_ssa: Move the loop bounds check in resolve_parallel_copy
|
---|
1804 | - nir: Add a load/store bit size lowering pass
|
---|
1805 | - intel/nir: Use nir_lower_mem_access_bit_sizes()
|
---|
1806 | - Revert "vk/runtime: turn vk.xml extension requirements into asserts"
|
---|
1807 | - Revert "vk/util: keep track of extension requirements"
|
---|
1808 | - vulkan: Remove unused fields from Extension and ApiVersion
|
---|
1809 | - vulkan: Improve extension parsing
|
---|
1810 | - vulkan: Parse the platform in Extensions.from_xml()
|
---|
1811 | - vulkan: Add a get_all_required() helper
|
---|
1812 | - vulkan: Properly filter entrypoints
|
---|
1813 | - vulkan: Properly filter by api in enum_to_str
|
---|
1814 | - Vulkan: Properly filter structs in vk_cmd_queue_gen
|
---|
1815 | - vulkan: Filter out provisional extensions
|
---|
1816 | - vulkan: Move the features generator to vulkan/util
|
---|
1817 | - vulkan: Properly filter structs in vk_physical_device_features
|
---|
1818 | - vulkan/layers: Use PUBLIC instead of VK_LAYER_EXPORT
|
---|
1819 | - vulkan/device-select-layer: Include vulkan.h
|
---|
1820 | - vulkan: Update the XML and headers to 1.3.241
|
---|
1821 | - nir/lower_io: Handle buffer_array_length for more address modes
|
---|
1822 | - anv: Drop our manual SSBO size handling
|
---|
1823 | - hasvk: Drop our manual SSBO size handling
|
---|
1824 | - panvk: Drop our manual SSBO size handling
|
---|
1825 | - turnip: Set spirv_options::use_deref_buffer_array_length
|
---|
1826 | - lavapipe: Set spirv_options::use_deref_buffer_array_length
|
---|
1827 | - v3dv: Set spirv_options::use_deref_buffer_array_length
|
---|
1828 | - spirv: Always emit deref_buffer_array_length intrinsics
|
---|
1829 | - nir: Check against combined alignment in nir_lower_mem_access_bit_sizes
|
---|
1830 | - nir: Add mode filtering to lower_mem_access_bit_sizes
|
---|
1831 | - nir: Add UBO support to nir_lower_mem_access_bit_sizes
|
---|
1832 | - nir: Add a combined alignment helper
|
---|
1833 | - nir: Rename align to whole_align in lower_mem_load
|
---|
1834 | - nir: Rename nir_mem_access_size_align::align_mul to align
|
---|
1835 | - nir: Make chunk_align_offset const in lower_mem_load()
|
---|
1836 | - nir: Handle wider unaligned loads in lower_mem_access_bit_size
|
---|
1837 | - intel/nir: Limit unaligned loads to vec4
|
---|
1838 | - vulkan/runtime: Rename and document storage image Z range
|
---|
1839 | - intel/blorp: Set array_len for 3D images properly
|
---|
1840 | - isl: Set Depth to array len for 3D storage images
|
---|
1841 | - intel: Use nir_lower_tex_options::lower_index_to_offset
|
---|
1842 | - vulkan: Update XML and headers to 1.3.244
|
---|
1843 | - vulkan: Provide wrappers for VK_EXT_map_memory2 functions
|
---|
1844 | - anv: Limit memory maps to the client-allocated size
|
---|
1845 | - anv: Implement VK_KHR_map_memory2
|
---|
1846 | - intel/isl: Support Yf/Ys/Tile-64 in isl_surf_get_image_offset_sa
|
---|
1847 | - intel/blorp: Drop the TODO file
|
---|
1848 | - docs: Fix Faith's name in relnotes
|
---|
1849 | - nir: Drop a bunch of Authors tags
|
---|
1850 | - spirv: Drop a bunch of Authors tags
|
---|
1851 | - intel: Drop some author comments and update Faith's name
|
---|
1852 | - util,mesa,panfrost: Drop some author tags
|
---|
1853 | - vulkan: vk_android.c should be copyright Intel
|
---|
1854 | - util: Update some copyright tags
|
---|
1855 | - CODEOWNERS: s/jekstrand/gfxstrand
|
---|
1856 | - vulkan,anv,hasvk,radv: Add a common vk_image_usage_to_ahb_usage helper
|
---|
1857 | - vulkan/android: Fix hardware buffer usage flags
|
---|
1858 | - vulkan: Add an ahardware_buffer_format field to vk_image
|
---|
1859 | - anv,hasvk: Set vk_image.ahardware_buffer_format
|
---|
1860 | - radv: Set vk_image.ahardware_buffer_format
|
---|
1861 | - vulkan,anv,hasvk,radv: Unify Android hardware buffer creation
|
---|
1862 | - vulkan: Add a vk_device_memory base struct
|
---|
1863 | - anv: Use the new vk_device_memory base struct
|
---|
1864 | - vulkan: Record pipeline flags in the render pass
|
---|
1865 | - vulkan: Plumb rendering flags through vk_graphics_pipeline_state
|
---|
1866 | - anv/pipeline: Use feedback loop flags for self-dependencies
|
---|
1867 | - hasvk/pipeline: Use feedback loop flags for self-dependencies
|
---|
1868 | - vulkan: Drop vk_render_pass_state::\*self_dependenc*
|
---|
1869 | - vulkan: Drop VkRenderingSelfDependencyInfoMESA
|
---|
1870 |
|
---|
1871 | Felix DeGrood (10):
|
---|
1872 |
|
---|
1873 | - intel/perf: Hide extended metrics by default
|
---|
1874 | - anv: cs_stall during compute state flush on < gen12.5
|
---|
1875 | - anv: only emit CFE_STATE when scratch space increases
|
---|
1876 | - anv: set CFE_STATE.OverDispatchControl to default
|
---|
1877 | - iris: report draw count for perfetto
|
---|
1878 | - anv/blorp: support surf generation for addresses
|
---|
1879 | - anv/blorp: implement anv_cmd_buffer_fill_area
|
---|
1880 | - anv/blorp: add flush reasons to RT flushes
|
---|
1881 | - anv: reset query pools using blorp
|
---|
1882 | - anv: disable reset query pools using blorp opt on MTL
|
---|
1883 |
|
---|
1884 | Filip Gawin (2):
|
---|
1885 |
|
---|
1886 | - crocus: don't quantize the clear value
|
---|
1887 | - nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked
|
---|
1888 |
|
---|
1889 | Francisco Jerez (11):
|
---|
1890 |
|
---|
1891 | - intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics.
|
---|
1892 | - intel/disasm/gfx12+: Use helper instead of hardcoded bit access for 64-bit immediates.
|
---|
1893 | - intel/disasm/gfx12+: Fix print out of non-existing condmod field with 64-bit immediate.
|
---|
1894 | - intel/eu/gfx12+: Implement decoding of 64-bit immediates.
|
---|
1895 | - intel/fs/gfx12+: Drop redundant handling of SHADER_OPCODE_BROADCAST in exec pipe inference.
|
---|
1896 | - intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation.
|
---|
1897 | - intel/eu/gfx8-9: Fix execution with all channels disabled due to HW bug #220160235.
|
---|
1898 | - intel/rt: Fix L3 bank performance bottlenecks due to SW stack stride alignment.
|
---|
1899 | - intel/fs: Track force_writemask_all behavior of copy propagation ACP entries.
|
---|
1900 | - intel/fs: Fix copy propagation dataflow analysis in presence of force_writemask_all ACP overwrites.
|
---|
1901 | - intel/fs: Fix register coalesce in presence of force_writemask_all copy source writes.
|
---|
1902 |
|
---|
1903 | Frank Binns (7):
|
---|
1904 |
|
---|
1905 | - pvr: small cleanups
|
---|
1906 | - pvr: remove start/stop transfer flags
|
---|
1907 | - pvr: stop restricting the compiler to the Sascha Willems triangle demo
|
---|
1908 | - pvr: remove duplicate define
|
---|
1909 | - pvr: initialise size for placeholder "zeroed" shaders
|
---|
1910 | - pvr: replace nop binary shader with run-time compiled shader
|
---|
1911 | - pvr: fix clang-format issue
|
---|
1912 |
|
---|
1913 | Friedrich Vock (26):
|
---|
1914 |
|
---|
1915 | - radv/rt: Divide by the correct workgroup size
|
---|
1916 | - radv/bvh: Prevent NANs when computing node cost
|
---|
1917 | - radv/rmv: Also check the other pid field
|
---|
1918 | - radv/rmv: Avoid more CPU unmap deadlocks
|
---|
1919 | - radv/rmv: Log bo destruction before freeing it
|
---|
1920 | - radv/rmv: Correct timestamp shifting
|
---|
1921 | - vulkan/rmv: Use the timestamp divisor instead of a hardcoded value
|
---|
1922 | - vulkan/rmv: Remove delta parameter from dump helpers
|
---|
1923 | - mesa: Report GL_SHADER_BINARY_FORMAT_SPIR_V as supported
|
---|
1924 | - docs: Fix formatting for RMV tracing docs
|
---|
1925 | - radv: Extend hit attribute lowering for LDS
|
---|
1926 | - radv: Use LDS for closest-hit hit attributes
|
---|
1927 | - radv: Emit RT shader VA user SGPR
|
---|
1928 | - radv/rt: Add shader config combination/postprocessing utils
|
---|
1929 | - radv: Add RT shader stage names for executable properties
|
---|
1930 | - aco: Swap operands for v_and_b32 in RT prolog
|
---|
1931 | - radv/rt: Also adjust the SGPR count in postprocess_rt_config
|
---|
1932 | - aco: Un-swap addressable VGPRs/SGPRs in RT prolog
|
---|
1933 | - radv: Work around use-after-free compiler errors
|
---|
1934 | - radv: Add RT stages to radv_mesa_to_rgp_shader_stages
|
---|
1935 | - radv/rmv: Fix creating RT pipelines
|
---|
1936 | - radv/rmv: Fix import memory
|
---|
1937 | - radv/rt: Plug some memory leaks during shader creation
|
---|
1938 | - radv: Don't leak the RT prolog binary
|
---|
1939 | - radv: Always call si_emit_cache_flush before writing timestamps
|
---|
1940 | - radv: Add driconf to always drain waves before writing timestamps
|
---|
1941 |
|
---|
1942 | GH Cao (1):
|
---|
1943 |
|
---|
1944 | - gallium: Add MCJIT target triplet for Windows ARM64
|
---|
1945 |
|
---|
1946 | Ganesh Belgur Ramachandra (1):
|
---|
1947 |
|
---|
1948 | - ac/nir: fix CDNA image lowering for array textures
|
---|
1949 |
|
---|
1950 | Georg Lehmann (81):
|
---|
1951 |
|
---|
1952 | - Revert "aco: Combine v_cvt_u32_f32 with insert to v_cvt_pk_u8_f32."
|
---|
1953 | - aco: use s_bfm_64 for constant copies
|
---|
1954 | - aco: use s_pack_ll_b32_b16 for constant copies
|
---|
1955 | - aco: Improve wave64 cycle estimates.
|
---|
1956 | - aco: fix imod/omod for gfx11 VOP3 opcodes
|
---|
1957 | - aco: add mov/cndmask opcodes to does_fp_op_flush_denorms
|
---|
1958 | - aco: don't allow output modifiers for v_cvt_pkrtz_f16_f32
|
---|
1959 | - aco: allow output modifiers for ldexp_f16
|
---|
1960 | - aco: don't list imod/omod support v_fmaak_f32/v_fmamk_f32
|
---|
1961 | - aco: support omod/imod for v_fmac_f16
|
---|
1962 | - aco: remove stale TODOs about v_interp opsel
|
---|
1963 | - aco: new 16bit VOP3 opcodes can use opsel
|
---|
1964 | - aco: Don't use vcmpx with DPP.
|
---|
1965 | - aco: combine a ^ ~b and ~(a ^ b) to v_xnor_b32
|
---|
1966 | - amd,nir: remove byte_permute_amd intrinsic
|
---|
1967 | - nir: change 16bit image dest folding option to per type
|
---|
1968 | - amd: don't use d16 for integer loads
|
---|
1969 | - amd: d16 uses rtz conversion for 32bit float
|
---|
1970 | - aco: use v_permlane(x)16_b32 for masked swizzle
|
---|
1971 | - aco/gfx11: use dpp_row_xmask and dpp_row_share
|
---|
1972 | - aco: use and swizzle mask in dpp quad perm
|
---|
1973 | - aco/optimizer_postRA: assume all registers are untrackable in loop headers
|
---|
1974 | - nir/opt_algebraic: add patterns for iand/ior of feq/fneu with 0
|
---|
1975 | - aco: mark mad definition as precise if the mul/add were precise
|
---|
1976 | - aco: use v_fma_mix_f32 for v_fma_f32 with 2 fp16 representable, different literals
|
---|
1977 | - nir/lower_mediump: don't use fp16 for constants if the result is denormal
|
---|
1978 | - aco: treat VINTERP_INREG as VALU
|
---|
1979 | - aco/ir: rework IR to have one common valu instruction struct
|
---|
1980 | - aco/ra: set opsel_hi to zero when converting to VOP2
|
---|
1981 | - aco: validate VALU modifiers
|
---|
1982 | - aco/print_ir: simplify using VALU instruction
|
---|
1983 | - aco/optimizer: simplify using VALU instruction
|
---|
1984 | - aco: remove VOP[123C]P? structs
|
---|
1985 | - aco: add bitfield array helper classes
|
---|
1986 | - aco: use bitfield array helpers for valu modifiers
|
---|
1987 | - aco/assembler/gfx11: simplify 16bit VOP12C promotion to VOP3
|
---|
1988 | - aco/optimizer: don't reallocate instruction when converting to VOP3
|
---|
1989 | - aco: don't reallocate fma{mk,ak,_mix} instruction
|
---|
1990 | - aco: copy abs/neg with assignment
|
---|
1991 | - aco: use integer access for neg_lo/neg_hi
|
---|
1992 | - aco: use array indexing for opsel/opsel_lo/opsel_hi
|
---|
1993 | - aco: access neg/abs as int in usesModifiers
|
---|
1994 | - aco: use bitfield_array for temporary neg/abs/opsel
|
---|
1995 | - nir: optimize i2f(f2i(fsign))
|
---|
1996 | - aco: remove duplicates from .clang-format
|
---|
1997 | - amd: remove duplicate from .clang-format
|
---|
1998 | - aco: don't check usesModifiers for pseudo instructions
|
---|
1999 | - aco: fix p_interp_gfx11 comment
|
---|
2000 | - aco: make .clang-format usable with tests
|
---|
2001 | - aco/ir: fix copy paste bug in convert_to_SDWA
|
---|
2002 | - aco/util: override default assignment operator for bitfield helpers
|
---|
2003 | - aco: clean up to_mad_mix
|
---|
2004 | - aco/ra: don't reallocate VOP3 instruction for non-vcc lane mask
|
---|
2005 | - aco/vn: hash opsel for VOP12C
|
---|
2006 | - aco/assembler: support VOP12C opsel
|
---|
2007 | - aco: validate VOP12C opsel
|
---|
2008 | - aco/to_hw_instr: use VOP1 opsel for v_mov_b16
|
---|
2009 | - aco/ra: prepare for VOP12C opsel
|
---|
2010 | - aco/optimizer: preserve opsel when fusing fma
|
---|
2011 | - aco: handle opsel in combine_comparison_ordering
|
---|
2012 | - aco: handle opsel in combine_ordering_test
|
---|
2013 | - aco: handle opsel in combine_constant_comparison_ordering
|
---|
2014 | - aco: update match_op3_for_vop3 for VOP12C opsel
|
---|
2015 | - aco: support v_cvt_f32_f16 with opsel in combine_mad_mix
|
---|
2016 | - aco: support neg(mul)/abs(mul) optimization in more cases
|
---|
2017 | - aco: return true in usesModifiers for VOP12C with opsel
|
---|
2018 | - aco: swap opsel when swapping VOP2/C operands
|
---|
2019 | - aco/ir: copy opsel when converting to DPP
|
---|
2020 | - aco: don't label mul with opsel as abs/neg
|
---|
2021 | - aco/gfx11: allow opsel for VOP12C
|
---|
2022 | - aco/optimizer: use opsel for VOP12C
|
---|
2023 | - aco: keep label_mul/usedef/minmax in apply_extract
|
---|
2024 | - aco/optimizer: remove to_SDWA
|
---|
2025 | - aco: add tests for fma with opsel
|
---|
2026 | - aco: add tests for dpp with opsel
|
---|
2027 | - aco: add tests for swap operand with opsel
|
---|
2028 | - aco: add tests for cmp ordering with opsel
|
---|
2029 | - aco: add test for min/max combining with opsel
|
---|
2030 | - aco/tests: run optimize.mad_mix.input_conv.modifiers on gfx11
|
---|
2031 | - aco: add tests for neg(mul) with opsel
|
---|
2032 | - aco/tests: add missing dependency on generated header
|
---|
2033 |
|
---|
2034 | Gert Wollny (49):
|
---|
2035 |
|
---|
2036 | - glsl/nir: only set uses_sample_shading when the output is a fbfetch
|
---|
2037 | - nir: Add possibility to store image var offset in range_base
|
---|
2038 | - nir: Add range_base to atomic_counter and an option to use it
|
---|
2039 | - ntt: handle the image intrinsic range_base when translating to TGSI
|
---|
2040 | - ntt: Make use of the range_base offset when translating atomics in NTT
|
---|
2041 | - virgl: lower image variable offsets into the intrinsic range_base value
|
---|
2042 | - virgl: Request setting the atomic offset in the range_base
|
---|
2043 | - virgl: drop the separable flag for cases that can't be handled
|
---|
2044 | - r600/sfn: Fix readport check
|
---|
2045 | - r600/sfn: Do a bit of cleanup with the secondary read port validation
|
---|
2046 | - r600/sfn: Fix opcode and result dest slot mask for variable size dot
|
---|
2047 | - r600/sfn: Fix splitting of multislot alu ops
|
---|
2048 | - virgl: remove unused virgl_encoder_inline_write
|
---|
2049 | - r600/sfn: Use range_base for atomics and images
|
---|
2050 | - r600/sfn: Work around dependency issue when splitting op to group
|
---|
2051 | - r600/sfn: drop useless instr use count
|
---|
2052 | - r600/sfn: Fix a typo
|
---|
2053 | - r600/sfn: Silence warnings about unused parameters
|
---|
2054 | - r600/sfn: Don't copy propagate indirect loads to more than one dest
|
---|
2055 | - r600/sfn: Stop try scheduling in t-slot with empty related v-slot
|
---|
2056 | - r600/sfn: rename texture coordinate offset for clarity
|
---|
2057 | - r600/sfn: address use in group only if instr can be added
|
---|
2058 | - r600/sfn: Forward setting the block ID and index
|
---|
2059 | - r600: Don't start new CF for every fetch through tex clause
|
---|
2060 | - r600/sfn: Fix handling of fetch through texture clause
|
---|
2061 | - r600/sfn: Fix alu trans op flag setup
|
---|
2062 | - r600/sfn: Fix Cayman trans from string and add test for copy prop
|
---|
2063 | - vulkan/wsi: Take Xwayland into account for x11_min_image_count
|
---|
2064 | - zink/kopper: Add extra swapchain images for Venus
|
---|
2065 | - r600/sfn: be more conservative with channel use in multi-slot ops
|
---|
2066 | - r600/sfn: Fix readport cylce map
|
---|
2067 | - r600/sfn: Fix minimum required registers
|
---|
2068 | - r600/sfn: Add AluGroup method to update readport validation from scratch
|
---|
2069 | - r600/sfn: Split AluInstr replace_source into test and actual replace
|
---|
2070 | - r600/sfn: Add method to AluGroup to replace sources
|
---|
2071 | - r600/sfn: Add print method to AluReadportValidation
|
---|
2072 | - r600/sfn: redirect copy propagation to alu parent group
|
---|
2073 | - r600/sfn/tests: Add a test for the copy prop into a group
|
---|
2074 | - r600/sfn: Fix atomic lowering
|
---|
2075 | - virgl: Enable AMD_vertex_shader_(layer|viewport_index) when host supports it
|
---|
2076 | - virgl: Don't try to do re-alloc or readback by transfer for blob resources
|
---|
2077 | - ntt: add option to lower SSBO bindings to buffer index
|
---|
2078 | - virgl: Lower binding start into buffer indices
|
---|
2079 | - r600/sfn: fix container allocators
|
---|
2080 | - r600/sfn: Lower tess levels to vectors in TCS
|
---|
2081 | - r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats
|
---|
2082 | - r600/sfn: assign window_space_position in shader state
|
---|
2083 | - r600/sfn: Ass support for image_samples
|
---|
2084 | - r600/sfn: fix cube to array lowering for LOD
|
---|
2085 |
|
---|
2086 | Giancarlo Devich (25):
|
---|
2087 |
|
---|
2088 | - d3d12: Use varying comparison function for TESS stage key compare
|
---|
2089 | - d3d12: Add unions to encompass shader key stage vars, use in hashing
|
---|
2090 | - nir: Check sampler_binding is valid when lowering tex shadow
|
---|
2091 | - d3d12: Don't clear d3d12_shader_key
|
---|
2092 | - d3d12: Move d3d12_context_state_table_entry to d3d12_resource_state.h
|
---|
2093 | - d3d12: Assign up to 16 simultaneously active contexts unique IDs
|
---|
2094 | - d3d12: Track up to 16 active context resource states locally in d3d12_bo
|
---|
2095 | - d3d12: Don't recompute has_flat_varyings or missing_dual_src_outputs
|
---|
2096 | - d3d12: Track max varying slot, set and compare less bytes
|
---|
2097 | - d3d12: Don't unnecessarily zero out gs/tcs keys
|
---|
2098 | - d3d12: Don't memcmp gs/tcs keys
|
---|
2099 | - d3d12: Create varying structures as necessary, reference them
|
---|
2100 | - d3d12: Don't loop in update_draw_indirect_with_sysvals
|
---|
2101 | - d3d12: Compare shader keys with a switch, instead of cascading if's
|
---|
2102 | - d3d12: Compare shader keys with union-encompassing fields all at once
|
---|
2103 | - d3d12: Compare shader key common parts with memcmp, instead of if's
|
---|
2104 | - d3d12: Cache varying info to reduce compare/copy cost
|
---|
2105 | - d3d12: Use memcmp for full tcs/gs variant keys
|
---|
2106 | - d3d12: Track up to 16 contexts worth of pending barriers locally in bos
|
---|
2107 | - d3d12: Don't unnecessarily recompute manual_depth_range
|
---|
2108 | - d3d12: Use context-level sampler_state array for filling shader keys
|
---|
2109 | - d3d12: Use short circuit in shader key compare; update key hash
|
---|
2110 | - d3d12: Reduce gs variant key init cost; unnecessary validate gs calls
|
---|
2111 | - d3d12: Unroll shader variant selection loop
|
---|
2112 | - d3d12: Track up to 16 contexts worth of batch references locally in bos
|
---|
2113 |
|
---|
2114 | Guilherme Gallo (24):
|
---|
2115 |
|
---|
2116 | - radeonsi/ci: Update stoney test expectations
|
---|
2117 | - radeonsi/ci: Skip slow traces on raven
|
---|
2118 | - Revert "ci: disable Collabora's LAVA lab for maintance"
|
---|
2119 | - ci/lava: Move LAVA dependencies to pip
|
---|
2120 | - ci/lava: Add LavaFarm class to find LAVA farm from runner tag
|
---|
2121 | - ci/lava: Fix LAVA logs issues for Collabora jobs
|
---|
2122 | - ci: Upload debian-release artifact to S3
|
---|
2123 | - ci: Create debian-arm64-release job
|
---|
2124 | - ci: Use release builds in perf jobs
|
---|
2125 | - ci: Use workflow to make CI aware of performance jobs
|
---|
2126 | - ci: Reuse MESA_CI_PERFORMANCE_ENABLED in performance-rules
|
---|
2127 | - ci: Handle carriage return characters in LAVA logs
|
---|
2128 | - ci: Fix release build use for performance jobs
|
---|
2129 | - ci/baremetal: Wrap artifact download curl with xtrace
|
---|
2130 | - ci: Improve piglit-traces "no-perf" filter
|
---|
2131 | - ci: Fix freedreno-rules-performance
|
---|
2132 | - ci: Add piglit traces hidden jobs
|
---|
2133 | - ci/freedreno: create a618-traces and perf jobs
|
---|
2134 | - ci/zink: Add zink-a618 trace jobs
|
---|
2135 | - ci/zink: Add zink-turnip-manual-rules
|
---|
2136 | - ci/zink: Add zink-tu-a618-traces-performance job
|
---|
2137 | - ci/zink: Fix zink-tu-a618-traces perf job rules
|
---|
2138 | - ci/zink: Reduce zink-tu-a618-traces parallelism
|
---|
2139 | - Revert "ci: disable Collabora's LAVA lab for maintance"
|
---|
2140 |
|
---|
2141 | Hampus Linander (4):
|
---|
2142 |
|
---|
2143 | - nir: Add extr_agx opcode
|
---|
2144 | - agx: Add extr instruction to AGX backend
|
---|
2145 | - agx: Use AGX extr for tex lowering
|
---|
2146 | - agx: Optimize lower_resinfo for cube maps
|
---|
2147 |
|
---|
2148 | Hans-Kristian Arntzen (13):
|
---|
2149 |
|
---|
2150 | - radv: Fix invalid 64-bit shift.
|
---|
2151 | - radv: Fix missing VK_ACCESS_2_SHADER_SAMPLED_READ_BIT.
|
---|
2152 | - radv: Implement VK_ACCESS_2_DESCRIPTOR_BUFFER_READ_BIT_EXT.
|
---|
2153 | - wsi/common: Add common implementation of vkReleaseSwapchainImagesEXT.
|
---|
2154 | - wsi/x11: Implement EXT_swapchain_maintenance1.
|
---|
2155 | - wsi/common: Implement swapchain present fence.
|
---|
2156 | - wsi/common: Add comment about DEFERRED_ALLOCATION_BIT_EXT.
|
---|
2157 | - wsi/common: Add function to modify present mode.
|
---|
2158 | - wsi/wayland: Implement EXT_swapchain_maintenance1.
|
---|
2159 | - wsi/display: Implement EXT_swapchain_maintenance1.
|
---|
2160 | - wsi/win32: Implement VK_EXT_swapchain_maintenance1.
|
---|
2161 | - radv: Expose VK_EXT_swapchain_maintenance1.
|
---|
2162 | - wsi/x11: Fix present ID signal when IDLE comes before COMPLETE.
|
---|
2163 |
|
---|
2164 | Harri Nieminen (6):
|
---|
2165 |
|
---|
2166 | - docs/specs: Fix typos
|
---|
2167 | - docs/gallium: Fix typos
|
---|
2168 | - docs/freedreno: Fix typos
|
---|
2169 | - docs/panfrost: Fix typo
|
---|
2170 | - docs/svga3d: Fix typo
|
---|
2171 | - bin: Fix typos
|
---|
2172 |
|
---|
2173 | Helen Koike (12):
|
---|
2174 |
|
---|
2175 | - ci/debian-android: move pkgconfig paths to the cross file
|
---|
2176 | - ci: move patches to patches directory
|
---|
2177 | - android: allow system = 'android' on cross file
|
---|
2178 | - ci/android: move sdk version and ndk to a job variable
|
---|
2179 | - ci: compile deqp for android
|
---|
2180 | - ci: compile deqp-runner for android
|
---|
2181 | - ci: debian-android compile virgl
|
---|
2182 | - ci: export artifacts from debian-android
|
---|
2183 | - ci/android: add android to the ci
|
---|
2184 | - android/ci: fix removal of inexistent file
|
---|
2185 | - android/ci: Fix call to adb
|
---|
2186 | - android/ci: raise error on script when not related to the tests
|
---|
2187 |
|
---|
2188 | Hyunjun Ko (1):
|
---|
2189 |
|
---|
2190 | - vulkan/runtime: match the spec when taking pipeline subsets.
|
---|
2191 |
|
---|
2192 | Iago Toral Quiroga (20):
|
---|
2193 |
|
---|
2194 | - broadcom/compiler: produce better code for f2f16 with RTZ rounding
|
---|
2195 | - v3dv: add paths to handle partial copies of linear images
|
---|
2196 | - v3dv: drop unused field from v3dv_cmd_buffer
|
---|
2197 | - v3dv: increase BO allocation size when growing CLs
|
---|
2198 | - v3dv: ensure we allocate at least the requested space for a CL
|
---|
2199 | - v3dv: add a cl_advance_and_end helper
|
---|
2200 | - v3dv: ensure at least V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction
|
---|
2201 | - v3dv: ensure we apply binning syncs to secondary command buffers
|
---|
2202 | - v3dv: fix stencil view aspect selection of depth/stencil image
|
---|
2203 | - v3d: support r{g,gba}16f formats for vertex buffers
|
---|
2204 | - broadcom/compiler: track pending ldtmu count with each TMU lookup
|
---|
2205 | - v3dv: pause occlusion queries during vkCmdClearAttachments
|
---|
2206 | - v3dv: fix format swizzle for buffer views
|
---|
2207 | - v3dv: drop unused parameter
|
---|
2208 | - v3dv: always acquire display device before checking if we can present
|
---|
2209 | - vulkan/wsi/display: set pDisplay to NULL on error
|
---|
2210 | - v3d,v3dv: stop trying to force 16-bit TMU output for shadow comparisons
|
---|
2211 | - broadcom/compiler: fix v3d_qpu_uses_sfu
|
---|
2212 | - broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper
|
---|
2213 | - broadcom/compiler: fix incorrect check for SFU op
|
---|
2214 |
|
---|
2215 | Ian Romanick (60):
|
---|
2216 |
|
---|
2217 | - ntt: Add support for fcsel_gt and fcsel_ge opcodes
|
---|
2218 | - nir/lower_int_to_float: Add support for i32csel opcodes
|
---|
2219 | - r300: Enable generation of fcsel_gt and fcsel_ge opcodes
|
---|
2220 | - i915: Enable generation of fcsel_gt and fcsel_ge opcodes
|
---|
2221 | - gallium/draw: Enable aapoint NIR helpers to generate bool1, bool32, or float32 Booleans
|
---|
2222 | - gallium/draw: Enable polygon stipple NIR helpers to generate bool1 or bool32 Booleans
|
---|
2223 | - nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper)
|
---|
2224 | - nir/builder: Handle f2b conversions specially in nir_type_convert
|
---|
2225 | - nir: Eliminate nir_op_f2b
|
---|
2226 | - lavapipe: Fix bad array index scale factor in lvp_inline_uniforms pass
|
---|
2227 | - lavapipe: Only check NULL pointers in one place in src_only_uses_uniforms
|
---|
2228 | - nir/inline_uniforms: Change num_offsets type to uint8_t
|
---|
2229 | - nir/inline_uniforms: Pass max_num_bo and max_offset around as parameters
|
---|
2230 | - nir/inline_uniforms: Allow possibility of more than one UBO
|
---|
2231 | - nir/inline_uniforms: Allow possibility of uni_offsets and num_offsets being NULL
|
---|
2232 | - nir/inline_uniforms: Make src_only_uses_uniforms public, change name
|
---|
2233 | - nir/inline_uniforms: Make add_inlinable_uniforms public
|
---|
2234 | - nir/inline_uniforms: Add inot condition support
|
---|
2235 | - nir/tests: Don't unconditionally log shaders from this one CF test
|
---|
2236 | - nir/tests: Refactor creation of loops for loop_analyze test cases
|
---|
2237 | - nir/tests: Add tests for "inverted" loops
|
---|
2238 | - nir/tests: Add tests for nir_loop_info::induction_vars tracking
|
---|
2239 | - nir/loop_analyze: Track induction variables with uniform increments
|
---|
2240 | - nir/loop_analyze: Use nir_loop_variable::update_src instead of nir_basic_induction_var::alu
|
---|
2241 | - nir/loop_analyze: Use nir_loop_variable::init_src instead of nir_basic_induction_var::def_outside_loop
|
---|
2242 | - nir/loop_analyze: Eliminate nir_basic_induction_var
|
---|
2243 | - nir/loop_analyze: Track induction variables with uniform initializer
|
---|
2244 | - nir/loop_analyze: Simplify some logic in compute_induction_information
|
---|
2245 | - nir: ifind_msb_rev can only have int32 sources
|
---|
2246 | - intel/compiler: Lower find_lsb in NIR
|
---|
2247 | - nir: intel/compiler: Move ifind_msb lowering to NIR
|
---|
2248 | - intel/compiler: Tighter src and dest size bounds checking for some opcodes
|
---|
2249 | - nir/algebraic: Only lower ufind_msb with 32-bit sources
|
---|
2250 | - nir: intel/compiler: Move ufind_msb lowering to NIR
|
---|
2251 | - nir/builder: Do not generate 8- or 16-bit find_msb
|
---|
2252 | - nir/algebraic: Do not generate 8- or 16-bit find_msb
|
---|
2253 | - nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources
|
---|
2254 | - nir/algebraic: Optimize some ifind_msb to ufind_msb
|
---|
2255 | - nir/lower_int64: Optionally lower ufind_msb using uadd_sat
|
---|
2256 | - intel/fs: Don't copy propagate from saturate to sel
|
---|
2257 | - nir/algebraic: Undistribute fsat from fmax
|
---|
2258 | - intel/fs: Output opt_combine_constants debug to stderr
|
---|
2259 | - intel/fs: Refactor part of opt_combine_constants to a separate function
|
---|
2260 | - intel/fs: Rework the loop of opt_combine_constants that collects constants
|
---|
2261 | - intel/compiler: Remove one overload of backend_instruction::insert_before
|
---|
2262 | - intel/compiler: Use NIR_PASS instead of NIR_PASS_V
|
---|
2263 | - intel/compiler: Micro optimize inst_is_in_block
|
---|
2264 | - intel/fs: Use specialized version of regions_overlap in opt_copy_propagation
|
---|
2265 | - intel/compiler: Micro optimize regions_overlap
|
---|
2266 | - intel/fs: Linked list micro optimizations in brw_nir_move_interpolation_to_top
|
---|
2267 | - intel/fs: Preserve meta data more often in brw_nir_move_interpolation_to_top
|
---|
2268 | - intel/fs: White space fixes
|
---|
2269 | - nir/tests: Add many loop analysis tests for induction vars updated by shifts
|
---|
2270 | - nir/tests: Add more loop analysis tests for induction vars updated by shifts
|
---|
2271 | - nir/tests: Add many loop analysis tests for induction variables modified by imul
|
---|
2272 | - nir/loop_analyze: Add a function to evaluate an ALU as constant
|
---|
2273 | - nir/loop_analyze: Track induction variable basis information
|
---|
2274 | - nir/loop_analyze: Change invert_cond instead of changing the condition
|
---|
2275 | - nir/loop_analyze: Use try_eval_const_alu and induction variable basis info
|
---|
2276 | - nir/tests: Port almost all loop_analyze tests to new macro-based infastructure
|
---|
2277 |
|
---|
2278 | Ikshwaku Chauhan (1):
|
---|
2279 |
|
---|
2280 | - radeonsi: Fix distortion for yuv422 format for GFX10.
|
---|
2281 |
|
---|
2282 | Illia Abernikhin (1):
|
---|
2283 |
|
---|
2284 | - util: Extend vk_enum_to_str with bitmasks vk_enum_to_str only generates literals for enums with type: @type="enum", but many enums have type: @type="bitmask" and were not taken into account here.
|
---|
2285 |
|
---|
2286 | Illia Polishchuk (4):
|
---|
2287 |
|
---|
2288 | - ANV: Add extra memory types for ANV driver instead of a single one
|
---|
2289 | - hasvk: Add extra memory types for hasvk driver instead of a single one
|
---|
2290 | - nir: Add sha1 hash for nir shaders converted from spir-v
|
---|
2291 | - glx: fix indirect initialization crash
|
---|
2292 |
|
---|
2293 | Ilya K (1):
|
---|
2294 |
|
---|
2295 | - intel/vk/grl: don't install libgrl.a
|
---|
2296 |
|
---|
2297 | Isaac Bosompem (1):
|
---|
2298 |
|
---|
2299 | - tool/pps: Fix 32-bit build issue with format string
|
---|
2300 |
|
---|
2301 | Isabella Basso (5):
|
---|
2302 |
|
---|
2303 | - nir/algebraic: insert patterns inside optimizations list
|
---|
2304 | - nir/algebraic: extend mediump patterns
|
---|
2305 | - nir/algebraic: extend lowering patterns for conversions on smaller bit sizes
|
---|
2306 | - nir/algebraic: make patterns for float conversion lowerings imprecise
|
---|
2307 | - nir/algebraic: remove duplicate bool conversion lowerings
|
---|
2308 |
|
---|
2309 | Italo Nicola (10):
|
---|
2310 |
|
---|
2311 | - panfrost: fix off-by-one when exporting format modifiers
|
---|
2312 | - panfrost: fix tiny sample_positions BO memory leak
|
---|
2313 | - hud: use defines for default scale/rotation/visibility values
|
---|
2314 | - hud: add GALLIUM_HUD_OPACITY envvar
|
---|
2315 | - panfrost: fix strict-aliasing violations when packing fb ptrs
|
---|
2316 | - etnaviv: abort() instead of assert(0) on compiler error
|
---|
2317 | - etnaviv: use stderr for compiler error logging
|
---|
2318 | - etnaviv: add default clear_buffer and clear_texture APIS
|
---|
2319 | - etnaviv: lower (un)pack_{2x16,2x32}_split and extract_{byte,word}
|
---|
2320 | - etnaviv: implement nir_op_uclz and lower find_{msb,lsb} to uclz
|
---|
2321 |
|
---|
2322 | Iván Briano (7):
|
---|
2323 |
|
---|
2324 | - anv: uncompressed views of compressed 3d images are now valid
|
---|
2325 | - vulkan: track the right value on CmdSetColorWriteMasks
|
---|
2326 | - anv: fix testing for dynamic color blend bits
|
---|
2327 | - anv: stop tracking color blend state in the pipeline
|
---|
2328 | - anv: use the parameter passed to the macro
|
---|
2329 | - intel/fs: handle interpolation modes for at_sample and at_offset too
|
---|
2330 | - vulkan/wsi/display: do not dereference a NULL pointer
|
---|
2331 |
|
---|
2332 | Jakub Kulík (1):
|
---|
2333 |
|
---|
2334 | - mesa: Fix format transform on big endian platforms
|
---|
2335 |
|
---|
2336 | Jan Beich (1):
|
---|
2337 |
|
---|
2338 | - util/u_process: implement util_get_command_line for BSDs
|
---|
2339 |
|
---|
2340 | Janne Grunau (1):
|
---|
2341 |
|
---|
2342 | - asahi: Fix typo in debug/error message helper macro
|
---|
2343 |
|
---|
2344 | Jarred Davies (16):
|
---|
2345 |
|
---|
2346 | - pvr: Use common queue submit implementation
|
---|
2347 | - pvr: Add support for VK_KHR_timeline_semaphore
|
---|
2348 | - pvr: Enable threaded submit when supported
|
---|
2349 | - pvr: Clear wait syncs after job submission
|
---|
2350 | - pvr: Don't update fragment signal sync when fragment stage is disabled
|
---|
2351 | - pvr: Fix segfaults when pDepthStencilAttachment is NULL
|
---|
2352 | - pvr: Generate EOT program at runtime
|
---|
2353 | - pvr: Generate dummy emit for renders without any emits
|
---|
2354 | - pvr: Add support for multiple emits from EOT program
|
---|
2355 | - pvr: Select a single aspect format for the texture state of DS image views
|
---|
2356 | - pvr: Add initial support for VK_FORMAT_S8_UINT
|
---|
2357 | - pvr: Don't allocate/upload 0 size coeff programs
|
---|
2358 | - pvr: Always mark robustBufferAccess as supported
|
---|
2359 | - pvr: Rename pvr_xgl_pds.c to pvr_pipeline_pds.c
|
---|
2360 | - pvr: Add robustness buffer support
|
---|
2361 | - pvr: Mark all normalized formats as supporting with_packed_usc_channel
|
---|
2362 |
|
---|
2363 | Jesse Natalie (224):
|
---|
2364 |
|
---|
2365 | - ci/windows: Download updated WARP 1.0.4 package
|
---|
2366 | - dzn/ci: Remove flakes/fails that don't hit anymore
|
---|
2367 | - dzn/ci: Add image test group, which is all passing now
|
---|
2368 | - dzn: Fix clear bind flag logic
|
---|
2369 | - microsoft/compiler: Lower pack_[u/s]norm_2x16
|
---|
2370 | - microsoft/compiler: Implement texture sample count query
|
---|
2371 | - microsoft/compiler: Remove arrays when testing for structs in I/O
|
---|
2372 | - microsoft/compiler: Always emit float types in the I/O signature for structs
|
---|
2373 | - microsoft/compiler: Re-work the logic for adding SV_SampleIndex to force sample-rate
|
---|
2374 | - microsoft/compiler: Use nir info.fs.uses_sample_shading to force sample-rate
|
---|
2375 | - microsoft/compiler: Set num_components to 4 when updating pos write instructions
|
---|
2376 | - spirv2dxil: For removing unused vars, consider the whole I/O var size
|
---|
2377 | - spirv2dxil: When removing unused inputs, make sure they're actually inputs
|
---|
2378 | - spirv2dxil: Allow killing position as an undef varying
|
---|
2379 | - spirv2dxil: Replace not-provided inputs with zero instead of undef
|
---|
2380 | - dzn: Get options13
|
---|
2381 | - dzn: Support alpha blend factor
|
---|
2382 | - dzn: When changing root signature, dirty descriptors too
|
---|
2383 | - dzn: Use R24G8_TYPELESS for 24/8 depth resources
|
---|
2384 | - dzn: Support int border colors
|
---|
2385 | - dzn: Storage buffer sizes need to be 4-byte-aligned
|
---|
2386 | - dzn: Set MultisampleEnable to enable MSAA lines
|
---|
2387 | - dzn: Use typeless format for creation of depth-only or stencil-only D24S8
|
---|
2388 | - dzn: Define a symbol that was present in older D3D headers
|
---|
2389 | - dzn: Support root signature 1.2
|
---|
2390 | - dzn: Support unnormalized coordinate samplers
|
---|
2391 | - dzn: Always align cached pipeline header size to input element align
|
---|
2392 | - dzn: Add a zeroed zsa state when depth or raster is disabled
|
---|
2393 | - dzn: Disable depth when the rasterizer is disabled due to no position output
|
---|
2394 | - dzn: Fix format support checks for storage/uniform texel buffers
|
---|
2395 | - dzn: Remove cmdbuf query 'wait' list
|
---|
2396 | - microsoft/compiler: Delete incorrect implementation for load_layer_id
|
---|
2397 | - microsoft/compiler: Subpass textures are supposed to be arrays
|
---|
2398 | - microsoft/compiler: Delete stale TODO comment
|
---|
2399 | - microsoft/compiler: Support view instancing
|
---|
2400 | - spirv2dxil: Pass runtime conf struct to lower_shader_system_values
|
---|
2401 | - spirv2dxil: Implement lowering for multiview
|
---|
2402 | - spirv2dxil: Claim multiview support
|
---|
2403 | - dzn: Put nir compilation options in a struct
|
---|
2404 | - dzn: Handle multiview pipeline creation
|
---|
2405 | - dzn: Handle draws and clears for multiview rendering
|
---|
2406 | - dzn: Implement multiview queries
|
---|
2407 | - dzn: Enable multiview
|
---|
2408 | - dzn: Enable independent blending
|
---|
2409 | - dzn: Delete an unnecessary assert
|
---|
2410 | - dzn: Rework meta blit VS
|
---|
2411 | - microsoft/compiler: Add an overload param to unary function helpers
|
---|
2412 | - microsoft/compiler: Implement a few basic wave/subgroup intrinsics
|
---|
2413 | - microsoft/compiler: Add lowering passes for basic subgroup vars
|
---|
2414 | - spirv2dxil: Use 32-bit shared offsets
|
---|
2415 | - spirv2dxil: Support basic subgroups
|
---|
2416 | - dzn: Support basic subgroups
|
---|
2417 | - microsoft/compiler: Fix atomic image umax
|
---|
2418 | - microsoft/compiler: Lower device index to zero
|
---|
2419 | - spirv2dxil: Support dispatches with base group indices
|
---|
2420 | - dzn: Support vkCmdDispatchBase
|
---|
2421 | - dzn: Use common physical device list/enumeration helpers
|
---|
2422 | - dzn: Respect suspending/resuming flags to omit clears/resolves
|
---|
2423 | - dzn: Set dynamic rendering caps
|
---|
2424 | - dzn: When rendering to 3D, don't treat layers as subresources for barriers
|
---|
2425 | - dzn: Move patched vertex buffer capability check up a level
|
---|
2426 | - dzn: Use SHADER_LOAD to indicate SAMPLED_IMAGE support
|
---|
2427 | - dzn: Usage image view usage instead of image usage
|
---|
2428 | - dzn: Support EXTENDED_USAGE bit
|
---|
2429 | - dzn: Usage MULTISAMPLE_LOAD support instead of RT/DS support for MSAA
|
---|
2430 | - dzn: Descriptor limits are based on binding tier, not heap tier
|
---|
2431 | - dzn: A single sampler descriptor set needs to support 1024 samplers
|
---|
2432 | - dzn: Don't expose variable pointers
|
---|
2433 | - dzn: Fix independent blend check
|
---|
2434 | - dzn: Enable Vulkan 1.1
|
---|
2435 | - microsoft/compiler: Don't emit threadgroup barriers for graphics shaders
|
---|
2436 | - microsoft/compiler: Handle i2i1 and u2u1
|
---|
2437 | - microsoft/compiler: Handle i1 overloads
|
---|
2438 | - microsoft/compiler: Implement more wave/quad ops
|
---|
2439 | - microsoft/compiler: Support emitting the SM6.6 wave size tag
|
---|
2440 | - spirv2dxil: Lower some wave op properties
|
---|
2441 | - spirv2dxil: Support subgroup SPIR-V caps
|
---|
2442 | - dzn: Support more subgroup/quad ops
|
---|
2443 | - dzn: Implement subgroup size control extension
|
---|
2444 | - dzn: Use core feature matching logic instead of rolling our own
|
---|
2445 | - microsoft/compiler: Support float controls
|
---|
2446 | - dzn: Fix dynamic rendering clear load op for non-multiview
|
---|
2447 | - dzn: Handle separate stencil usage
|
---|
2448 | - dzn: Cache GPUVA for buffers
|
---|
2449 | - dzn: Support float control
|
---|
2450 | - dzn: Always do clears with copies on non-graphics queues
|
---|
2451 | - dzn: Enhanced barriers fixes/workarounds
|
---|
2452 | - dzn: Ensure we don't mix DSV+simultaneous-access
|
---|
2453 | - dzn: Support Vulkan 1.2
|
---|
2454 | - dzn: Fix Windows WSI
|
---|
2455 | - dzn: Don't recursively lock the physical device enum mutex
|
---|
2456 | - dzn: Report as a software device for non-Windows
|
---|
2457 | - CI/windows: Don't limit deqp-runner to 4 jobs
|
---|
2458 | - CI/windows: Apply CI_FDO_CONCURRENT to piglit too
|
---|
2459 | - dzn: Consider linked shaders when computing DXIL hash
|
---|
2460 | - wsi/win32: Always use non-SRGB formats for DXGI
|
---|
2461 | - wsi/win32: Use app-provided timeout instead of arbitrary hardcoded value
|
---|
2462 | - CI: Lima farm is offline
|
---|
2463 | - dzn, driconf: Add a driconf entry for NMS to claim wide line support
|
---|
2464 | - vulkan/wsi: Add a wsi_device param to get_present_modes
|
---|
2465 | - vulkan/wsi/win32: Support tearing (immediate) and VSync (FIFO) present modes
|
---|
2466 | - wsi/win32: Don't require buffer blits for software drivers
|
---|
2467 | - wsi/win32: We don't need a window DC for DXGI
|
---|
2468 | - clc: Include opencl-c-base.h with LLVM 15 (using builtins)
|
---|
2469 | - microsoft/clc: Set features that are used by CL tests
|
---|
2470 | - ci/windows: Update LLVM to 15
|
---|
2471 | - nir: Add alignment to load_push_constant
|
---|
2472 | - nir_lower_fp16_casts: Allow opting out of lowering certain rounding modes
|
---|
2473 | - microsoft/compiler: Handle struct consts in DXIL module dumper
|
---|
2474 | - microsoft/compiler: Handle frcp for float16/float64
|
---|
2475 | - microsoft/compiler: Ensure native_low_precision is set for 16-bit bitcasts/stores
|
---|
2476 | - microsoft/compiler: Handle undef-rounding f2f16 as rtz
|
---|
2477 | - microsoft/compiler: Move unaligned load/store pass from CL
|
---|
2478 | - microsoft/compiler: Pass deref modes to unaligned pass and handle push const
|
---|
2479 | - microsoft/compiler: Simplify bitpacking for load/store lowering with nir_extract_bits
|
---|
2480 | - microsoft/compiler: Pass an alignment to constant buffer load lowering
|
---|
2481 | - microsoft/compiler: Handle 48-bit stores to SSBO/shared
|
---|
2482 | - microsoft/compiler: Support raw buffer load/store intrinsics with 16bit alignment
|
---|
2483 | - microsoft/compiler: Support lowering SSBO accesses to 16bit vectors
|
---|
2484 | - spirv2dxil: Set min UBO/SSBO alignments
|
---|
2485 | - spirv2dxil: Lower unaligned loads and stores
|
---|
2486 | - spirv2dxil: Move shader model into runtime conf struct
|
---|
2487 | - spirv2dxil: Support 16bit types
|
---|
2488 | - dzn: Enable get_surface_capabilities2
|
---|
2489 | - dzn: Delete unused extensions table
|
---|
2490 | - dzn: Get options4
|
---|
2491 | - dzn: Enable 16bit types when supported
|
---|
2492 | - dzn: Enable KHR_storage_buffer_storage_class
|
---|
2493 | - vulkan/wsi: Fix Windows build
|
---|
2494 | - radv: Fix returning an expression from a void function
|
---|
2495 | - Revert "CI: Disable Windows runners"
|
---|
2496 | - nir: Propagate alignment when rematerializing cast derefs
|
---|
2497 | - microsoft/compiler: Implement wave reduce/exclusive scan ops that are supported
|
---|
2498 | - microsoft/compiler: Add a lowering pass for scan ops that aren't supported
|
---|
2499 | - spirv2dxil: Handle arithmetic subgroup ops
|
---|
2500 | - dzn: Claim the arithmetic subgroup bit
|
---|
2501 | - ci/windows: Update warp to 1.0.5
|
---|
2502 | - microsoft/compiler: Handle writable buffer UAV size queries
|
---|
2503 | - d3d12: Report correct texel buffer max size
|
---|
2504 | - d3d12: Fix buffer SRV/UAV creation
|
---|
2505 | - d3d12: Remove now-unused UAV format from shader info
|
---|
2506 | - microsoft/compiler: Fix setting bit 31 in feature flags
|
---|
2507 | - microsoft/compiler: Only set typed UAV load feature bit for multi-comp loads
|
---|
2508 | - microsoft/compiler: Refactor type -> resource kind helper
|
---|
2509 | - microsoft/compiler: Add helpers for getting res_props structs
|
---|
2510 | - microsoft/compiler: Split handle annotation into two parts
|
---|
2511 | - microsoft/compiler: Handle "bindless" image/tex sources as heap indices
|
---|
2512 | - microsoft/compiler: Support descriptor heap indexing for UBO/SSBO
|
---|
2513 | - microsoft/compiler: Use store_dest instead of store_dest_value more
|
---|
2514 | - microsoft/compiler: Update header docs for binding modes supported by compiler
|
---|
2515 | - spirv2dxil: Add a pass to lower deref tex/image and vulkan ubo/ssbo to bindless
|
---|
2516 | - spirv2dxil: Only lower readonly images to SRVs when the option is set
|
---|
2517 | - spirv2dxil: Support descriptor indexing capabilities
|
---|
2518 | - dzn: Remove device pointers from descriptor heaps
|
---|
2519 | - dzn: Remove descriptor heap type from descriptor heap wrapper
|
---|
2520 | - dzn: Fix a leak in descriptor set layout creation
|
---|
2521 | - dzn: Add some docs around descriptor sets and remove redundant/unused data
|
---|
2522 | - dzn: Put UAVs first for storage images/buffers in descriptor tables
|
---|
2523 | - dzn: Consistently order depth formats before stencil
|
---|
2524 | - dzn: Don't use plane slice 1 for depth+stencil SRVs
|
---|
2525 | - dzn: Set up SRV descs for 3D textures correctly
|
---|
2526 | - dzn: Skip setting up UAVs for depth resources
|
---|
2527 | - dzn: Add initial bindless infrastructure
|
---|
2528 | - dzn: When bindless, only allocate one descriptor per layout entry
|
---|
2529 | - dzn: Remove defragmenting of descriptor pools
|
---|
2530 | - dzn: Delete unused function
|
---|
2531 | - dzn: Allocate descriptor sets in buffers for bindless mode
|
---|
2532 | - dzn: Don't dirty bindings if root signature doesn't change
|
---|
2533 | - dzn: Use separate dirty bits for descriptor sets/dynamic buffers
|
---|
2534 | - dzn: Bind buffers for bindless descriptor sets
|
---|
2535 | - dzn: Add a binding classification in the pipeline layout remapping
|
---|
2536 | - dzn: When binding a bindless root signature, bind descriptor heaps first
|
---|
2537 | - dzn: Ensure root signatures are re-bound after a meta op
|
---|
2538 | - dzn: Only bind descriptor sets up to the used amount of the current layout
|
---|
2539 | - dzn: Apply bindless lowering when compiling pipelines
|
---|
2540 | - dzn: Add a debug option for enabling bindless mode
|
---|
2541 | - dzn: Support descriptor indexing via bindless
|
---|
2542 | - dzn: Enable variable size bindings
|
---|
2543 | - dzn: Use mesa_loge for DXIL validation errors
|
---|
2544 | - microsoft/clc: Add shader model / validator to compiler API
|
---|
2545 | - d3d12: Move forward-front-face pass to common DXIL code
|
---|
2546 | - spirv2dxil: Expose yz flip pass to external callers
|
---|
2547 | - dzn: Add a helper to generate triangle->point GS
|
---|
2548 | - dzn: Handle polygon point mode
|
---|
2549 | - dzn: Claim fillModeNonSolid
|
---|
2550 | - CI/windows: Update headers and Agility redist to 1.710.0-preview
|
---|
2551 | - CI/windows: Increase timeout for build container job
|
---|
2552 | - microsoft/compiler: Fix 8-bit loads and stores when supporting 16-bit DXIL
|
---|
2553 | - microsoft/compiler: Fix barrier for wave ID computation
|
---|
2554 | - microsoft/compiler: Assign 1D wave IDs based on local thread ID
|
---|
2555 | - microsoft/compiler: Fix large shifts
|
---|
2556 | - spirv2dxil: Add some more supported caps
|
---|
2557 | - dzn: Add a driconf entry for enabling 8bit loads and stores
|
---|
2558 | - dzn: Add a driconf option for enabling subgroup ops in VS/GS
|
---|
2559 | - dzn: Fix SRV barrier state on compute command lists
|
---|
2560 | - dzn: Raise max number of descriptor sets to 8
|
---|
2561 | - dzn: Report some more caps correctly that are supported
|
---|
2562 | - dzn: Align descriptor sets in the bindless buffer
|
---|
2563 | - dzn: Ensure pipeline variants are used for dynamic stencil masks
|
---|
2564 | - dzn: Don't use write-combine memory for cache-coherent UMA
|
---|
2565 | - dzn: Ensure buffer offsets are aligned
|
---|
2566 | - dzn: Attempt to force depth write states for depth access in LAYOUT_GENERIC
|
---|
2567 | - dzn: Don't do initial-layout barriers for simultaneous-access resources
|
---|
2568 | - dzn: Batch command lists together
|
---|
2569 | - dzn: Fix bindless descriptor sets with multiple dynamic buffers that need custom descriptors
|
---|
2570 | - dzn: Early-out on no-op barriers
|
---|
2571 | - dzn: Clean up ABI helpers now that we require DirectX-Headers 606
|
---|
2572 | - dzn: Use GetResourceAllocationInfo3 for castable formats
|
---|
2573 | - dzn: Don't leave deleted physical devices in the instance pdev list
|
---|
2574 | - dzn: Remove skips now that WARP is faster
|
---|
2575 | - dzn: Support >2K samplers with bindless
|
---|
2576 | - dzn: Remove xfail for test that passes (if run)
|
---|
2577 | - microsoft/compiler: Don't split loads/stores that will be split by lower_explicit_io
|
---|
2578 | - dzn: Changes to descriptor set dirty flag handling
|
---|
2579 | - dzn: Use a linear allocator for upload data on command buffers
|
---|
2580 | - dzn: Ignore unnormalized sampling flag if driver doesn't support it
|
---|
2581 | - dzn: Never set STATE_RENDER_TARGET on a compute command list
|
---|
2582 | - dzn: Don't enable bindless by default
|
---|
2583 | - d3d12: Support creating PSOs with no attachments with MSAA without TIR
|
---|
2584 | - d3d12: Fix buffer reference leak for SO count staging buffer
|
---|
2585 | - dzn: Handle mismatches in bound descriptor set vs pipeline layout
|
---|
2586 | - d3d12: Respect buffer offsets for sampler views
|
---|
2587 | - dzn: Hook up subgroup size to compute shader compilation
|
---|
2588 | - dzn: Delete queue-level event waits
|
---|
2589 |
|
---|
2590 | Jonathan Gray (3):
|
---|
2591 |
|
---|
2592 | - egl/dri2: avoid undefined unlocks
|
---|
2593 | - intel/dev: remove invalid EHL pci id
|
---|
2594 | - intel/dev: Add another EHL pci id
|
---|
2595 |
|
---|
2596 | Jonathan Marek (1):
|
---|
2597 |
|
---|
2598 | - turnip: fix use of align() instead of util_align_npot() with tile_align_w
|
---|
2599 |
|
---|
2600 | Jordan Justen (8):
|
---|
2601 |
|
---|
2602 | - intel/vk/grl: Don't include anv_private.h in genX_grl.h
|
---|
2603 | - intel/vk/grl: Allow genX_grl.h to be included by C++ files
|
---|
2604 | - intel/vk/grl: Allow grl/grl_cl_kernel.h to be included by C++ files
|
---|
2605 | - intel/vk/grl: genX-ify genX_grl_uuid.cpp
|
---|
2606 | - intel/vk/grl: genX-ify grl_cl_kernel_name()
|
---|
2607 | - intel/dev: Enable MTL PCI ids
|
---|
2608 | - intel/compiler: Support fmul_fsign opt for fp64 when int64 isn't supported
|
---|
2609 | - intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops
|
---|
2610 |
|
---|
2611 | Joshua Peisach (1):
|
---|
2612 |
|
---|
2613 | - gallum/asahi: fix memory leak in agx_resource_from_handle
|
---|
2614 |
|
---|
2615 | José Fonseca (4):
|
---|
2616 |
|
---|
2617 | - llvmpipe: Ensure floating point SSE state is reset regardless of the write mask.
|
---|
2618 | - llvmpipe: Honor zero sample_mask when multisample is disabled.
|
---|
2619 | - trace: Don't use italic escape code.
|
---|
2620 | - wgl: Fix unintentional assignment on assert.
|
---|
2621 |
|
---|
2622 | José Roberto de Souza (107):
|
---|
2623 |
|
---|
2624 | - anv: Start to move i915 specific code from anv_device to i915/anv_device
|
---|
2625 | - anv: Export anv_exec_batch_debug() and chain_command_buffers()
|
---|
2626 | - anv: Split i915 code from anv_batch_chain.c
|
---|
2627 | - anv: Move anv_device_check_status() code to i915/anv_device.c
|
---|
2628 | - intel/dev: Export functions that will be used by different kernel drivers
|
---|
2629 | - intel/dev: Move i915 code to i915/intel_device_info.c
|
---|
2630 | - intel/dev: Split hwconfig i915 specific code
|
---|
2631 | - intel/dev: Detect what is the kernel mode driver loaded
|
---|
2632 | - intel: Add intel_kmd_type parameter to intel_engine_get_info()
|
---|
2633 | - intel: Add kmd_type parameter to necessary intel_gem.h functions
|
---|
2634 | - anv: Nuke anv_queue:index_in_family
|
---|
2635 | - hasvk: Nuke anv_queue:index_in_family
|
---|
2636 | - intel/ds: Nuke intel_ds_queue::queue_id
|
---|
2637 | - intel/ds: Fix crash when allocating more intel_ds_queues than u_vector was initialized
|
---|
2638 | - intel/genxml/gen125: Add walker configuration fields to 3DSTATE_WM
|
---|
2639 | - intel/genxml/gen125: Tune 3DSTATE_WM Walker direction
|
---|
2640 | - intel: Add intel_memory_class_instance
|
---|
2641 | - anv: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance
|
---|
2642 | - anv: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels
|
---|
2643 | - anv: Add basic KMD backend infrastructure
|
---|
2644 | - anv: Start to move anv_gem_stubs.c to kmd backend
|
---|
2645 | - anv: Remove remaining bits of anv_i915_query()
|
---|
2646 | - hasvk: Remove remaining bits of anv_i915_query()
|
---|
2647 | - anv: Add gem_close to kmd backend
|
---|
2648 | - anv: Add gem_mmap to kmd backend
|
---|
2649 | - anv: Move execute_simple_batch() and queue_exec_locked() to kmd backend
|
---|
2650 | - intel/common: Move i915 files to i915 folder
|
---|
2651 | - iris: Export batch debug functions
|
---|
2652 | - iris: Export update_batch_syncobjs()
|
---|
2653 | - iris: Export num_fences()
|
---|
2654 | - intel: Make gen12 URB space reservation dependent on compute engine presence
|
---|
2655 | - intel/blorp: Allocate only necessary amount of VERTEX_BUFFER_STATE
|
---|
2656 | - intel: Pull in xe_drm.h
|
---|
2657 | - intel: Add Meson parameter to enable Xe KMD support
|
---|
2658 | - intel/dev: Add INTEL_KMD_TYPE_XE
|
---|
2659 | - intel/dev: Implement Xe functions to fill intel_device_info
|
---|
2660 | - intel/dev: Implement Xe functions to handle hwconfig
|
---|
2661 | - intel/dev: Query and compute hardware topology for Xe
|
---|
2662 | - iris: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance
|
---|
2663 | - iris/bufmgr: Add i915_gem_set_domain()
|
---|
2664 | - iris: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels
|
---|
2665 | - iris: Add initial skeleton of kmd backend
|
---|
2666 | - iris: Move iris_bo_madvise() to i915/iris_bufmgr.c
|
---|
2667 | - iris: Add iris_bo_set_caching()
|
---|
2668 | - intel/common: Implement the Xe functions for intel_engine
|
---|
2669 | - intel/common: Implement the Xe functions for intel_gem
|
---|
2670 | - iris: Move bo_madvise to kmd backend
|
---|
2671 | - iris: Move bo_set_caching to kmd backend
|
---|
2672 | - iris: Move iris_bo_busy_gem() to i915/iris_bufmgr.c
|
---|
2673 | - iris: Move iris_bo_wait_gem() to i915/iris_bufmgr.c
|
---|
2674 | - iris: Don't mark protected bo as reusable
|
---|
2675 | - intel/perf: Disable it for Xe KMD
|
---|
2676 | - build: Block build of HASVK, Crocus and i915 in non-x86 architectures
|
---|
2677 | - iris: Add gem_mmap() to kmd backend
|
---|
2678 | - iris: Add batch_check_for_reset() to kmd backend
|
---|
2679 | - iris: Move i915 submit_batch() to i915 backend
|
---|
2680 | - anv: Implement gem_create for Xe backend
|
---|
2681 | - anv: Implement Xe functions to create and destroy VM
|
---|
2682 | - anv: Implement gem close and mmap for Xe backend
|
---|
2683 | - anv: Add gem VM bind and unbind to backend
|
---|
2684 | - anv: Integrate gem vm bind and unbind kmd backend functions
|
---|
2685 | - iris: Drop I915_EXEC_FENCE types
|
---|
2686 | - iris: Drop usage of i915 EXEC_OBJECT_WRITE
|
---|
2687 | - iris: Move iris_bufmgr_init_global_vm() to i915/iris_bufmgr.c and prepare for Xe KMD
|
---|
2688 | - anv: Implement Xe version of anv_physical_device_get_parameters()
|
---|
2689 | - anv: Properly alloc buffers that will be promoted to framebuffer in Xe KMD
|
---|
2690 | - anv: Handle external objects allocation in Xe
|
---|
2691 | - iris: Only mark buffer as exported if drmPrimeHandleToFD() succeed
|
---|
2692 | - iris: Implement the Xe version of iris_bufmgr_init_global_vm()
|
---|
2693 | - iris: Implement the function to destroy VM in Xe
|
---|
2694 | - iris: Implement gem_create() in Xe kmd backend
|
---|
2695 | - iris: Implement gem_mmap() in Xe kmd backend
|
---|
2696 | - iris: Store iris_context's priority
|
---|
2697 | - iris: Move to i915/iris_batch.c code to create and replace i915 context
|
---|
2698 | - iris: Move to iris_i915_batch.c code to destroy i915 context
|
---|
2699 | - intel: Move memory aligment information to intel_device_info
|
---|
2700 | - anv: Use intel_device_info memory alignment
|
---|
2701 | - intel: Set mem_alignment in Xe kmd
|
---|
2702 | - anv: Apply memory alignment requirements in Xe kmd
|
---|
2703 | - intel: Add TODO about removal of 2Mb alignment in i915
|
---|
2704 | - anv: Replace I915_ENGINE_CLASS_VIDEO by INTEL_ENGINE_CLASS_VIDEO
|
---|
2705 | - anv: Create Xe engines
|
---|
2706 | - anv: Implement Xe version of check_status()
|
---|
2707 | - anv: Handle Xe queue/engine priority
|
---|
2708 | - anv: Implement Xe version of execute_simple_batch()
|
---|
2709 | - iris: Prepare iris_bufmgr functions for vm bind error paths
|
---|
2710 | - iris: Add vm bind and unbind to kmd backend
|
---|
2711 | - iris: Implement gem_vm_bind() and gem_vm_unbind() in Xe kmd backend
|
---|
2712 | - iris: Ajust gem buffer allocation size in Xe kmd
|
---|
2713 | - intel: Sync xe_drm.h
|
---|
2714 | - anv: Partialy import drm-uapi/gpu_scheduler.h and use it
|
---|
2715 | - anv: Fetch max_context_priority from drm_xe_query_config
|
---|
2716 | - intel: Allocate mesh shader URB space before task shader
|
---|
2717 | - anv: Move to a function code to clflush batch buffers
|
---|
2718 | - anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace()
|
---|
2719 | - anv: Disable anv_bo_sync_type for Xe kmd
|
---|
2720 | - anv: Add assert in functions not supported by Xe kmd
|
---|
2721 | - iris: Add BO_ALLOC_SHARED
|
---|
2722 | - iris: Handle allocation of exported buffers in Xe kmd
|
---|
2723 | - iris: Handle allocation of scanout buffers in Xe
|
---|
2724 | - iris: Implement Xe version of bo_madvise() and bo_set_caching()
|
---|
2725 | - anv: Fix vm bind of imported buffers
|
---|
2726 | - iris: Add function to close gem bos
|
---|
2727 | - iris: Handle Xe syncronization with syncobjs
|
---|
2728 | - loader: Add Xe KMD support
|
---|
2729 | - iris: Fix close of exported bos
|
---|
2730 | - iris: Allow shared scanout buffer to be placed in smem as well
|
---|
2731 |
|
---|
2732 | Juan A. Suarez Romero (8):
|
---|
2733 |
|
---|
2734 | - v3d/v3dv: define performance counters in common
|
---|
2735 | - v3d: cache pipe query results
|
---|
2736 | - v3d: include offset as part of streamout target
|
---|
2737 | - v3d: implement NV_conditional_render extension
|
---|
2738 | - v3d: fix condition for EZ disabling when stencil on
|
---|
2739 | - v3d: set depth compare function correctly
|
---|
2740 | - v3d: use primitive type to get stream output offset
|
---|
2741 | - v3d: apply 1D texture miplevel alignment in arrays
|
---|
2742 |
|
---|
2743 | Julia Tatz (3):
|
---|
2744 |
|
---|
2745 | - zink: zink_heap isn't 1-to-1 with memoryTypeIndex
|
---|
2746 | - zink: trival renames heap_idx -> memoryTypeIndex
|
---|
2747 | - zink: correct sparse bo mem_type_idx placement
|
---|
2748 |
|
---|
2749 | Juston Li (29):
|
---|
2750 |
|
---|
2751 | - venus: refactor out vn_feedback_event_cmd_record2
|
---|
2752 | - venus: refactor VK_KHR_synchronization2 ext sync fd requirements
|
---|
2753 | - venus: require importable external semaphores for WSI
|
---|
2754 | - venus: require exportable bit for ext fence sync fd
|
---|
2755 | - venus: require exportable/importable bit for ext semaphores sync fd
|
---|
2756 | - venus: remove filtering external semaphores for QueueSubmit
|
---|
2757 | - venus: drop VkQueueBindSparse
|
---|
2758 | - venus: append fence feedback batch
|
---|
2759 | - venus: refactor QueueSubmit/QueueSubmit2
|
---|
2760 | - venus: vn_queue: align vulkan object variable naming
|
---|
2761 | - docs/envvars: add missing mesa disk cache envvars
|
---|
2762 | - util/fossilize_db: don't destroy foz on RO load fail
|
---|
2763 | - utils/fossilize_db: refactor out loading RO foz dbs
|
---|
2764 | - util/fossilize_db: add runtime RO foz db loading via FOZ_DBS_DYNAMIC_LIST
|
---|
2765 | - util/fossilize_db: fix macOS inotify build error
|
---|
2766 | - util/fossilize_db: add ifdef for inotify header
|
---|
2767 | - util/tests/cache_test: Skip Cache.List if not supported
|
---|
2768 | - anv: check initial cmd_buffer is chainable
|
---|
2769 | - venus: refactor semaphore helper functions
|
---|
2770 | - venus: refactor batch submission fixup
|
---|
2771 | - venus: add NO_TIMELINE_SEM_FEEDBACK perf option
|
---|
2772 | - venus: add timeline semaphore feedback cmds
|
---|
2773 | - venus: enable timeline semaphore feedback
|
---|
2774 | - venus: add SHADER_DEVICE_ADDRESS_BIT to buffer cache
|
---|
2775 | - venus: switch to lazy VkBuffer cache
|
---|
2776 | - venus: add VN_DEBUG_CACHE flag
|
---|
2777 | - venus: Add VkBuffer cache statistics for debug
|
---|
2778 | - venus: shader cache fossilize replay fix
|
---|
2779 | - util/disk_cache: use posix_fallocate() for index files
|
---|
2780 |
|
---|
2781 | Kai Wasserbäch (5):
|
---|
2782 |
|
---|
2783 | - fix: gallivm: limit usage of LLVMContextSetOpaquePointers() to LLVM 15
|
---|
2784 | - fix(FTBFS): gallivm: fix LLVM #include of Triple.h, moved to TargetParser
|
---|
2785 | - fix(FTBFS): clover: fix LLVM #include of Triple.h, moved to TargetParser
|
---|
2786 | - fix: clover/llvm: replace llvm::None with std::nullopt for LLVM 17+
|
---|
2787 | - fix: gallivm: fix LLVM #include of Host.h, moved to TargetParser
|
---|
2788 |
|
---|
2789 | Kai-Heng Feng (2):
|
---|
2790 |
|
---|
2791 | - iris: Retry DRM_IOCTL_I915_GEM_EXECBUFFER2 on ENOMEM
|
---|
2792 | - Revert "iris: Avoid abort() if kernel can't allocate memory"
|
---|
2793 |
|
---|
2794 | Karmjit Mahil (28):
|
---|
2795 |
|
---|
2796 | - pvr: Process set and reset event sub commands.
|
---|
2797 | - pvr: Process wait event sub command.
|
---|
2798 | - pvr: Add SPM scratch buffer infrastructure.
|
---|
2799 | - pvr: Acquire scratch buffer on framebuffer creation.
|
---|
2800 | - pvr: Update comment about ZS and MSAA buffers for pvrsrvkm submission.
|
---|
2801 | - pvr: Set SPMSCRATCHBUFFER flag.
|
---|
2802 | - pvr: Add SPM load usc empty programs
|
---|
2803 | - pvr: Upload spm load programs to device.
|
---|
2804 | - pvr: Add support for VK_ATTACHMENT_LOAD_OP_LOAD.
|
---|
2805 | - pvr: Move descriptor write into pvr_write_descriptor_set()
|
---|
2806 | - pvr: Add support to copy descriptors on vkUpdateDescriptorSets()
|
---|
2807 | - pvr: Handle VK_QUERY_RESULT_WAIT_BIT.
|
---|
2808 | - pvr: Store enum pvr_stage_allocation instead of VkShaderStageFlags
|
---|
2809 | - pvr: Put old descriptor set approach behind a hardcoding check
|
---|
2810 | - pvr: Change last_DMA to last_dma
|
---|
2811 | - pvr: Write descriptor set addrs table dev addr into shareds
|
---|
2812 | - pvr: Add PVR_SELECT() helper macro
|
---|
2813 | - pvr: Add push consts support to descriptor program.
|
---|
2814 | - pvr: Add support for dynamic buffers descriptors
|
---|
2815 | - pvr: Add support for blend constants.
|
---|
2816 | - pvr: Move PBE START_POS into csb enum helpers header
|
---|
2817 | - pvr: Setup SPM EOT state
|
---|
2818 | - pvr: Remove unused msaa_mode field
|
---|
2819 | - pvr: Remove component_alignment
|
---|
2820 | - pvr: Setup SPM background object
|
---|
2821 | - pvr: Don't advertise currently unsupported features
|
---|
2822 | - pvr: Advertise STORAGE_IMAGE_BIT for B10G11R11_UFLOAT_PACK32
|
---|
2823 | - pvr: Don't advertise S8_UINT support
|
---|
2824 |
|
---|
2825 | Karol Herbst (44):
|
---|
2826 |
|
---|
2827 | - rusticl: fix build error with valgrind being enabled
|
---|
2828 | - rusticl/util: extract offset_of macro
|
---|
2829 | - rusticl/icd: Make it work in case Rustc shuffles struct around
|
---|
2830 | - rusticl/kernel: fix clGetKernelInfo CL_KERNEL_ATTRIBUTES for non source programs
|
---|
2831 | - rusticl/program: enable spirv
|
---|
2832 | - llvmpipe/ci: increase deqp-runner timeout
|
---|
2833 | - rusticl/device: fix some device limits
|
---|
2834 | - rusticl/device: limit CL_DEVICE_MAX_CONSTANT_ARGS
|
---|
2835 | - rusticl: no compute only
|
---|
2836 | - rusticl: allocate printf buffer as staging
|
---|
2837 | - nir: Skip samplers and textures in lower_explicit_io
|
---|
2838 | - nir/deref: don't replace casts with deref_struct if we'd lose the stride
|
---|
2839 | - ci/zink: move threading tests to flakes
|
---|
2840 | - rusticl/kernel: Images arg sizes also have to match the host pointer size
|
---|
2841 | - gallivm: fix lp_vec_add_offset_ptr for 32 bit builds
|
---|
2842 | - nvc0: enable fp helper invocation memory loads on Turing+
|
---|
2843 | - nir: track existence of variable shared memory
|
---|
2844 | - rusticl/kernel: set has_variable_shared_mem on the nir
|
---|
2845 | - gallium: add get_compute_state_info
|
---|
2846 | - lp: implement get_compute_state_info
|
---|
2847 | - iris: implement get_compute_state_info
|
---|
2848 | - nv50: implement get_compute_state_info
|
---|
2849 | - nvc0: implement get_compute_state_info
|
---|
2850 | - panfrost: move max_thread_count and take reg_count into account
|
---|
2851 | - panfrost: implement get_compute_state_info
|
---|
2852 | - rusticl/kernel: make use of cso info
|
---|
2853 | - radeonsi: implement get_compute_state_info
|
---|
2854 | - radeonsi: use default float mode for CL
|
---|
2855 | - rusticl: enable radeonsi
|
---|
2856 | - rusticl: split platform into core and api parts
|
---|
2857 | - rusticl/platform: rename _cl_platform_id to Platform
|
---|
2858 | - rusticl/platform: move getter into the type
|
---|
2859 | - rusticl/platform: move device initialization to the platform
|
---|
2860 | - rusticl/program: allow dumping compilation logs through RUSTICL_DEBUG
|
---|
2861 | - rusticl/program: make IL programs look closer to CLC ones
|
---|
2862 | - clc: add clc_validate_spirv
|
---|
2863 | - rusticl/program: validate the SPIR-V when created from IL
|
---|
2864 | - rusticl/program: extract common code of compile and build
|
---|
2865 | - rusticl/program: rework source code tracking
|
---|
2866 | - rusticl/event: drop work item before updating status
|
---|
2867 | - radeonsi: lower mul_high
|
---|
2868 | - ac/llvm: support shifts on 16 bit vec2
|
---|
2869 | - rusticl: don't set size_t-is-usize for >=bindgen-0.65
|
---|
2870 | - nvc0: do not randomly emit fences.
|
---|
2871 |
|
---|
2872 | Kenneth Graunke (37):
|
---|
2873 |
|
---|
2874 | - intel/blorp: Lower base_workgroup_id to zero
|
---|
2875 | - intel/compiler: Move atomic op translation into emit_*_atomic()
|
---|
2876 | - intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs
|
---|
2877 | - intel/compiler: Add an lsc_op_num_data_values() helper
|
---|
2878 | - intel/compiler: Eliminate SHADER_OPCODE_UNTYPED_ATOMIC_FLOAT
|
---|
2879 | - intel/compiler: Drop redundant 32-bit expansion for shared float atomics
|
---|
2880 | - intel/compiler: Delete fs_visitor::nir_emit_{ssbo,shared}_atomic_float()
|
---|
2881 | - intel/compiler: Combine nir_emit_{ssbo,shared}_atomic into one helper
|
---|
2882 | - intel/compiler: Delete all the A64 atomic variants for type sizes
|
---|
2883 | - intel/compiler: Drop dest checking in atomic code
|
---|
2884 | - intel/compiler: Use more symbolic source names in components_read()
|
---|
2885 | - anv: Add missing untyped data port flush on PIPELINE_SELECT
|
---|
2886 | - iris: Add missing untyped data port flush on PIPELINE_SELECT
|
---|
2887 | - loader: Add infrastructure for tracking active CRTC resources
|
---|
2888 | - egl: Rewrite eglGetMscRateANGLE to avoid probes and handle multi-monitor
|
---|
2889 | - iris: Perform load_constant address math in 32-bit rather than 64-bit
|
---|
2890 | - anv: Perform load_constant address math in 32-bit rather than 64-bit
|
---|
2891 | - anv: Make a batch decoder for each queue family
|
---|
2892 | - nir: Print divergence information for registers as well as SSA defs
|
---|
2893 | - nir: Fix merge_set_dump() to compile again
|
---|
2894 | - nir: Fix typos in the from-SSA pass comments
|
---|
2895 | - intel: Use common helpers for TCS passthrough shaders
|
---|
2896 | - intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes
|
---|
2897 | - intel/fs: Add builder helpers for F32TO16/F16TO32 that work on Gfx7.x
|
---|
2898 | - intel/fs: Delete a TODO about using brw_F32TO16.
|
---|
2899 | - intel/fs: Use new F16TO32 helpers for unpack_half_split_* opcodes
|
---|
2900 | - Revert "intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes"
|
---|
2901 | - intel/fs: Use F32TO16/F16TO32 helpers in fquantize16 handling
|
---|
2902 | - intel/fs: Move packHalf2x16 handling to lower_pack()
|
---|
2903 | - intel/eu: Simplify brw_F32TO16 and brw_F16TO32
|
---|
2904 | - intel/vec4: Retype texture/sampler indexes to UD
|
---|
2905 | - intel/fs: Make bld.F16TO32 actually emit F16TO32 not F32TO16
|
---|
2906 | - i965/vec4: Implement uclz in the vec4 backend
|
---|
2907 | - st/mesa, iris: Add optional CPU-based ASTC void extent denorm flushing
|
---|
2908 | - intel/compiler: Use nir_dest_bit_size() for ballot bit size check
|
---|
2909 | - iris: Extend resource creation helpers to allow for explicit strides
|
---|
2910 | - iris: Hack around gbm_gralloc stride restrictions
|
---|
2911 |
|
---|
2912 | Konrad Dybcio (3):
|
---|
2913 |
|
---|
2914 | - freedreno/registers: Add RBBM_GPR0_CNTL for non-GMU operation
|
---|
2915 | - freedreno: Add A2xx perf counter reg values
|
---|
2916 | - freedreno: Add A2xx REG_A2XX_RBBM_PM_OVERRIDE2 bitfields
|
---|
2917 |
|
---|
2918 | Konstantin Seurer (78):
|
---|
2919 |
|
---|
2920 | - vulkan: Track the nullDescriptor feature
|
---|
2921 | - radv: Add a helper for finding memory indices
|
---|
2922 | - radv: Create a null TLAS as meta state
|
---|
2923 | - radv: Use the null accel struct instead of emitting 0
|
---|
2924 | - radv/rt: Get rid of accel struct null checks
|
---|
2925 | - radv: Advertise rt pipelines for Control (DX12)
|
---|
2926 | - radv/bvh/meson: Add the option to set defines
|
---|
2927 | - radv/bvh: Add a define for extended SAH
|
---|
2928 | - radv: Add a shader variant for PLOC with extended SAH
|
---|
2929 | - radv: Wrap internal build type inside a build_config struct
|
---|
2930 | - radv: Enable extended SAH for shallow BVHs
|
---|
2931 | - radv: Merge the leaf and internal converter
|
---|
2932 | - radv: Improve the BVH size estimation
|
---|
2933 | - radv: Fix creating accel structs with unbound buffers
|
---|
2934 | - radv: Work around shader_call_data variables in raygen shaders
|
---|
2935 | - radv/rq: Use 16 stack entries if there is only one ray query
|
---|
2936 | - radv/llvm: Use the shader names as module name
|
---|
2937 | - ac/llvm: Fix validation error with global io
|
---|
2938 | - radv: Scalarize global IO with LLVM enabled
|
---|
2939 | - radv: Make radv_compute_dispatch non-static
|
---|
2940 | - radv: Implement ordered compute dispatches
|
---|
2941 | - radv: Use an ordered dispatch for BVH encoding
|
---|
2942 | - radv: Remove radv_indirect_unaligned_dispatch
|
---|
2943 | - vulkan,nir: Refactor ycbcr conversion state into a struct
|
---|
2944 | - radv: Use common ycbcr conversion lowering
|
---|
2945 | - radv/rra: Find copy memory index when initializing the trace state
|
---|
2946 | - radv/rra: Hide deferred accel struct data destruction behind an env var
|
---|
2947 | - radv: Hash VK_PIPELINE_CREATE_RAY_TRACING_NO_NULL_* flags
|
---|
2948 | - radv: Clean up dynamic RT stack allocation
|
---|
2949 | - ac/llvm: Implement bvh64_intersect_ray_amd
|
---|
2950 | - radv: Make accel struct meta state initialization thread safe
|
---|
2951 | - radv: Force ACO for BVH build shaders
|
---|
2952 | - radv: Pre-compile BVH build shaders if there is a cache
|
---|
2953 | - radv: Advertise ray query support with LLVM
|
---|
2954 | - radv/rt: Skip instances after loading the entire node
|
---|
2955 | - vulkan: Add vk_acceleration_structure
|
---|
2956 | - radv: Use vk_acceleration_structure
|
---|
2957 | - anv: Use vk_acceleration_structure
|
---|
2958 | - radv/bvh/encode: Use the node type for identifying internal nodes
|
---|
2959 | - radv/bvh: Replace is_final_tree with bvh_offset
|
---|
2960 | - radv/bvh/encode: Move bvh_offset NULL check to the top of the loop
|
---|
2961 | - radv/bvh/encode: Introduce is_root_node
|
---|
2962 | - radv/bvh/encoder: Move dst_node initialization into the loop
|
---|
2963 | - radv: Add a build config for compact builds
|
---|
2964 | - radv/bvh: Implement compact encoding
|
---|
2965 | - radv: Use compact encoding
|
---|
2966 | - radv: Move the geometry infos before the BVH
|
---|
2967 | - radv/bvh: Move the size header field up
|
---|
2968 | - radv/bvh: Add a shader for filling the header
|
---|
2969 | - radv: Use indirect header filling for compact builds
|
---|
2970 | - nir: Add cull_mask_and_flags_amd intrinsic
|
---|
2971 | - radv/rt: Merge cull_mask and flags
|
---|
2972 | - radv/rt: Pre shift cull_mask
|
---|
2973 | - radv: Move header and geometry info init into separate functions
|
---|
2974 | - radv: Only init geometry infos if RRA is enabled
|
---|
2975 | - radv/rt: Use ushr for extracting the cull mask
|
---|
2976 | - radv/rt: Fix updating stack_size if the shader uses scratch
|
---|
2977 | - radv/rt: Use vk_pipeline_hash_shader_stage for RT stages
|
---|
2978 | - vulkan: Add vk_shader_module_init
|
---|
2979 | - radv/rt: Properly handle pNext of pipeline library stages
|
---|
2980 | - radv/sqtt: Skip dumping pipeline libraries
|
---|
2981 | - radv: Fix loading stack_size from the cache
|
---|
2982 | - radv: Fix inserting stack_size into the cache
|
---|
2983 | - radv/rt: Handle load_constant instructions when inlining shaders
|
---|
2984 | - nir/lower_shader_calls: Remat derefs before lowering resumes
|
---|
2985 | - radv/rt: Refactor rq_load lowering
|
---|
2986 | - radv/rq: Rematerialize inv_dir before proceed
|
---|
2987 | - radv: Set user SGPR locations when declaring args
|
---|
2988 | - radv: Stop counting user SGPRS separately
|
---|
2989 | - radv/ci: Update ray tracing pipeline fail/skip lists
|
---|
2990 | - radv: Add radv_shader_type to fix gs_copy and trap handler handling
|
---|
2991 | - radv: Remove some dead radv_shader_args setup
|
---|
2992 | - aco: Remove is_gs_copy_shader
|
---|
2993 | - radv: Remove has_previous_stage
|
---|
2994 | - radv: Pack and encode geometry id and flags on the CPU
|
---|
2995 | - radv/bvh: Remove calculate_node_bounds
|
---|
2996 | - radv: Remove radv_bvh_aabb_node::aabb
|
---|
2997 | - nir/lower_fp16_casts: Fix SSA dominance
|
---|
2998 |
|
---|
2999 | Kurt Kartaltepe (1):
|
---|
3000 |
|
---|
3001 | - drirc: Set limit_trig_input_range option for Nier games
|
---|
3002 |
|
---|
3003 | Lang Yu (1):
|
---|
3004 |
|
---|
3005 | - amd/common: fix a typo
|
---|
3006 |
|
---|
3007 | Lepton Wu (1):
|
---|
3008 |
|
---|
3009 | - egl/dri2: Use primary device in EGL device platform for kms_swrast
|
---|
3010 |
|
---|
3011 | Lina Versace (8):
|
---|
3012 |
|
---|
3013 | - util/glsl2spirv: Fix build with Python 3.6
|
---|
3014 | - docs: Lower Python requirement to 3.6
|
---|
3015 | - mailmap: Lina is Chad's new name
|
---|
3016 | - mailmap: Add Lina's new google.com address
|
---|
3017 | - venus: Update protocol for VK_EXT_memory_budget
|
---|
3018 | - venus: Delete vn_renderer_info::has_cache_management
|
---|
3019 | - venus: Refactor vn_physical_device_init_memory_properties
|
---|
3020 | - venus: Enable VK_EXT_memory_budget
|
---|
3021 |
|
---|
3022 | Lionel Landwerlin (151):
|
---|
3023 |
|
---|
3024 | - pps: print out message when we get the first counters
|
---|
3025 | - anv: record secondaries' traces into primaries
|
---|
3026 | - intel/ds: track secondary cmdbuffers in perfetto
|
---|
3027 | - intel/ds: move event_id access to perfetto lambda
|
---|
3028 | - util/u_trace: add support for variable length trace points
|
---|
3029 | - vulkan/debug_utils: copy debug util labels
|
---|
3030 | - anv: add support for command buffer tagging in traces
|
---|
3031 | - intel/ds: add INTEL_GPU_TRACEPOINT envvar to toggle tracepoints
|
---|
3032 | - intel/ds: remove unused trace point
|
---|
3033 | - intel/utrace: document tracepoints
|
---|
3034 | - Revert "ci: build hasvk if we're building anv"
|
---|
3035 | - intel: use a shared UUID with other drivers
|
---|
3036 | - nir/divergence: add missing RT intrinsinc handling
|
---|
3037 | - anv: fix generated indirect draw shader stats checks
|
---|
3038 | - nir/lower_io: fix bounds checking for 64bit_bounded_global
|
---|
3039 | - anv: fix preemption enable emission in gpu_memcpy
|
---|
3040 | - intel/fs: avoid cmod optimization on instruction with different write_mask
|
---|
3041 | - intel/decoder: print out compute push constants
|
---|
3042 | - intel/common: add a INTEL_DECODE variable to parameter decoder at runtime
|
---|
3043 | - vulkan/wsi/wayland: improve same gpu detection
|
---|
3044 | - intel/fs: drop FS_OPCODE_UNIFORM_PULL_CONSTANT_LOAD_GFX7
|
---|
3045 | - anv: fix null descriptors
|
---|
3046 | - docs: list anv in EXT_extended_dynamic_state3 support
|
---|
3047 | - intel: add missing PS restriction on BDW+
|
---|
3048 | - anv: expose EXT_load_store_op_none
|
---|
3049 | - intel/fs: make alpha_to_coverage a tristate
|
---|
3050 | - anv: rename RT pipeline function helper
|
---|
3051 | - anv: rename a few internal functions to highlight gfx use
|
---|
3052 | - intel/ds: track end of pipe bits
|
---|
3053 | - anv: use malloc for host only descriptor sets
|
---|
3054 | - anv: put the video extension behind a variable
|
---|
3055 | - intel/fs: fix mesh indirect movs
|
---|
3056 | - intel/dev: add a default urb value for intel_stub_gpu on dg2
|
---|
3057 | - anv/hasvk: handle a SAMPLED_READ/STORAGE_READ access flags
|
---|
3058 | - anv: remove copied information from runtime graphics state
|
---|
3059 | - anv: move 3DSTATE_VERTEX_ELEMENT emission to dynamic path
|
---|
3060 | - anv: implement VK_EXT_vertex_input_dynamic_state
|
---|
3061 | - intel/fs: bound subgroup invocation read to dispatch size
|
---|
3062 | - hasvk: fix KHR_shader_float_controls reporting
|
---|
3063 | - intel/perf: also add the oa timestamp shift on MTL
|
---|
3064 | - anv: move debug submit to helper and call it on execbuf failure
|
---|
3065 | - anv: track vram only BOs to print things out on ENOMEM execbuf
|
---|
3066 | - anv: fix vma heap memory leak
|
---|
3067 | - anv: fix invalid masking of 48bit address
|
---|
3068 | - anv: remove assert typed write support when using NULL surface
|
---|
3069 | - vulkan/runtime: store parameters of VK_EXT_sliced_view_of_3d
|
---|
3070 | - anv: fixup Wa_16011107343 for Gfx12 only
|
---|
3071 | - iris: fix Wa_16011107343 for Gfx12
|
---|
3072 | - anv: remove more Gfx7 code
|
---|
3073 | - genxml: Fix STATE_BASE_ADDRESS::BindlessSurfaceStateSize field size
|
---|
3074 | - genxml: fix border color offset field on Gfx12+
|
---|
3075 | - anv/hasvk: speed up null image/view descriptor writes
|
---|
3076 | - anv: fix scratch buffer reloc in 3DSTATE_HS
|
---|
3077 | - anv: fixup condition for Wa_14016118574
|
---|
3078 | - anv: pull Wa_14016118574 out of some loop not changing state
|
---|
3079 | - util/glsl2spirv: add support for include directive
|
---|
3080 | - anv: fix incorrect parameter
|
---|
3081 | - anv: correctly reset generation address on command buffer reset
|
---|
3082 | - anv: fix generated forward jump with more than 67M draws
|
---|
3083 | - anv: remove copied code from generation shader
|
---|
3084 | - anv: remove BTI related flush in generation shaders
|
---|
3085 | - anv: correctly program 3DSTATE_SF in generation shaders
|
---|
3086 | - anv: limit push constant dirtyness with generation shaders
|
---|
3087 | - anv: remove pre hasvk split assert
|
---|
3088 | - anv: remove commented code
|
---|
3089 | - anv: fix 3DSTATE_PS emission in generation shaders
|
---|
3090 | - anv: fix indirect draws VF cache tracking of index buffer
|
---|
3091 | - anv: make sure mi_memcpy lands before push constant loads
|
---|
3092 | - anv: remove MI_NOOPs at the end of the generation batch
|
---|
3093 | - anv: use a single generation shader for indirect draws
|
---|
3094 | - anv: rename generated draws for Gfx11
|
---|
3095 | - anv: use 64bit int support in generation shaders
|
---|
3096 | - anv: pack more data into generated draws input
|
---|
3097 | - anv: move common shader code into header
|
---|
3098 | - anv: use a list of generated shaders
|
---|
3099 | - anv: remove unused item_count parameter
|
---|
3100 | - anv: add gfx9 generated draw support
|
---|
3101 | - blorp: add dependency on idep_intel_dev
|
---|
3102 | - vulkan/runtime: only consider slice info with 3D image views
|
---|
3103 | - anv: VK_EXT_image_sliced_view_of_3d
|
---|
3104 | - nir: fix nir_ishl_imm
|
---|
3105 | - anv: enable VK_EXT_pipeline_library_group_handles
|
---|
3106 | - anv/iris: report counter symbols with debug option
|
---|
3107 | - intel/fs: report max register pressure in shader stats
|
---|
3108 | - anv: report max register pressure in pipeline properties
|
---|
3109 | - anv: force MEDIA_INTERFACE_DESCRIPTOR_LOAD reemit after 3D->GPGPU switch
|
---|
3110 | - radv: use 1ull for alignment computations
|
---|
3111 | - util: allow align64() to do alignments >= 4Gb
|
---|
3112 | - docs: fix invalid link
|
---|
3113 | - iris: trace frames with u_trace
|
---|
3114 | - anv: export EXT_pipeline_library_group_handles only with RT
|
---|
3115 | - docs: update Anv features support
|
---|
3116 | - anv: more formats for acceleration structure vertices
|
---|
3117 | - intel/fs: don't SEND messages as partial writes
|
---|
3118 | - intel/fs: fix nir_opt_peephole_ffma max vec assumption
|
---|
3119 | - intel/fs: fixup sources number from opt_algebraic
|
---|
3120 | - intel/fs: add MOV source count validation
|
---|
3121 | - intel/fs: prevent large vector ops generated by peephole_ffma
|
---|
3122 | - intel/fs: fix subgroup invocation read bounds checking
|
---|
3123 | - vulkan/wsi: add a headless swapchain implementation/option
|
---|
3124 | - intel/compiler: report max dispatch width statistic
|
---|
3125 | - anv: report shader max dispatch width in pipeline props
|
---|
3126 | - intel/devinfo: add an option to pick platform to print
|
---|
3127 | - intel/devinfo: printout URB entries
|
---|
3128 | - intel/dev: use generated WA helpers for Wa_22012575642
|
---|
3129 | - intel/devinfo: dedicated entries for XeHP
|
---|
3130 | - intel/devinfo: initialize pci_device_id with from_pci_id()
|
---|
3131 | - intel/dev: fold Gfx12 URB entries in Gfx12 HW info
|
---|
3132 | - util/u_trace: move needs_cs_param option to tracepoints
|
---|
3133 | - vulkan/runtime: also copy strings on queue debug utils
|
---|
3134 | - intel/ds: rename frame timeline row to queue
|
---|
3135 | - anv: fix incorrect utrace bo release
|
---|
3136 | - anv: fixup locking for utrace submission increments
|
---|
3137 | - anv: rename anv_utrace_flush_copy in anv_utrace_submit
|
---|
3138 | - anv: add utrace support for queue debug utils
|
---|
3139 | - anv: implement recommended flush/wait of AUX-TT invalidation
|
---|
3140 | - iris: implement recommended flush/wait of AUX-TT invalidation
|
---|
3141 | - anv: hash immutable sampler conversion data not pointers
|
---|
3142 | - anv: compute the largest GRL kernel scratch size
|
---|
3143 | - anv: move queue check helpers to anv_private
|
---|
3144 | - anv: take care of maxStorageBufferRange being uint32_t
|
---|
3145 | - isl: update max buffer size for SKL+
|
---|
3146 | - intel/dev: set a default valid kmd_type
|
---|
3147 | - intel/perf: fix OA format selection on MTL
|
---|
3148 | - intel/fs: run VGRF compaction just before max live register accounting
|
---|
3149 | - intel/fs: don't consider fixup_nomask_control_flow SENDs predicate
|
---|
3150 | - intel/fs: UNDEF fixup_nomask_control_flow temp register
|
---|
3151 | - intel/fs: copy instruction sources in logical send lowering
|
---|
3152 | - intel/fs: factor out lsc surface descriptor settings
|
---|
3153 | - nir: reuse nir_component_mask() where it makes sense
|
---|
3154 | - nir: add 2 new intel intrinsics for uniform ssbo/shared loads
|
---|
3155 | - intel/fs: optimize uniform SSBO & shared loads
|
---|
3156 | - intel/fs: also allow vec8+ vectorization of load_global_const_block_intel
|
---|
3157 | - anv: pass steam output as argument for anv_dump_pipe_bits
|
---|
3158 | - anv: replace query flush before gpu copy by semaphore wait
|
---|
3159 | - anv: fixup streamout write barriers
|
---|
3160 | - intel/fs: use nomask for setting cr0 for float controls
|
---|
3161 | - anv: exclude performance queries from blorp clears
|
---|
3162 | - intel/ds: add a new timeline row for frames
|
---|
3163 | - anv: add utrace tracking of frame boundaries
|
---|
3164 | - vulkan/runtime: discard unused graphics stages in libraries
|
---|
3165 | - intel/vec4: force exec_all on float control instruction
|
---|
3166 | - vulkan/overlay: deal with unknown pNext structures
|
---|
3167 | - isl: don't set inconsistent fields for depth when using stencil only
|
---|
3168 | - isl: fix a number of errors on storage format support on Gfx9/12.5
|
---|
3169 | - anv: rework Wa_14017076903 to only apply with occlusion queries
|
---|
3170 | - nir/divergence: add missing load_global_constant_* intrinsics
|
---|
3171 | - anv: fix anv_nir_lower_ubo_loads pass
|
---|
3172 | - intel/fs: fix per vertex input clamping
|
---|
3173 | - intel/compiler: make uses_pos_offset a tri-state
|
---|
3174 | - intel/fs: fix scheduling of HALT instructions
|
---|
3175 |
|
---|
3176 | Liviu Prodea (1):
|
---|
3177 |
|
---|
3178 | - meson: Ignore unused variables when assertions are disabled
|
---|
3179 |
|
---|
3180 | Lone_Wolf (3):
|
---|
3181 |
|
---|
3182 | - compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+
|
---|
3183 | - clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+
|
---|
3184 | - clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+
|
---|
3185 |
|
---|
3186 | Luc Ma (2):
|
---|
3187 |
|
---|
3188 | - xlib: fix glXDestroyContext in Gallium frontends
|
---|
3189 | - meson: keep Mako version checking in accord with build msg
|
---|
3190 |
|
---|
3191 | Lucas Fryzek (11):
|
---|
3192 |
|
---|
3193 | - crocus: Add support for \`get_screen_fd`
|
---|
3194 | - tegra: Add support for \`get_screen_fd`
|
---|
3195 | - nouveau: Add support for \`get_screen_fd`
|
---|
3196 | - zink: Add support for \`get_screen_fd`
|
---|
3197 | - iris: Add support for \`get_screen_fd`
|
---|
3198 | - i915: Add support for \`get_screen_fd`
|
---|
3199 | - svga: Add support for \`get_screen_fd`
|
---|
3200 | - virgl: Add support for \`get_screen_fd`
|
---|
3201 | - r300/r600/radeon_si: Add support for \`get_screen_fd`
|
---|
3202 | - d3d12/llvmpipe/softpipe: Add support for \`get_screen_fd`
|
---|
3203 | - gallium: Modify default path for DMABUF to use DRM
|
---|
3204 |
|
---|
3205 | Lucas Stach (7):
|
---|
3206 |
|
---|
3207 | - etnaviv: don't drop TS capability on GPUs with MMUv2
|
---|
3208 | - etnaviv: drm: fix BO array leaks
|
---|
3209 | - etnaviv: free pm queries dynarray on screen destroy
|
---|
3210 | - etnaviv: drm: fix check if BO is on a deferred destroy list
|
---|
3211 | - etnaviv: fix double scanout import of multiplanar resources
|
---|
3212 | - etnaviv: flush VS texture cache when texture data is changed
|
---|
3213 | - etnaviv: fix texture barrier implementation
|
---|
3214 |
|
---|
3215 | Luigi Santivetti (4):
|
---|
3216 |
|
---|
3217 | - pvr: fix uses_tile_buffers in clear color attachment
|
---|
3218 | - pvr: add support for tile buffer output clear
|
---|
3219 | - pvr: add padding bytes when allocating buffer memory
|
---|
3220 | - pvr: fix segfault in dEQP-VK.ycbcr.query.*
|
---|
3221 |
|
---|
3222 | Luna Nova (3):
|
---|
3223 |
|
---|
3224 | - device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1)
|
---|
3225 | - device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior
|
---|
3226 | - device_select_layer: pick a default device before applying DRI_PRIME
|
---|
3227 |
|
---|
3228 | Lynne (1):
|
---|
3229 |
|
---|
3230 | - aco_validate: allow for wave32 in p_dual_src_export_gfx11
|
---|
3231 |
|
---|
3232 | M Henning (1):
|
---|
3233 |
|
---|
3234 | - nouveau/codegen: Check nir_dest_num_components
|
---|
3235 |
|
---|
3236 | Maarten Lankhorst (1):
|
---|
3237 |
|
---|
3238 | - iris: Place scanout buffers only into lmem for discrete GPUs
|
---|
3239 |
|
---|
3240 | Marcin Ślusarz (23):
|
---|
3241 |
|
---|
3242 | - intel/compiler: fix generation of vec8/vec16 alu instruction
|
---|
3243 | - intel/compiler/mesh: handle const data in task & mesh programs
|
---|
3244 | - intel/compiler: fine-grained control of dispatch widths
|
---|
3245 | - nir: add nir_mod_analysis & its tests
|
---|
3246 | - intel/compiler/mesh: optimize indirect writes
|
---|
3247 | - intel/compiler/mesh: support longer write messages
|
---|
3248 | - intel/compiler/mesh: remove dead code path supporting >4 dword writes
|
---|
3249 | - intel/compiler/mesh: use U888X packed index format
|
---|
3250 | - anv: bump ANV_MAX_QUEUE_FAMILIES
|
---|
3251 | - intel/compiler: replace gl_Layer & gl_ViewportIndex by 0 in fs if ms doesn't write it
|
---|
3252 | - anv: fix how unset gl_Viewport & gl_Layer are handled in mesh case
|
---|
3253 | - intel/compiler/mesh: use slice id of task urb handles in mesh shaders
|
---|
3254 | - anv: enable task redistribution
|
---|
3255 | - intel/compiler/mesh: apply URB payload mask once per program
|
---|
3256 | - intel/compiler/mesh: follow the type of offset variable
|
---|
3257 | - intel/compiler: remove unused field from fs_thread_payload
|
---|
3258 | - anv: halve the push constants space in mesh pipelines
|
---|
3259 | - crocus/meson: add back dependency on libintel_dev
|
---|
3260 | - anv,hasvk: remove stale comments
|
---|
3261 | - anv: call nir_shader_gather_info early
|
---|
3262 | - anv: work around for per-prim attributes corruption
|
---|
3263 | - intel/compiler: compactify locations of mesh outputs
|
---|
3264 | - anv: ignore structure types handled in vk_device_memory_create
|
---|
3265 |
|
---|
3266 | Marek Olšák (212):
|
---|
3267 |
|
---|
3268 | - glthread: fix an upload buffer leak
|
---|
3269 | - util: fix util_is_vbo_upload_ratio_too_large
|
---|
3270 | - mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture
|
---|
3271 | - glapi: autogenerate function parameters with no space between * and variable
|
---|
3272 | - glthread: handle GL_*_ARRAY in glEnable/Disable
|
---|
3273 | - glthread: set GL_OUT_OF_MEMORY if we fail to upload indices
|
---|
3274 | - glthread: set GL_OUT_OF_MEMORY if we fail to upload vertices
|
---|
3275 | - glthread: execute glMultiDrawArrays(draw_count < 0) asynchronously
|
---|
3276 | - glthread: change multi_draw_elements_async() to never fail due to large size
|
---|
3277 | - glthread: do vertex uploads if an index buffer is present for glDrawElements
|
---|
3278 | - mesa: move gl_vertex_format_user definition into glthread.h
|
---|
3279 | - glthread: pack and name the type of glthread_vao::Attrib
|
---|
3280 | - glthread: make marshal functions for glBegin/End attribs non-static
|
---|
3281 | - glthread: remove the vbo_upload_ratio_too_large fallback for glMultiDrawElements
|
---|
3282 | - glthread: do vertex uploads if an index buffer is present for MultiDrawElements
|
---|
3283 | - glthread: disallow glthread if buffer uploads are unsupported
|
---|
3284 | - ac/llvm: run the LLVM sinking pass because LLVM will stop running it
|
---|
3285 | - ac/llvm: run the IPSCCP pass
|
---|
3286 | - ac/llvm: remove llvm:: now that we use "using namespace llvm"
|
---|
3287 | - amd: update amdgpu_drm.h
|
---|
3288 | - ac/gpu_info: add PCIe info
|
---|
3289 | - radeonsi/ci: update gfx10.3 results
|
---|
3290 | - radeonsi/ci: add gfx1100 results
|
---|
3291 | - radeonsi: fix RB+ blending with sRGB formats
|
---|
3292 | - radeonsi/gfx11: unset SAMPLE_MASK_TRACKER_WATERMARK to fix hangs
|
---|
3293 | - amd: split GFX1103 into GFX1103_R1 and GFX1103_R2
|
---|
3294 | - amd: fix tile_swizzle on gfx11 - should be shifted by 10 bits, not 8
|
---|
3295 | - amd: update SX_BLEND_OPT_EPSILON.MRT0_EPSILON enum definitions
|
---|
3296 | - amd: update shadowed register tables for gfx11
|
---|
3297 | - amd: improve RB+ blending precision
|
---|
3298 | - radeonsi: implement RB+ depth-only rendering for better perf
|
---|
3299 | - radeonsi/gfx11: remove the INST_PREF_SIZE workaround
|
---|
3300 | - radeonsi/gfx11: add a comment why we use PRIM_GRP_SIZE <= 252
|
---|
3301 | - radeonsi/gfx11: adjust ACCUM_* fields for tessellation
|
---|
3302 | - radeonsi/gfx11: fix blend->cb_target_mask dependency for shader keys
|
---|
3303 | - radeonsi/gfx11: move the PIXEL_PIPE_STAT_CONTROL event into the GFX preambles
|
---|
3304 | - radeonsi/gfx11: use new packet EVENT_WRITE_ZPASS
|
---|
3305 | - radeonsi: deduplicate VS/TES/GS update code
|
---|
3306 | - radeonsi/gfx11: always set MSAA_NUM_SAMPLES=0 for DCC_DECOMPRESS
|
---|
3307 | - radeonsi: merge si_ps_key_update_framebuffer_blend & .._update_blend_rasterizer
|
---|
3308 | - radeonsi: determine alpha_to_coverage robustly in si_update_framebuffer_blend_rasterizer
|
---|
3309 | - radeonsi: never set INTERPOLATE_COMP_Z
|
---|
3310 | - amd: unify and tune the attribute ring size for gfx11
|
---|
3311 | - amd: change pbb_max_alloc_count for gfx11
|
---|
3312 | - amd: update the cache size for gfx1103_r1
|
---|
3313 | - amd: update late_alloc_wave64 for gfx11
|
---|
3314 | - amd: sort and re-indent packet definitions
|
---|
3315 | - amd: fix typo in shadowed uconfig registers on gfx11
|
---|
3316 | - amd: document OOB behavior on gfx11
|
---|
3317 | - amd/registers: remove confusing definitions from gfx10-rsrc.json
|
---|
3318 | - radeonsi: set NEVER as the depth compare func if depth compare is disabled
|
---|
3319 | - amd/llvm: fix LLVM 15 & 16 crashes in SelectionDAG.cpp
|
---|
3320 | - radeonsi: call ac_init_llvm_once before any util_queue initialization
|
---|
3321 | - radeonsi: set sampler COMPAT_MODE in the corresponding branch
|
---|
3322 | - amd/ci: update sanctuary trace sha1
|
---|
3323 | - radeonsi/gfx11: don't add mrt0 export for alpha-to-coverage if mrtz is present
|
---|
3324 | - radeonsi/gfx11: don't add alpha to mrt0 format for A2C if exporting via mrtz
|
---|
3325 | - amd: define new SET_*_REG_PAIRS packets
|
---|
3326 | - radeonsi: clean up si_set_mutable_tex_desc_fields
|
---|
3327 | - amd/surface: clean up is_dcc_supported_by_L2
|
---|
3328 | - amd,util: fix how lod bias is converted to fixed-point
|
---|
3329 | - amd: don't hardcode real VGPR allocation granularity on gfx10.3 and gfx11
|
---|
3330 | - glthread: track the current element array buffer in the Core profile too
|
---|
3331 | - mesa: ignore indices[i] if count[i] == 0 for MultiDrawElements
|
---|
3332 | - glthread: initialize indices[i] for no-op MultiDrawElements
|
---|
3333 | - glthread: upload non-BO indices in the core profile to fix GStreamer
|
---|
3334 | - glthread: add a heuristic to stop locking global mutexes with multiple contexts
|
---|
3335 | - glthread: ignore non-VBO vertex arrays with NULL data pointers
|
---|
3336 | - Revert "ci/zink: Disable Amnesia trace until the linked issue gets fixed."
|
---|
3337 | - glthread: rewrite glMultiDrawArrays to never fail to upload vertices
|
---|
3338 | - glthread: change glMultiDrawElements to execute draw_count < 0 asynchronously
|
---|
3339 | - glthread: don't execute glDraw code if we're inside glBegin/End
|
---|
3340 | - glthread: don't pass index bounds to the driver for async calls
|
---|
3341 | - glthread: move some draw call parameters closer to their use
|
---|
3342 | - glthread: don't bind/unbind uploaded indexbuf, pass it to glDraw directly
|
---|
3343 | - glthread: don't bind/unbind uploaded indexbuf, pass it to glMultiDraw directly
|
---|
3344 | - glthread: track vertex formats for all attributes
|
---|
3345 | - glthread: add a vertex upload path that unrolls indices for glDrawElements
|
---|
3346 | - glthread: reorder draw code a little
|
---|
3347 | - glthread: add ctx->GLThread.draw_always_async to simplify draw checking
|
---|
3348 | - glthread: remove goto statements and add unlikely() into draw functions
|
---|
3349 | - glthread: inline draw functions that have only one use
|
---|
3350 | - glthread: don't execute Draw and BufferSubData calls if the context is lost
|
---|
3351 | - glthread: handle non-VBO uploads for glMultiModeDraw{Arrays,Elements}IBM
|
---|
3352 | - glthread: add API to allow passing DrawID from glthread to mesa
|
---|
3353 | - glthread: convert (Multi)DrawIndirect into direct if user buffers are present
|
---|
3354 | - glthread: remove unnecessary debug code
|
---|
3355 | - glthread: don't free glthread for GL_DEBUG_OUTPUT_SYNCHRONOUS, only disable it
|
---|
3356 | - glthread: don't restore non-VBO vertex arrays after all draws
|
---|
3357 | - Revert "radeonsi/ci: Update stoney test expectations"
|
---|
3358 | - radeonsi: fix COMPAT_MODE on gfx8-9
|
---|
3359 | - amd: fix LOD_BIAS on gfx6-9 and adjust the lod bias CAP
|
---|
3360 | - amd: add missing gfx11 register definitions
|
---|
3361 | - amd: bump AMD_MAX_SE and change the CU mask type to 16 bits
|
---|
3362 | - radeonsi/gfx11: fix the CU_EN clear mask for RSRC4_GS
|
---|
3363 | - radeonsi/gfx11: don't set non-existent VGT_STRMOUT_BUFFER_CONFIG
|
---|
3364 | - radeonsi/gfx11: set CB_COLORi_INFO.MAX_COMP_FRAG on GFX1103_R2
|
---|
3365 | - radeonsi: move a few DB_SHADER_CONTROL states into si_shader_ps
|
---|
3366 | - radeonsi: change si_shader::ctx_reg to a nameless union for better readability
|
---|
3367 | - radeonsi: remove no-op setting of THDS_PER_SUBGRP
|
---|
3368 | - radeonsi: use SPI_SHADER_USER_DATA_HS_0 definition instead of LS_0
|
---|
3369 | - radeonsi: set PA_SU_VTX_CNTL consecutively with PA_CL_GB_VERT_CLIP_ADJ
|
---|
3370 | - radeonsi/gfx11: ignore alpha_is_on_msb because the hw ignores it
|
---|
3371 | - radeonsi: replace si_screen::has_out_of_order_rast with the radeon_info field
|
---|
3372 | - radeonsi: disable Smart Access Memory because CPU access has large overhead
|
---|
3373 | - amd,radeonsi: remove unused LLVM functions
|
---|
3374 | - amd/registers: unify VRS combiner definition names between gfx103 and gfx11
|
---|
3375 | - amd: replace SI_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN
|
---|
3376 | - radeonsi: remove returns from si_emit_global_shader_pointers
|
---|
3377 | - radeonsi: reformat emit_cb_render_state, create_blend_state, create_rs_state
|
---|
3378 | - radeonsi: remove a gfx11 check in si_shader_gs (legacy GS)
|
---|
3379 | - radeonsi: remove unused VS_STATE_LS_OUT_PATCH_SIZE
|
---|
3380 | - radeonsi: always add 1 to lshs_vertex_stride now that LS_OUT_PATCH_SIZE is gone
|
---|
3381 | - radeonsi: correct and clean up obsolete vs_state_bits comments
|
---|
3382 | - radeonsi: rename esgs_itemsize -> esgs_vertex_stride
|
---|
3383 | - amd: query the per-SIMD VGPR counts from the kernel, don't hardcode them
|
---|
3384 | - radeonsi: don't clamp z_samples to fix Unreal Tournament 99
|
---|
3385 | - amd/registers: only define SPI and COMPUTE registers in the 0xB000 range
|
---|
3386 | - radeonsi: reorganize emit_db_render_state and simplify VRS code
|
---|
3387 | - radeonsi: reorganize si_initialize_color_surface for better readability
|
---|
3388 | - radeonsi: reorganize si_init_depth_surface for better readability
|
---|
3389 | - radeonsi: don't set PACKET_TO_ONE_PA for line stippling
|
---|
3390 | - radeonsi/gfx11: change the default of COMPUTE_DISPATCH_INTERLEAVE to 256
|
---|
3391 | - amd: implement conformant TRUNC_COORD behavior for gfx11
|
---|
3392 | - amd/gpu_info: add a workaround for SI_FORCE_FAMILY=gfx1100
|
---|
3393 | - nir,amd: add and use nir_intrinsic_load_esgs_vertex_stride_amd
|
---|
3394 | - nir: lower to fragment_mask_fetch/load_amd with EQAA correctly
|
---|
3395 | - glthread: fix a perf regression due to draw_always_async flag, fix DrawIndirect
|
---|
3396 | - mesa: fix glPopClientAttrib with fixed-func VP and zero-stride varyings
|
---|
3397 | - mesa: remove a redundant call to _mesa_update_edgeflag_state_vao
|
---|
3398 | - mesa: initialize VertexProgram._VaryingInputs before the first use
|
---|
3399 | - amd: update amdgpu_drm.h
|
---|
3400 | - amd,radeonsi: change enabled_rb_mask to 64 bits
|
---|
3401 | - amd: query cache sizes from the kernel
|
---|
3402 | - ac/nir: don't use load_esgs_vertex_stride_amd on gfx6-8
|
---|
3403 | - amd: massively simplify how info->spi_cu_en is applied
|
---|
3404 | - amd/rtld: allow 64K LDS for all shader stages except for gfx6
|
---|
3405 | - radeonsi/ci: update flakes and gfx8-polaris11 results
|
---|
3406 | - radeonsi: remove Smart Access Memory because CPU access has large overhead
|
---|
3407 | - radeonsi: reorganize si_emit_framebuffer_state for better readability
|
---|
3408 | - radeonsi: don't merge SET_* packets that have a different index in si_pm4_state
|
---|
3409 | - radeonsi: reindent code in si_state_binning.c
|
---|
3410 | - radeonsi: add si_pm4_set_reg_va to simplify setting reg_va_low_idx for RGP
|
---|
3411 | - radeonsi: check the pm4.reg_va_low_idx assertion unconditionally
|
---|
3412 | - radeonsi: simplify encoding VGPRS and SGPRS
|
---|
3413 | - radeonsi: assume shader is never NULL in si_emit_shader_*
|
---|
3414 | - nir: return progress from nir_lower_io_to_scalar
|
---|
3415 | - nir: skip nir_op_unpack_32_4x8 in nir_lower_alu_width
|
---|
3416 | - ac/nir: add ac_nir_lower_subdword_loads to lower 8/16-bit loads to 32 bits
|
---|
3417 | - aco: implement nir_op_unpack_32_4x8
|
---|
3418 | - ac/llvm: implement nir_op_unpack_32_4x8
|
---|
3419 | - amd: lower subdword UBO loads in NIR
|
---|
3420 | - amd: lower multi-component subdword SSBO loads in NIR
|
---|
3421 | - lavapipe/ci: add a new flake
|
---|
3422 | - amd: add nir_intrinsic_xfb_counter_sub_amd and fix overflowed streamout offsets
|
---|
3423 | - amd/llvm,radeonsi/gfx11: switch to using GDS_STRMOUT registers
|
---|
3424 | - radeonsi/gfx11: only allocate GDS OA for streamout, GDS memory is not needed
|
---|
3425 | - radeonsi: emulate VGT_ESGS_RING_ITEMSIZE in the shader on gfx9-11
|
---|
3426 | - radeonsi: merge si_emit_initial_compute_regs with si_init_cs_preamble_state
|
---|
3427 | - radeonsi: separate nir_texop_descriptor_amd lowering
|
---|
3428 | - radeonsi: lower nir_texop_sampler_descriptor_amd
|
---|
3429 | - radeonsi: set pm4.atom.emit in si_get_shader_pm4_state
|
---|
3430 | - radeonsi: reindent si_shader_ls, si_shader_es, si_shader_gs, si_shader_vs
|
---|
3431 | - radeonsi: reorganize si_shader_hs
|
---|
3432 | - radeonsi: reorganize si_shader_ngg
|
---|
3433 | - radeonsi: reorganize si_shader_ps
|
---|
3434 | - radeonsi: other cosmetic changes in si_state_shaders.cpp
|
---|
3435 | - radeonsi: allow using 64K LDS for NGG to allow larger workgroups
|
---|
3436 | - radeonsi: increase NGG workgroup size to 256 for VS/TES with streamout and GS
|
---|
3437 | - glapi: move files specific to shared-glapi into the shared-glapi subdirectory
|
---|
3438 | - glapi: inline the meson list files_mapi_util
|
---|
3439 | - mesa: move ctx->Table -> ctx->Dispatch.Table except Client & MarshalExec
|
---|
3440 | - mesa: rename CurrentClientDispatch to GLApi
|
---|
3441 | - mesa: put dispatch table initialization into one place
|
---|
3442 | - glthread: qualify the \*cmd unmarshal parameter with restrict
|
---|
3443 | - vbo: fix current attribs not updating gallium vertex elements
|
---|
3444 | - radeonsi: remove unused TCS/TES SGPR fields
|
---|
3445 | - radeonsi: dump shader stats only if dumping asm shaders
|
---|
3446 | - radeonsi: replace nonir,noir,noasm,preoptir options with new reworked options
|
---|
3447 | - radeonsi: remove duplicated gfx11 check in si_msaa_resolve_blit_via_CB
|
---|
3448 | - radeonsi: rework MSAA resolve averaging to exploit instruction-level parallelism
|
---|
3449 | - radeonsi: add AMD_DEBUG=nowcstream to enable caching for stream_uploader
|
---|
3450 | - radeonsi: don't print the base non-view texture format for AMD_TEST=computeblit
|
---|
3451 | - radeonsi: fix AMD_TEST=computeblit being rejected on gfx < 11
|
---|
3452 | - radeonsi: don't convert to fp16 in the compute blit if not testing
|
---|
3453 | - radeonsi: don't use fp16_rtz for FP formats in the compute blit
|
---|
3454 | - radeonsi: correct an assertion if we get a display list with no vertex buffers
|
---|
3455 | - ac/nir: don't emit duplicated parameter exports
|
---|
3456 | - ac/nir: use plural correctly in the ac_nir_export_parameters name
|
---|
3457 | - radeonsi: remove unused vs_output_param_mask
|
---|
3458 | - egl: reorder code in _eglQueryDevicesEXT, add \*swrast variable
|
---|
3459 | - egl: don't expose swrast device if swrast is not built
|
---|
3460 | - amd/llvm: fix handling of unsupported vec3 loads on gfx6
|
---|
3461 | - amd/llvm: remove no-op code for vec3 loads in ac_build_tbuffer_load
|
---|
3462 | - amd: update addrlib
|
---|
3463 | - amd: rename GFX1036 -> RAPHAEL_MENDOCINO
|
---|
3464 | - amd: set the correct LLVM processor name for gfx1036
|
---|
3465 | - radeonsi/gfx11: reduce MSAA samples to 8 for no-attachment framebuffer
|
---|
3466 | - radeonsi: simplify binning settings to work around GPU hangs
|
---|
3467 | - amd: add gfx940 register definitions
|
---|
3468 | - amd: add initial code for gfx940
|
---|
3469 | - radeonsi: use COMPUTE_DISPATCH_SCRATCH_BASE on gfx940
|
---|
3470 | - radeonsi: always use ffma32 on gfx940
|
---|
3471 | - ac/surface: force linear image layout for chips not supporting image opcodes
|
---|
3472 | - radeonsi: add an emulated image descriptor for gfx940
|
---|
3473 | - ac/nir: implement image opcode emulation for CDNA, enable it in radeonsi
|
---|
3474 | - radeonsi: don't set registers that don't exist on gfx940
|
---|
3475 | - amd/registers: simplify integer division by 0x1000 in the parser
|
---|
3476 | - amd/registers: fix the parser to include CP_COHER registers for gfx940
|
---|
3477 | - amd/registers: update gfx940.json
|
---|
3478 | - amd/registers: use gfx9 packet definitions for gfx940
|
---|
3479 | - nir: fix 2 bugs in nir_create_passthrough_tcs
|
---|
3480 |
|
---|
3481 | Mario Kleiner (1):
|
---|
3482 |
|
---|
3483 | - v3dv: Enable (leased) direct display extensions.
|
---|
3484 |
|
---|
3485 | Mark Collins (4):
|
---|
3486 |
|
---|
3487 | - meson: update flex/bison requirement to cover all usages
|
---|
3488 | - meson: forcefully disable libdrm when host doesn't have it
|
---|
3489 | - tu: KGSL backend rewrite
|
---|
3490 | - tu: fix tu_GetInstanceProcAddr not handling null instance
|
---|
3491 |
|
---|
3492 | Mark Janes (13):
|
---|
3493 |
|
---|
3494 | - intel: Implement Wa_16011448509
|
---|
3495 | - util: add macro to support gcc/clang poison
|
---|
3496 | - intel/dev: generate helpers to identify platform workarounds
|
---|
3497 | - intel/dev: Print required workarounds with intel_dev_info
|
---|
3498 | - intel/fs: use generated workaround helpers for Wa_14010017096
|
---|
3499 | - intel/fs: use generated helpers for Wa_1209978020 / Wa_18012201914
|
---|
3500 | - intel/fs: use generated workaround helpers for Wa_14017989577
|
---|
3501 | - intel: use generated workaround helpers for Wa_1409600907
|
---|
3502 | - intel: use generated helpers for Wa_1409433168/Wa_16011107343
|
---|
3503 | - intel/fs: use generated helpers for Wa_14013363432 / Wa_14012688258
|
---|
3504 | - intel/dev: fix macro string concatenation for INTEL_WA_{id}_GFX_VER
|
---|
3505 | - intel/dev: fix macro naming convention in gen_wa_helpers.py
|
---|
3506 | - intel/dev: use GFX_VERx10 to detect genX compilation
|
---|
3507 |
|
---|
3508 | Martin Roukala (né Peres) (22):
|
---|
3509 |
|
---|
3510 | - ci/deqp-runner: compress results.csv before uploading it to GitLab
|
---|
3511 | - ci/piglit: compress results.csv before uploading it to GitLab
|
---|
3512 | - zink/ci/radv: remove a test from the fails list
|
---|
3513 | - zink/ci: add a fail to the VG flake list
|
---|
3514 | - zink/ci: relocate radv testing from radv's gitlab-ci.yml
|
---|
3515 | - zink/ci: add spec\@!opengl 1.1\@line-smooth-stipple to the fails list
|
---|
3516 | - ci/b2c: uprev to b2c v0.9.9
|
---|
3517 | - ci/debian/x86_test-vk: drop an outdated dependency
|
---|
3518 | - ci/core-manual-rules: enclose the whole condition in quotes
|
---|
3519 | - zink/ci: allow running manual jobs again on RADV
|
---|
3520 | - ci/init-stage2: allow sourcing the job env vars from the CWD
|
---|
3521 | - ci/init-stage2: always set XDG_RUNTIME_DIR
|
---|
3522 | - ci/b2c: move away from the hand-rolled initscript
|
---|
3523 | - ci: bring back the valve farm online
|
---|
3524 | - ci/valve-farm-rules: allow running jobs from outside the mesa namespace
|
---|
3525 | - radv/ci: reduce the parallelism for vkcts-vangogh
|
---|
3526 | - zink/ci: increase the parallelism of zink-radv-vangogh-valve
|
---|
3527 | - zink/ci: update the radv expectations
|
---|
3528 | - radv/ci: update VanGogh's expectations
|
---|
3529 | - ci/b2c: increase the console timeout to 4 minutes
|
---|
3530 | - radv/ci: update the navi10 expectations
|
---|
3531 | - zink/ci: add a test to the fails list
|
---|
3532 |
|
---|
3533 | Matt Coster (6):
|
---|
3534 |
|
---|
3535 | - pvr: Extract setup of winsys job submit flags into separate functions
|
---|
3536 | - pvr: Add support for geometry-only render jobs
|
---|
3537 | - pvr: Add pvr_csb_bake()
|
---|
3538 | - pvr: Rename global_queue_job_count to global_cmd_buffer_submit_count
|
---|
3539 | - pvr: Split render job submission for multi-layer framebuffers
|
---|
3540 | - pvr: Add firmware stream support for transfer submit
|
---|
3541 |
|
---|
3542 | Matthieu Bouron (1):
|
---|
3543 |
|
---|
3544 | - lavapipe: honor dst base array layer when resolving color attachments
|
---|
3545 |
|
---|
3546 | Mauro Rossi (1):
|
---|
3547 |
|
---|
3548 | - hasvk: include "vk_android.h" header in anv_android.c
|
---|
3549 |
|
---|
3550 | Maíra Canal (1):
|
---|
3551 |
|
---|
3552 | - v3dv: remove unused clamp_to_transparent_black_border property
|
---|
3553 |
|
---|
3554 | Michel Dänzer (29):
|
---|
3555 |
|
---|
3556 | - mesa/st: Fix GL_EXT_texture_type_2_10_10_10_REV name in comment
|
---|
3557 | - mesa/st: Handle all 10 bpc types in st_choose_format
|
---|
3558 | - glsl/standalone: Fix up _mesa_reference_shader_program_data signature
|
---|
3559 | - glsl/standalone: Do not pass memory allocated with ralloc_size to free
|
---|
3560 | - anv/grl: Use union for reinterpreting integer as float
|
---|
3561 | - clover: Reserve vector memory in make_text_section
|
---|
3562 | - ci: Update Fedora image to 36
|
---|
3563 | - ci: Re-enable intel-clc in fedora-release job
|
---|
3564 | - ci: Enable i915 Gallium driver in fedora-release job
|
---|
3565 | - ci: Enable the hasvk Vulkan driver in the fedora-release job
|
---|
3566 | - frontend/dri: Initialize callbacks in dri_swrast_kms_init_screen
|
---|
3567 | - nouveau: Make getSize return unsigned int
|
---|
3568 | - r600: Use container_of instead of direct pointer cast
|
---|
3569 | - crocus: Use ralloc_free for memory allocated with rzalloc
|
---|
3570 | - iris: Use ralloc_free for memory allocated with rzalloc
|
---|
3571 | - ci: Remove some -Werror workarounds for debian-android job
|
---|
3572 | - ci: Split up -Werror workarounds for debian-mingw32-x86_64 job
|
---|
3573 | - intel/vk/grl: Do not use no_override_init_args for C++
|
---|
3574 | - ci: Pass -Werror to compiler linking stage for LTO
|
---|
3575 | - ci: Allow passing c{,pp}_link_args to meson
|
---|
3576 | - ci: Make ccache optional
|
---|
3577 | - ci: Drop ccache from Fedora image
|
---|
3578 | - ci: Install procps-ng in Fedora image
|
---|
3579 | - ci: Enable LTO for fedora-release job
|
---|
3580 | - vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions
|
---|
3581 | - svga: Make vmw_svga_winsys_buffer_map definition match declaration
|
---|
3582 | - svga: Make declaration of emit_input_declaration match definition
|
---|
3583 | - clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16
|
---|
3584 | - clover/llvm: Use std::nullopt already with LLVM 16
|
---|
3585 |
|
---|
3586 | Michel Zou (4):
|
---|
3587 |
|
---|
3588 | - ci/mingw: drop useless -Wno-error flags
|
---|
3589 | - vulkan/wsi: fix -Wnarrowing warning
|
---|
3590 | - vk/entry_points:: fix mingw build
|
---|
3591 | - mesa/draw: fix -Wformat warning
|
---|
3592 |
|
---|
3593 | Mike Blumenkrantz (536):
|
---|
3594 |
|
---|
3595 | - zink: simplify get_slot_components() for xfb emission
|
---|
3596 | - zink: add renderdoc handling
|
---|
3597 | - zink: prune old swapchains on present
|
---|
3598 | - zink: break out implicit feedback loop detection into separate function
|
---|
3599 | - zink: set textures_used in analyze_io
|
---|
3600 | - zink: outdent code in add_implicit_color_feedback_loop()
|
---|
3601 | - zink: make implicit feedback loop application stricter
|
---|
3602 | - zink: skip implicit feedback loop layout changes if feedback loop not present
|
---|
3603 | - zink: store drm format as internal_format for imported resources
|
---|
3604 | - zink: handle modifier nplanes queries correctly for planar formats
|
---|
3605 | - zink: NV_compute_shader_derivatives
|
---|
3606 | - zink: preserve present resources during async presentation
|
---|
3607 | - zink: add a util function for creating semaphores
|
---|
3608 | - zink: add a binary semaphore cache
|
---|
3609 | - zink: move semaphore caching to zink_reset_batch_state()
|
---|
3610 | - zink: consolidate semaphore creation where possible
|
---|
3611 | - zink: simplify some dynarray concat descriptor code
|
---|
3612 | - zink: delete need_blend_constants
|
---|
3613 | - zink: don't use ds3 blend states without color attachments
|
---|
3614 | - radv: repack radv_graphics_pipeline struct
|
---|
3615 | - radv: reorder dynamic state checks during bind
|
---|
3616 | - radv: simplify depth aspect check in radv_handle_image_transition()
|
---|
3617 | - radv: add some graphics pipeline hints to optimize pipeline bind
|
---|
3618 | - radv: remove redundant type sizing
|
---|
3619 | - radv: add an early out in radv_cmd_buffer_flush_dynamic_state()
|
---|
3620 | - zink: use actual swapchain object for surface comparison
|
---|
3621 | - radv: stop using radv_pipeline_has_stage() in BindPipeline
|
---|
3622 | - zink: flag old-style shadow tex mask for fragment shaders
|
---|
3623 | - zink: break out tex dest rewriting into separate function
|
---|
3624 | - zink: add an extra_data param to zink_shader_compile
|
---|
3625 | - zink: track depth swizzle on samplerviews
|
---|
3626 | - zink: add a fs shader key member to indicate depth texturing mode
|
---|
3627 | - zink: rework depth sampler splatting in shaders
|
---|
3628 | - zink: block pipeline fast-pathing for any programs using depth texture modes
|
---|
3629 | - zink: plug in the program/module parts of shadow texture mode emulation
|
---|
3630 | - zink: create another samplerview for shadow textures
|
---|
3631 | - zink: remove old depth swizzle workaround
|
---|
3632 | - zink: pass depth swizzle data block to shader compile
|
---|
3633 | - mesa: remove dead parameter doc for _mesa_new_texture_object()
|
---|
3634 | - mesa: populate gl_program::ShadowSamplers mask from shader data
|
---|
3635 | - mesa: (more) correctly handle incomplete depth textures
|
---|
3636 | - zink: fix implicit feedback loop detection
|
---|
3637 | - radv: Move constant flushing check out to callers.
|
---|
3638 | - zink: fix VK_DYNAMIC_STATE_LINE_WIDTH usage
|
---|
3639 | - zink: move barrier jit to zink_context.c
|
---|
3640 | - zink: don't skip repeated handling feedback loops
|
---|
3641 | - zink: return false for implicit feedback loop check with image binds
|
---|
3642 | - zink: update sampler layout when detecting feedback loop for first time
|
---|
3643 | - zink: force GENERAL layout for all fb attachments with image binds
|
---|
3644 | - zink: validation ci updates
|
---|
3645 | - zink: reorder commands more aggressively
|
---|
3646 | - Revert "zink: allow direct memory mapping for any COHERENT+CACHED buffer"
|
---|
3647 | - zink: fix heap/memory type selection
|
---|
3648 | - zink: add VK_BUFFER_USAGE_SHADER_DEVICE_ADDRESS_BIT for buffers if ext is enabled
|
---|
3649 | - zink: set vkusage/vkflags for buffer resource objects
|
---|
3650 | - zink: assert that buffer descriptor usage is populated before bind
|
---|
3651 | - zink: always set RESOURCE usage for descriptor buffers
|
---|
3652 | - zink: disable bindless texture ext with descriptor buffer
|
---|
3653 | - zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT when using DB
|
---|
3654 | - zink: don't add dmabuf export type if dmabuf isn't supported
|
---|
3655 | - zink: stop leaking push descriptor templates
|
---|
3656 | - zink: always unref old images when adding new binds
|
---|
3657 | - zink: hook up VK_EXT_multisampled_render_to_single_sampled
|
---|
3658 | - zink: shrink zink_render_pass_state::msaa_expand_mask
|
---|
3659 | - zink: use VK_EXT_multisampled_render_to_single_sampled for EXT_multisample_render_to_texture
|
---|
3660 | - lavapipe: move noop fs creation to device
|
---|
3661 | - lavapipe: add refcounting for shader nir
|
---|
3662 | - lavapipe: refcount nir shaders instead of cloning
|
---|
3663 | - lavapipe: break out (and slightly refactor) gallium shader cso creation
|
---|
3664 | - lavapipe: create gfx gallium csos at pipeline bind
|
---|
3665 | - lavapipe: delete unused pipelines immediately
|
---|
3666 | - lavapipe: delete lvp_pipeline::mem_ctx
|
---|
3667 | - lavapipe: try harder to reuse pipeline layouts during merge
|
---|
3668 | - zink: only set VkPipelineColorBlendStateCreateInfo::attachmentCount without full ds3
|
---|
3669 | - zink: fix zink_mem_type_idx_from_bits()
|
---|
3670 | - zink: rework descriptor buffer templating to use offsets
|
---|
3671 | - Revert "zink: fix zink_mem_type_idx_from_bits()"
|
---|
3672 | - zink: enable PIPE_CAP_ALLOW_GLTHREAD_BUFFER_SUBDATA_OPT
|
---|
3673 | - zink: make bindless buffer_infos a union
|
---|
3674 | - zink: fix bindless struct member comments
|
---|
3675 | - zink: skip updating descriptor buffer sets that aren't active
|
---|
3676 | - zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT on compute pipelines
|
---|
3677 | - zink: break out descriptor binding into separate function
|
---|
3678 | - zink: add a flag to indicate whether a descriptor buffer is bound
|
---|
3679 | - zink: implement descriptor buffer handling of bindless texture
|
---|
3680 | - zink: enable bindless texture with ZINK_DESCRIPTORS=db
|
---|
3681 | - zink: free descriptor buffer maps on batch state destroy
|
---|
3682 | - zink: fix more cases of heap/memtype suballocator mismatch
|
---|
3683 | - zink: cache and reuse dummy inputattachment for fbfetch
|
---|
3684 | - zink: handle missing line rasterization modes with ds3
|
---|
3685 | - zink: add back VK_DESCRIPTOR_BINDING_PARTIALLY_BOUND_BIT for bindless
|
---|
3686 | - gallium: add PIPE_CAP_NULL_TEXTURES
|
---|
3687 | - radeonsi: set PIPE_CAP_NULL_TEXTURES
|
---|
3688 | - zink: conditionally enable PIPE_CAP_NULL_TEXTURES
|
---|
3689 | - zink: fix max acquired image count
|
---|
3690 | - lavapipe: disable VK_FORMAT_FEATURE_2_COLOR_ATTACHMENT_BLEND_BIT for int formats
|
---|
3691 | - zink: set PIPE_CAP_VALIDATE_ALL_DIRTY_STATES
|
---|
3692 | - zink: move bindless_layout to screen and init on creation
|
---|
3693 | - zink: take screen param in init_db_template_entry()
|
---|
3694 | - zink: const-ify a bunch of shader key inlines
|
---|
3695 | - zink: move gpl usability checks to static inline for reuse
|
---|
3696 | - zink: remove duplicated gpl output blend initializations
|
---|
3697 | - zink: store last pipeline directly for zink_gfx_program::last_pipeline
|
---|
3698 | - zink: pass screen to descriptor_util_pool_key_get()
|
---|
3699 | - zink: delete zink_screen::framebuffer_cache
|
---|
3700 | - zink: make last_vertex_stage the first bit in zink_vs_key_base
|
---|
3701 | - zink: ralloc zink_shader structs
|
---|
3702 | - zink: add a define for the "default" optimal key
|
---|
3703 | - zink: add a define for testing that an optimal key is the default
|
---|
3704 | - zink: add VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT for gpl libs
|
---|
3705 | - zink: don't set blend_id with full_ds3
|
---|
3706 | - zink: set gfx feedback loop bit in pipeline state for driver workaround
|
---|
3707 | - zink: set zs feedback loop bit from driver workaround on ctx create
|
---|
3708 | - zink: fix gpl lib hashing
|
---|
3709 | - zink: use screen indexing for bindless descriptor set in db bind
|
---|
3710 | - zink: use screen indexing for bindless descriptor set in template bind
|
---|
3711 | - util/vbuf: fix multidraw unrolling
|
---|
3712 | - zink: flag bindless_init before calling zink_batch_bind_db() in init
|
---|
3713 | - zink: avoid the descriptor set multiplier for bindless buffers
|
---|
3714 | - zink: split out VkShaderModule creation
|
---|
3715 | - zink: add flags param to zink_pipeline_layout_create()
|
---|
3716 | - zink: split out gfx pipeline library creation
|
---|
3717 | - zink: add gpl flags for libraries based on shaders passed
|
---|
3718 | - zink: allow multiple gpl libraries in zink_create_gfx_pipeline_combined()
|
---|
3719 | - zink: move gpl input/output funcs to zink_pipeline.c
|
---|
3720 | - zink: enable combining intermediate gpl libs from combine function
|
---|
3721 | - zink: use GPL to handle (simple) separate shader objects
|
---|
3722 | - zink: set PIPE_CAP_SURFACE_REINTERPRET_BLOCKS
|
---|
3723 | - zink: store gfx_hash on zink_gfx_program
|
---|
3724 | - zink: break out zink_gfx_program::libs into refcounted object
|
---|
3725 | - zink: implement cross-program pipeline library sharing
|
---|
3726 | - zink: add newlines to some debug printfs
|
---|
3727 | - zink: rename some variables in zink_set_shader_images()
|
---|
3728 | - zink: unset gfx shader read when unbinding shader images
|
---|
3729 | - zink: remove stale comment
|
---|
3730 | - zink: unref image buffer descriptors on unbind
|
---|
3731 | - zink: rework set_shader_images() hook
|
---|
3732 | - zink: pull out image descriptor updating in set_shader_images
|
---|
3733 | - zink: add a local is_compute var for set_shader_images
|
---|
3734 | - zink: allocate all batch command buffers in one call
|
---|
3735 | - zink: sync LTO compiles for GPL pipelines on shader free
|
---|
3736 | - zink: fix descriptor pool free iterating
|
---|
3737 | - zink: don't fetch/update pipeline cache for separate shader programs
|
---|
3738 | - zink: assert that the found program matches the expected one in shader_free
|
---|
3739 | - zink: flag gfx programs as removed-from-cache by default
|
---|
3740 | - zink: rework separate shader descriptor iterating
|
---|
3741 | - zink: use a single descriptor buffer for all non-bindless types
|
---|
3742 | - zink: add an io assignment pass for separate shaders
|
---|
3743 | - zink: rename a struct member for clarity
|
---|
3744 | - zink: move zink_batch_state::db_bound reset to zink_batch_descriptor_reset()
|
---|
3745 | - zink: move db_bound to batch descriptor data
|
---|
3746 | - zink: ensure db is bound before separate shader update
|
---|
3747 | - zink: store base descriptor size on the screen
|
---|
3748 | - zink: implement a scaling descriptor buffer size
|
---|
3749 | - zink: calloc separable program zink_gfx_library_key struct
|
---|
3750 | - zink: also replace hash_entry::key when replacing separable program
|
---|
3751 | - zink: always use NEAREST for zs blits
|
---|
3752 | - zink: fix indentation of rebind_image()
|
---|
3753 | - zink: only try for a fb rebind if fb binds exist in rebind_image()
|
---|
3754 | - zink: account for null surface when trying to retain clears on fb bind
|
---|
3755 | - zink: break out pipe_surface init for new surface creation
|
---|
3756 | - zink: const-ify a surface param
|
---|
3757 | - zink: don't handle mutable init on surface creation with tc enabled
|
---|
3758 | - zink: verify compressed format layer count when creating surfaces
|
---|
3759 | - zink: set ZINK_DESCRIPTORS=db for radv jobs
|
---|
3760 | - zink: enable renderpass optimizations by default for selected drivers
|
---|
3761 | - Revert "zink: always use NEAREST for zs blits"
|
---|
3762 | - zink: block LINEAR filtered blits for zs formats
|
---|
3763 | - lavapipe: enable linear filtering for depth formats
|
---|
3764 | - gallium: plumb resolve attachments through from frontends -> pipe_framebuffer_state
|
---|
3765 | - dri3: avoid deadlocking when polling deleted windows for events
|
---|
3766 | - vulkan/wsi: avoid deadlocking dri3 when polling deleted windows for events
|
---|
3767 | - vl/dri3: avoid deadlocking when polling deleted windows for events
|
---|
3768 | - zink: delete some now-broken ntv dref sampling code
|
---|
3769 | - zink: more accurately handle i/o for separate shaders
|
---|
3770 | - zink: handle semi-matching i/o for separate shaders
|
---|
3771 | - zink: enable renderpass optimizing on lavapipe
|
---|
3772 | - kopper: fix loop iterating for msaa texture creation
|
---|
3773 | - zink: move db input attachment size check to screen init
|
---|
3774 | - zink: make ZINK_DESCRIPTOR_MODE=db the default
|
---|
3775 | - driconf: add zink glthread disable for a game
|
---|
3776 | - aux/tc: add a 'has_resolve' member to tc_renderpass_info
|
---|
3777 | - zink: actually hook up ZINK_DEBUG=norp
|
---|
3778 | - zink: add ZINK_DEBUG=map
|
---|
3779 | - zink: add debug marker tracing for qbo updates
|
---|
3780 | - util/box: add intersection test functions for 1d/3d
|
---|
3781 | - zink: add some tracking for copy box regions
|
---|
3782 | - zink: add a util function for optimizing TRANSFER_DST image barriers
|
---|
3783 | - zink: utilize copy box tracking to avoid barrier emission for buf2img copies
|
---|
3784 | - zink: fix slab allocator sizing
|
---|
3785 | - zink: delete dead uniform variables
|
---|
3786 | - zink: fix shader read access removal for barrier generation
|
---|
3787 | - zink: rework descriptor unbind params to use is_compute directly
|
---|
3788 | - zink: fix bindless texture barrier generation
|
---|
3789 | - zink: delete unused emit_image param in ntv
|
---|
3790 | - zink: simplify/rework image typing in ntv
|
---|
3791 | - zink: avoid adding ubo/ssbo bindings multiple times for different bitsizes
|
---|
3792 | - zink: add locking for zink_screen::copy_context and defer creation
|
---|
3793 | - zink: allow direct memory mapping for any COHERENT+CACHED buffer
|
---|
3794 | - lavapipe: EXT_image_sliced_view_of_3d
|
---|
3795 | - vulkan/wsi: switch to using an options struct for last param
|
---|
3796 | - vulkan/wsi/x11: make 4 image minimum for xwayland driver-specific
|
---|
3797 | - docs: add pipeline library support for tu
|
---|
3798 | - aux/tc: track whether queries have been terminated in a renderpass
|
---|
3799 | - aux/tc: only call tc_parse_draw() when parsing renderpass info
|
---|
3800 | - zink: move zink_batch_no_rp call for query reset
|
---|
3801 | - zink: remove suspended queries from list before resuming
|
---|
3802 | - zink: reset queries on the promoted cmdbuf when possible
|
---|
3803 | - zink: rewrite zink_query_start struct
|
---|
3804 | - zink: fix zink_query_start initialization
|
---|
3805 | - zink: fix possible query destroy leak
|
---|
3806 | - zink: make zink_vk_query unref consistent
|
---|
3807 | - zink: delete zink_query::last_start_idx
|
---|
3808 | - zink: handle multiple query starts in qbo update
|
---|
3809 | - zink: don't auto-sync qbos on query end
|
---|
3810 | - zink: add zink_query::suspended to indicate suspended state of queries
|
---|
3811 | - zink: un-suspend queries if they end while suspended
|
---|
3812 | - zink: don't double suspend queries
|
---|
3813 | - zink: refuse to start cs invocation queries in renderpass
|
---|
3814 | - zink: only try doing qbo updates on query suspend if !in_rp
|
---|
3815 | - zink: handle null query results
|
---|
3816 | - zink: handle null query results for conditional render
|
---|
3817 | - zink: only update qbo for TIME_ELAPSED on start if !in_rp
|
---|
3818 | - zink: try updating qbos on query resume if !in_rp
|
---|
3819 | - zink: reorder some query code
|
---|
3820 | - zink: rework find_or_allocate_qp()
|
---|
3821 | - zink: create/use query pools dynamically
|
---|
3822 | - zink: rework query pool overflow
|
---|
3823 | - zink: track whether queries were started in a renderpass
|
---|
3824 | - zink: break out query suspend functionality for reuse
|
---|
3825 | - zink: use more consistent check for deleting zink_query::stats_list links
|
---|
3826 | - zink: pull 'was_line_loop' into ctx for query updating
|
---|
3827 | - zink: always start/stop/resume queries inside renderpasses
|
---|
3828 | - zink: use tc renderpass optimizing to more optimally start queries
|
---|
3829 | - zink: skip buffer barriers for ACCESS_NONE -> ACCESS_READ / ACCESS_WRITE
|
---|
3830 | - zink: disable queries for clear_texture()
|
---|
3831 | - zink: resume queries after conditional render and clears are processed
|
---|
3832 | - zink: only resume queries inside renderpasses from set_active_query_state
|
---|
3833 | - zink: track whether a primgen query is suspended and needing color write hacks
|
---|
3834 | - zink: rework xfb queries for drivers with poor primgen support
|
---|
3835 | - zink: merge qbo update copies when possible
|
---|
3836 | - zink: set predicate_dirty on query creation
|
---|
3837 | - zink: eliminate internal qbo copy barrier
|
---|
3838 | - util/box: fix off-by-one calc error in intersection funcs
|
---|
3839 | - zink: avoid ballooning of copy box tracking
|
---|
3840 | - zink: add perfetto tracing for barriers
|
---|
3841 | - zink: avoid unnecessary read-only layout changes for zs attachments
|
---|
3842 | - zink: skip attachment barrier for redundant layout-setting if !valid
|
---|
3843 | - zink: add functions for faster batch-usage completion checks
|
---|
3844 | - zink: fix zink_resource_access_is_write()
|
---|
3845 | - zink: use split image barriers if the image can be easily proved idle
|
---|
3846 | - zink: skip buffer barriers if the buffer can be easily proved idle
|
---|
3847 | - zink: determine whether debug markers are used on screen create
|
---|
3848 | - zink: be more descriptive with perfetto buffer barriers
|
---|
3849 | - zink: reorder some blit debug markers
|
---|
3850 | - zink: pass cmdbuf to debug marker begin
|
---|
3851 | - zink: remove debug markers for u_blitter ops
|
---|
3852 | - zink: when skipping a TRANSFER_DST image barrier, set access tracking
|
---|
3853 | - zink: add frame trace markers on the queue
|
---|
3854 | - zink: unbind fb on context destroy
|
---|
3855 | - zink: only add deferred barrier on fb unbind when layout needs to change
|
---|
3856 | - zink: fix descriptor update flagging on null ssbo set
|
---|
3857 | - zink: propagate valid_buffer_range when replacing buffer storage
|
---|
3858 | - zink: check for layout updates when unbinding samplerviews
|
---|
3859 | - zink: eliminate pre barrier for adding resource binds
|
---|
3860 | - zink: don't unset existing access when adding resource binds
|
---|
3861 | - zink: explicitly flush src clears when u_blittering
|
---|
3862 | - zink: always set color writes on the unordered cmdbuf
|
---|
3863 | - zink: bind descriptor buffers to unordered cmdbuf
|
---|
3864 | - zink: set dynamic pcp for unordered cmdbuf
|
---|
3865 | - zink: always set batch usage for descriptors after barrier
|
---|
3866 | - llvmpipe: fix LP_PERF=no_depth to ignore depth format
|
---|
3867 | - zink: track the last write access for resources
|
---|
3868 | - zink: add a mechanism for managing TRANSFER_DST buffer barriers
|
---|
3869 | - zink: add a mechanism to trigger copy box resets from batch state reset
|
---|
3870 | - zink: add a driver workaround to disable copy box optimizations
|
---|
3871 | - zink: hook up buffer TRANSFER_DST barrier optimizing
|
---|
3872 | - aux/tc: fix rp info resizing clobbering current info
|
---|
3873 | - vulkan/wsi: fix crash in failed swapchain creation for wayland
|
---|
3874 | - lavapipe: split out spirv compile of shaders
|
---|
3875 | - lavapipe: split out shader struct members into their own struct
|
---|
3876 | - lavapipe: pass shader struct and layout to scan_pipeline_info()
|
---|
3877 | - lavapipe: more small shader struct usage tweaks
|
---|
3878 | - lavapipe: move xfb init to shader struct
|
---|
3879 | - lavapipe: rename inline uniform function params
|
---|
3880 | - lavapipe: move uniform inline functions to shader struct
|
---|
3881 | - lavapipe: break out main shader lowering into separate function
|
---|
3882 | - Revert "Revert "ci: disable mesa-swrast runner jobs""
|
---|
3883 | - zink: ignore renderdoc if ZINK_RENDERDOC isn't in use
|
---|
3884 | - radv: delete radv_graphics_pipeline_compile() asserts
|
---|
3885 | - radv: avoid a huge memset in radv_graphics_pipeline_compile()
|
---|
3886 | - aux/tc: use renderpass tracking to optimize texture_subdata calls
|
---|
3887 | - lavapipe: fix dynamic depth clamping
|
---|
3888 | - lavapipe: set render_condition_enabled=false for vkCmdClearDepthStencilImage
|
---|
3889 | - lavapipe: add command debugging
|
---|
3890 | - lavapipe: beef up LVP_POISON_MEMORY
|
---|
3891 | - ci: fix LVP_POISON_MEMORY usage
|
---|
3892 | - zink: rework zink_resource::valid_buffer_range
|
---|
3893 | - zink: return the unordered state from zink_resource_buffer_transfer_dst_barrier()
|
---|
3894 | - zink: unify image TRANSFER_DST barrier checks
|
---|
3895 | - zink: rename zink_check_transfer_dst_barrier()
|
---|
3896 | - zink: super reorder buffer copies
|
---|
3897 | - tu: don't set startup debug on debug builds
|
---|
3898 | - zink: fix copy box iteration
|
---|
3899 | - glthread: align small buffer uploads to 4 bytes
|
---|
3900 | - zink: fix copy box iteration when adding
|
---|
3901 | - zink: fix copy box merging adjacency
|
---|
3902 | - aux/trace: delete GALLIUM_TRACE_NIR log message
|
---|
3903 | - zink: fix layer check for compressed format surface creation
|
---|
3904 | - zink: track current queue for resources
|
---|
3905 | - zink: remove redundant dmabuf_acquire setting
|
---|
3906 | - zink: use res->queue to auto-handle queue transitions back to gfx queue
|
---|
3907 | - zink: track tc fences better
|
---|
3908 | - zink: add an assert to catch renderpass optimizing bugs
|
---|
3909 | - zink: don't use/update tc rp info while blitting
|
---|
3910 | - zink: reset tc fb info upon splitting a renderpass
|
---|
3911 | - zink: add and use a function for "safely" ending renderpasses
|
---|
3912 | - zink: disable queries when flushing clears from set_fb
|
---|
3913 | - zink: disable tc flush notify with rp optimizing
|
---|
3914 | - zink: trigger oom flushes more aggressively from copy ops
|
---|
3915 | - aux/tc: flag late zs clears as partial clears
|
---|
3916 | - aux/tc: use a local 'deferred' variable in tc_flush()
|
---|
3917 | - aux/tc: fix renderpass splitting on flush
|
---|
3918 | - aux/tc: track the number of active queries
|
---|
3919 | - aux/tc: don't sync for get_sample_position
|
---|
3920 | - aux/tc: add a function to reset rp info
|
---|
3921 | - aux/tc: fix initial rp info allocation
|
---|
3922 | - aux/tc: make some of the rp tracking api private
|
---|
3923 | - aux/tc: rework inter-batch renderpass info handling
|
---|
3924 | - zink: only flag rp info for updating on flush, don't actually update
|
---|
3925 | - zink: add tracing for copy ops
|
---|
3926 | - zink: expand ZINK_DEBUG=sync to cover copy ops
|
---|
3927 | - zink: add some asserts for zs layout in dynamic render
|
---|
3928 | - zink: double check layouts for possible feedback loop images
|
---|
3929 | - zink: end rp earlier in set_framebuffer_state
|
---|
3930 | - zink: add a function for applying u_blitter barriers
|
---|
3931 | - zink: add a dynamic render version of clear_texture hook
|
---|
3932 | - zink: reset fb clears using the clears_enabled mask
|
---|
3933 | - zink: manually apply barriers whenever zink_context::blitting is set
|
---|
3934 | - zink: split out pipeline rp info update function
|
---|
3935 | - zink: add zink_context::unordered_blitting to preserve unordered flags
|
---|
3936 | - zink: implement unordered u_blitter calls
|
---|
3937 | - zink: round geometry for u_blitter debug markers
|
---|
3938 | - zink: catch zs u_blitter ops for draw markers
|
---|
3939 | - zink: add debug markers for draws
|
---|
3940 | - zink: track zsbuf info even when rp optimizing is disabled
|
---|
3941 | - zink: fix dsa state parsing for tc info
|
---|
3942 | - zink: track whether the fb zsbuf is readonly
|
---|
3943 | - zink: add a fixup case for readonly zsbuf clears
|
---|
3944 | - zink: improve no-oping of write -> readonly zsbuf layouts
|
---|
3945 | - zink: don't flag rp layout check on next draw when rp optimizing
|
---|
3946 | - zink: allow zink_is_zsbuf_used() without tc / rp optimizing
|
---|
3947 | - zink: rename add_implicit_color_feedback_loop()
|
---|
3948 | - zink: explicitly eliminate feedback loops for unused zsbufs
|
---|
3949 | - zink: further eliminate zs implicit feedback loops for read-only access
|
---|
3950 | - zink: split out luminance/alpha clear conversion code for reuse
|
---|
3951 | - zink: convert luminance/alpha clear colors in dynamic texture clear
|
---|
3952 | - zink: handle swapchain creation failure less lazily
|
---|
3953 | - zink: simplify resource_check_defer_buffer_barrier()
|
---|
3954 | - zink: delete unused barrier api
|
---|
3955 | - zink: never split a renderpass for a loadop change
|
---|
3956 | - zink: flag some rp ends as unsafe
|
---|
3957 | - zink: add batch refs for framebuffer surfaces on bind and ref update
|
---|
3958 | - zink: fix unordered access for image descriptors
|
---|
3959 | - zink: force unordered_write=false when binding image descriptors
|
---|
3960 | - zink: add an assert to ensure zsbuf invalidation doesn't break rendering
|
---|
3961 | - zink: only run post-fb-unbind layout stuff if the resource isn't being destroyed
|
---|
3962 | - zink: always set sampler layouts when unbinding fb images while rp optimizing
|
---|
3963 | - zink: rework handling of unordered->ordered write buffer barriers
|
---|
3964 | - zink: don't update fbfetch in db mode if inputAttachmentDescriptorSize==0
|
---|
3965 | - zink: add ZINK_DEBUG=flushsync
|
---|
3966 | - zink: track whether zsbuf is unused
|
---|
3967 | - zink: flag rp layout change if zsbuf usedness changes on dsa/fs state bind
|
---|
3968 | - driconf: make glthread=true default for source games
|
---|
3969 | - aux/trace: dump blend states with enums
|
---|
3970 | - aux/trace: fix GALLIUM_TRACE_NIR handling
|
---|
3971 | - zink: fix some type mismatches for c++ compilation
|
---|
3972 | - zink: break out a src region barrier check for reuse
|
---|
3973 | - zink: move all barrier-related functions to c++
|
---|
3974 | - zink: use c++ template to deduplicate all the buffer barrier code
|
---|
3975 | - zink: minor tweaks for image barriers
|
---|
3976 | - zink: use c++ template to deduplicate image barrier functions
|
---|
3977 | - zink: stop leaking separate shader nir
|
---|
3978 | - lavapipe: always copy streamout info when creating shaders
|
---|
3979 | - lavapipe: don't memcpy tess_ccw when copying pipeline library shaders
|
---|
3980 | - lavapipe: refactor shader compile functions to not take pipeline params
|
---|
3981 | - lavapipe: track bound shader stages on rendering_state
|
---|
3982 | - lavapipe: add a device member to rendering_state
|
---|
3983 | - lavapipe: stop using rendering_state::pipeline
|
---|
3984 | - lavapipe: refactor compute shader binding
|
---|
3985 | - lavapipe: merge some loops in handle_graphics_pipeline()
|
---|
3986 | - lavapipe: PIPE_SHADER\_ -> MESA_SHADER\_
|
---|
3987 | - lavapipe: don't access pipeline shader structs as much during bind
|
---|
3988 | - lavapipe: pull out dynamic tess origin check in gfx pipeline bind
|
---|
3989 | - lavapipe: break out all the important parts of gfx pipeline setting for reuse
|
---|
3990 | - lavapipe: delete unused struct member
|
---|
3991 | - lavapipe: refactor pipeline destroy a bit
|
---|
3992 | - lavapipe: add a ref for the tess_ccw nir on creation
|
---|
3993 | - lavapipe: unify lvp_pipeline_nir creation
|
---|
3994 | - lavapipe: dynamically bind noop fs at draw time when needed
|
---|
3995 | - lavapipe: don't double unbind gfx stages on pipeline bind
|
---|
3996 | - lavapipe: split out gfx stage unbinding
|
---|
3997 | - lavapipe: only update shader access for bind/unbind stages
|
---|
3998 | - lavapipe: only unset tess_states pointers on tes bind
|
---|
3999 | - lavapipe: avoid uniformly unsetting gs_output_lines
|
---|
4000 | - lavapipe: move default rasterizer state values to rendering_state init
|
---|
4001 | - mapi: add InternalInvalidateFramebufferAncillaryMESA
|
---|
4002 | - glthread: add _mesa_glthread_invalidate_zsbuf()
|
---|
4003 | - kopper: apply ancillary invalidation through glthread on swapbuffers
|
---|
4004 | - llvmpipe: fix linear fs analysis with nonzero fs outputs
|
---|
4005 | - llvmpipe: fix handling of unused color attachments
|
---|
4006 | - zink: add spirv builder function for terminate
|
---|
4007 | - zink: set src access when rebinding buffers, unset unordered_*
|
---|
4008 | - zink: fix quads emulation gs with array variables
|
---|
4009 | - zink: block resolves where src extents > dst extents
|
---|
4010 | - zink: omit VkPipelineVertexInputStateCreateInfo with dynamic vinput
|
---|
4011 | - zink: flag vertex buffers for rebind after vstate draws
|
---|
4012 | - zink: use search_or_add for masking vstate
|
---|
4013 | - zink: bind vertex state directly from draw hook
|
---|
4014 | - zink: add another vstate draw template for popcnt presence
|
---|
4015 | - zink: explicitly pass null velems when creating pipelines with dynamic vinput
|
---|
4016 | - zink: don't swizzle velems state for vstate draws
|
---|
4017 | - zink: use fast popcnt for vstate draws
|
---|
4018 | - zink: stop caching vertex states
|
---|
4019 | - lavapipe: break out pipeline layout creation for reuse
|
---|
4020 | - lavapipe: implement EXT_shader_object
|
---|
4021 | - lavapipe: advertise EXT_shader_object
|
---|
4022 | - zink: delete shader reordering in assign_io()
|
---|
4023 | - zink: add and populate a shader_info struct to zink_shader
|
---|
4024 | - zink: pass nir_shader to update_so_info()
|
---|
4025 | - zink: generate flat_flags during shader creation
|
---|
4026 | - zink: use zink_shader::info instead of zink_shader::nir::info
|
---|
4027 | - zink: simplify fbfetch output detection from fs
|
---|
4028 | - zink: pass nir directly to zink_shader_tcs_create()
|
---|
4029 | - zink: swap nir pointers when compiling compute shaders
|
---|
4030 | - zink: directly return nir from zink_shader_tcs_create
|
---|
4031 | - zink: streamline nir cloning for assign_io
|
---|
4032 | - zink: store nir as serialized on zink_shader structs
|
---|
4033 | - zink: simplify assign_io() further
|
---|
4034 | - zink: break out nir blob deserializing
|
---|
4035 | - zink: move nir cloning out to callers of zink_shader_compile
|
---|
4036 | - zink: store num_inlinable_uniforms separately for cs programs
|
---|
4037 | - zink: always store nir serialized
|
---|
4038 | - zink: be explicit about separate shader dsl indexing during creation
|
---|
4039 | - zink: rework choose_pdev (again)
|
---|
4040 | - glthread: use id 0 for internal buffer objects
|
---|
4041 | - radv: fix leak of nir from retained shaders
|
---|
4042 | - zink: don't try copying multiple results for conditional render copy
|
---|
4043 | - zink: more explicitly track/check rp optimizing per-context
|
---|
4044 | - zink: don't access non_fs part of zink_shader from fs
|
---|
4045 | - zink: reuse d3d12 variable copying to make passthrough gs more robust
|
---|
4046 | - zink: reuse copy_vars for generated tcs
|
---|
4047 | - zink: don't trigger shader variants on pcp change if driver supports dynamic pcp
|
---|
4048 | - Revert "zink: don't trigger shader variants on pcp change if driver supports dynamic pcp"
|
---|
4049 | - zink: try to prune resources from barrier jit on fb unbind
|
---|
4050 | - lavapipe: copy fragment shader when merging GPL pipelines
|
---|
4051 | - lavapipe: refactor/consolidate GPL shader copying
|
---|
4052 | - lavapipe: don't double-inline ubo0
|
---|
4053 | - lavapipe: implement inline variant caching
|
---|
4054 | - zink: block oom flushes during unordered blits
|
---|
4055 | - zink: unroll array loop when copying vars for passthrough shaders
|
---|
4056 | - zink: free GPL input/output libs on context destroy to avoid leaking
|
---|
4057 | - zink: fix GPL lib leaking
|
---|
4058 | - zink: remove redundant 'blitting' check in zink_prep_fb_attachment()
|
---|
4059 | - zink: break out feedback loop pipeline state flagging for reuse
|
---|
4060 | - zink: pre-convert attachment id to attachment idx
|
---|
4061 | - zink: eliminate implicit feedback loops on rp begin
|
---|
4062 | - zink: track per-image swapchain layouts
|
---|
4063 | - zink: handle swapchain handoffs around makecurrent
|
---|
4064 | - zink: remove a fixed validation error for ci
|
---|
4065 | - mesa/st/program: don't init xfb info if there are no outputs
|
---|
4066 | - zink: remove atomics from zink_query
|
---|
4067 | - zink: pass ctx through query destroy paths
|
---|
4068 | - zink: always defer query pool deletion
|
---|
4069 | - zink: move memoryTypeIndex selection down in general bo allocation
|
---|
4070 | - zink: slightly rework memoryTypeIndex selection to pre-determine heap
|
---|
4071 | - zink: restore BAR allocation failure demotion
|
---|
4072 | - zink: make general bo allocation more robust by iterating
|
---|
4073 | - zink: avoid zero-sized memcmp for descriptor layouts
|
---|
4074 | - iris: use util_framebuffer_get_num_samples when setting ps dispatch samples
|
---|
4075 | - zink: manually re-set framebuffer after msrtss replicate blit
|
---|
4076 | - zink: handle 'blitting' flag better in msrtss replication
|
---|
4077 | - zink: skip msrtss replicate if the attachment will be full-cleared
|
---|
4078 | - zink: avoid recursion during msrtss blits from flushing clears
|
---|
4079 | - nir/lower_alpha_test: rzalloc state slots
|
---|
4080 | - zink: fix non-db bindless texture buffers
|
---|
4081 | - zink: emit demote cap when using demote
|
---|
4082 | - zink: only print copy box warning once per resource
|
---|
4083 | - util/debug: move null checks out of debug message macro
|
---|
4084 | - zink: don't bitcast bool deref loads/stores
|
---|
4085 | - drisw: don't leak the winsys
|
---|
4086 | - zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support
|
---|
4087 | - draw: fix viewmask iterating
|
---|
4088 | - zink: don't pin flush queue threads if no threads exist
|
---|
4089 | - zink: add z32s8 as mandatory GL3.0 profile attachment format
|
---|
4090 | - nir/gs: fix array type copying for passthrough gs
|
---|
4091 | - zink: fix array copying in pv lowering
|
---|
4092 | - gallivm: break out native vector width calc for reuse
|
---|
4093 | - llvmpipe: do late init for llvm builder
|
---|
4094 | - zink: break out VkImageViewUsageCreateInfo applying for reuse
|
---|
4095 | - zink: reapply VkImageViewUsageCreateInfo when rebinding a surface
|
---|
4096 | - draw: fix robust ubo size calc
|
---|
4097 | - llvmpipe: fix native vector width init
|
---|
4098 | - zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile
|
---|
4099 | - zink: only unset a generated tcs if the bound tcs is the generated one
|
---|
4100 | - zink: set depth dynamic state values unconditionally
|
---|
4101 | - zink: null some descriptor buffer pointers during destruction
|
---|
4102 | - zink: sync queries at the end of cmdbufs
|
---|
4103 | - cso: unbind fb state when unbinding the context
|
---|
4104 | - i915: use util_copy_framebuffer_state to set fb state
|
---|
4105 | - i915: use util_unreference_framebuffer_state to unref fb state
|
---|
4106 | - iris: use util_unreference_framebuffer_state to unref fb state
|
---|
4107 | - softpipe: use util_unreference_framebuffer_state to unref fb state
|
---|
4108 | - v3d: use util_unreference_framebuffer_state to unref fb state
|
---|
4109 | - vc4: use util_unreference_framebuffer_state to unref fb state
|
---|
4110 | - llvmpipe: use util_unreference_framebuffer_state to unref fb state
|
---|
4111 | - svga: use util_unreference_framebuffer_state to unref fb state
|
---|
4112 | - zink: don't init mutable resource bit for swapchain images
|
---|
4113 | - zink: don't init mutable for swapchain src during blit
|
---|
4114 | - zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported
|
---|
4115 | - zink: stringify unsupported prim restart log error
|
---|
4116 | - zink: delete persistent map tracking
|
---|
4117 | - zink: add PERSISTENT for db buffer maps
|
---|
4118 | - zink: delete unnecessary pipeline stage flags from inference
|
---|
4119 | - zink: use an intermediate variable for binding ssbo slots
|
---|
4120 | - zink: unbind the ssbo slot being iterated, not the index of the buffer
|
---|
4121 | - zink: flush INDIRECT_BUFFER mem barrier for compute
|
---|
4122 | - zink: disable batched unordered barries with ZINK_DEBUG=noreorder
|
---|
4123 | - zink: block batching of unordered barriers if previous usage was write
|
---|
4124 | - zink: fix uncached memory readback
|
---|
4125 | - glsl/lower_samplers_as_deref: apply bindings for unused samplers
|
---|
4126 | - zink: bind bindless db set when updating separate shader db sets
|
---|
4127 | - zink: compare desc set to detect bindless vars in separate shaders
|
---|
4128 | - zink: adjust bindless texel buffer handle before indexing
|
---|
4129 | - zink: block more flushes during unordered blits
|
---|
4130 | - zink: also cache swapchain semaphores
|
---|
4131 |
|
---|
4132 | Mohamed Ahmed (3):
|
---|
4133 |
|
---|
4134 | - vulkan/runtime: move common buffer related entrypoints to vk_buffer.c
|
---|
4135 | - vulkan/runtime: implement vkGetBufferMemoryRequirements2()
|
---|
4136 | - anv: remove GetBufferMemoryRequirements2()
|
---|
4137 |
|
---|
4138 | Nanley Chery (16):
|
---|
4139 |
|
---|
4140 | - docs: Document the implicit barriers around blits
|
---|
4141 | - glsl: Add compute shaders to encode DXT5/BC3
|
---|
4142 | - glsl: Modify the #includes in the DXT5 shaders
|
---|
4143 | - mesa: Create _mesa_CreateShaderProgramv_impl
|
---|
4144 | - mesa/st: Add get_compute_program
|
---|
4145 | - mesa/st: Add and use create_bc1_endpoint_ssbo
|
---|
4146 | - mesa/st: Add st_compute_transcode_astc_to_dxt5
|
---|
4147 | - mesa/st: Add st_texture_image_resource_level
|
---|
4148 | - mesa/st: Enable compute-based transcoding to DXT5
|
---|
4149 | - mesa/st: Measure compressed fallback unmap paths
|
---|
4150 | - iris: Update comment in iris_cache_flush_for_render
|
---|
4151 | - iris: Flush caches for aux-mode changes more often
|
---|
4152 | - iris: Drop iris_cache_flush_for_render
|
---|
4153 | - iris: Allocate ZEROED BOs for shared resources
|
---|
4154 | - iris/bufmgr: Add and use zero_bo
|
---|
4155 | - iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED
|
---|
4156 |
|
---|
4157 | Nataraj Deshpande (1):
|
---|
4158 |
|
---|
4159 | - anv: Bump VkDeviceMemory objects limit to 4GB
|
---|
4160 |
|
---|
4161 | Neha Bhende (1):
|
---|
4162 |
|
---|
4163 | - docs: Add GL 4.3 support info in mesa docs
|
---|
4164 |
|
---|
4165 | Nicolas Dufresne (1):
|
---|
4166 |
|
---|
4167 | - util/format: Fix wrong colors when importing YUYV and UYVY
|
---|
4168 |
|
---|
4169 | Nicolas F (1):
|
---|
4170 |
|
---|
4171 | - driconf: remove the adaptive sync special case for mpv
|
---|
4172 |
|
---|
4173 | Oleksii Bozhenko (5):
|
---|
4174 |
|
---|
4175 | - glsl: fix gl_CullDistance lowering from float[8] to vec4[2]
|
---|
4176 | - ci: Uprev Piglit
|
---|
4177 | - Move combining clip and cull optimization before linking
|
---|
4178 | - wsi: add rgb_component_bits_are_equal
|
---|
4179 | - wsi: remove get_sorted_vk_formats duplication
|
---|
4180 |
|
---|
4181 | Patrick Lerda (25):
|
---|
4182 |
|
---|
4183 | - lima: fix memory leak related to u_transfer_helper_create()
|
---|
4184 | - mesa/program: fix memory leak triggered by parser errors
|
---|
4185 | - mesa/st: fix possible crash related to arb invalid memory access
|
---|
4186 | - r600: fix shader blob memory leak
|
---|
4187 | - vbo/save: fix possible crash related to fixup_vertex()
|
---|
4188 | - mesa/shaderapi: fix path memory leak
|
---|
4189 | - mesa/framebuffer: fix gl_framebuffer.resolve refcnt imbalance
|
---|
4190 | - mesa/program: fix memory leak triggered by invalid extended swizzle selector
|
---|
4191 | - mesa/program: fix memory leak triggered by multiple targets used on one texture image unit
|
---|
4192 | - mesa/program: fix memory leak triggered by arb alias
|
---|
4193 | - radeonsi: fix memory leak related to ureg_get_tokens()
|
---|
4194 | - glx: fix memory leak related to __glXCloseDisplay()
|
---|
4195 | - r600: fix refcnt imbalance related to shader
|
---|
4196 | - intel: fix memory leak related to brw_nir_create_passthrough_tcs()
|
---|
4197 | - r600: fix typo that could lead to a possible crash
|
---|
4198 | - egl: fix memory leak related to _eglRefreshDeviceList()
|
---|
4199 | - r600: fix refcnt imbalance related to r600_set_vertex_buffers()
|
---|
4200 | - r600: fix refcnt imbalance related to evergreen_set_shader_images()
|
---|
4201 | - lima: fix refcnt imbalance related to framebuffer
|
---|
4202 | - r600/sfn: fix memory leak related to sh_info->arrays
|
---|
4203 | - aux/draw: fix memory leak related to ureg_get_tokens()
|
---|
4204 | - crocus: fix refcnt imbalance related to framebuffer
|
---|
4205 | - crocus: fix refcnt imbalance related to crocus_create_surface()
|
---|
4206 | - r600: fix refcnt imbalance related to atomic_buffer_state
|
---|
4207 | - radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment
|
---|
4208 |
|
---|
4209 | Paul Gofman (1):
|
---|
4210 |
|
---|
4211 | - driconf: add a workaround for Kaiju-A-Gogo
|
---|
4212 |
|
---|
4213 | Paulo Zanoni (8):
|
---|
4214 |
|
---|
4215 | - anv: don't leave undefined values in exec->syncobj_values
|
---|
4216 | - anv: check the return value of anv_execbuf_add_bo_bitset()
|
---|
4217 | - anv: run buf_finish() if add_bo() fails during execute_simple_batch()
|
---|
4218 | - anv: rename anv_execbuf->array_length to bo_array_length
|
---|
4219 | - anv: use vk_realloc for the anv_execbuf arrays
|
---|
4220 | - hasvk: don't leave undefined values in exec->syncobj_values
|
---|
4221 | - hasvk: check the return value of anv_execbuf_add_bo_bitset()
|
---|
4222 | - anv: there's no need to set exec_obj offsets twice
|
---|
4223 |
|
---|
4224 | Pavel Ondračka (16):
|
---|
4225 |
|
---|
4226 | - nir/lower_bool: ntt: Generate a good opcode for bcsel
|
---|
4227 | - r300: update rv515 ci failures list
|
---|
4228 | - r300: skip sin/cos input range transformation for nine and ntt
|
---|
4229 | - r300: remove backend input range transformation for sin and cos
|
---|
4230 | - ntt: pass ubo_vec4_max nir_opt_offsets flag through ntt options
|
---|
4231 | - r300: set ubo_vec4_max ntt option properly
|
---|
4232 | - r300: remove backend negative addressing emulation
|
---|
4233 | - nir: nir opt_shrink_vectors whitespace fix
|
---|
4234 | - nir: mark progress when removing trailing unused alu channels
|
---|
4235 | - nir: mark progress when removing trailing unused load_const channels
|
---|
4236 | - r300: set register file to none if swizzles are constant only
|
---|
4237 | - nir: shrink phi nodes in nir_opt_shrink_vectors
|
---|
4238 | - r300: drop VDPAU support
|
---|
4239 | - r300: simplify KILL transformation
|
---|
4240 | - nine: use separate register for aL emulation
|
---|
4241 | - r300: fix unconditional KIL on R300/R400
|
---|
4242 |
|
---|
4243 | Pedro J. Estébanez (4):
|
---|
4244 |
|
---|
4245 | - spirv_to_dxil: Unify spirv_to_nir_options
|
---|
4246 | - spirv2dxil: Split read-only image as SRV logic into declared and inferred
|
---|
4247 | - spirv: Assume input attachments are read-only
|
---|
4248 | - Revert "microsoft/compiler: Use SRVs for read-only images"
|
---|
4249 |
|
---|
4250 | Philip Langdale (1):
|
---|
4251 |
|
---|
4252 | - radeonsi: correctly declare YUV420_10 RT Format support for AV1
|
---|
4253 |
|
---|
4254 | Philipp Zabel (2):
|
---|
4255 |
|
---|
4256 | - vulkan/wsi/wayland: fix acquire_next_image to report timeouts properly
|
---|
4257 | - zink: fix build with -Dvulkan-beta=true
|
---|
4258 |
|
---|
4259 | Pierre-Eric Pelloux-Prayer (37):
|
---|
4260 |
|
---|
4261 | - radeonsi: simplify dpbb settings
|
---|
4262 | - ac/info: move pci bus info in a struct
|
---|
4263 | - ac: add ac_query_pci_bus_info helper
|
---|
4264 | - ac: don't call ac_query_pci_bus_info from ac_query_gpu_info
|
---|
4265 | - radeonsi/sqtt: don't read results for disabled SEs
|
---|
4266 | - radeonsi/sqtt: disable SE1+ on GFX11
|
---|
4267 | - radeonsi/sqtt: update registers for gfx11
|
---|
4268 | - radeonsi/sqtt: implement offset workaround for gfx11
|
---|
4269 | - vbo: remove bogus assert
|
---|
4270 | - vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage
|
---|
4271 | - glthread: fix glArrayElement handling
|
---|
4272 | - drm-uapi/dma-buf.h: use __u32/__u64 types
|
---|
4273 | - winsys/amdgpu: use DMA_BUF_SET_NAME_B if available
|
---|
4274 | - radeonsi/gfx11: clamp PRIM_GRP_SIZE
|
---|
4275 | - radeonsi/gfx11: fix ge_cntl programming
|
---|
4276 | - amd/surface: fix base_mip_width of subsampled formats
|
---|
4277 | - winsys/amdgpu: use amdgpu_device_get_fd
|
---|
4278 | - radeonsi/video: use specific PIPE_BIND\_ value for video buffers
|
---|
4279 | - radeonsi: fix incorrect vgpr indices in the ps_prolog
|
---|
4280 | - radeonsi/test: use gbm-skips.txt
|
---|
4281 | - radeonsi/test: update test results
|
---|
4282 | - radeonsi: don't use PKT3_SET_SH_REG_INDEX on gfx9 and older
|
---|
4283 | - radeonsi: fix fast depth_clear_value/stencil_clear_value
|
---|
4284 | - yegl/wayland: fix glthread deadlocks
|
---|
4285 | - Revert "driconf: add a workaround for plasmashell freezing"
|
---|
4286 | - ac/llvm: fix build with LLVM 17
|
---|
4287 | - mesa: fix CopyImageSubDataOES with GL_TEXTURE_EXTERNAL_OES
|
---|
4288 | - amd/surface: rename metadata functions
|
---|
4289 | - ac/surface: introduce umd metadata v2
|
---|
4290 | - radeonsi: add AMD_DEBUG=extra_md
|
---|
4291 | - radeonsi: don't use si_decompress_dcc if the blitter is running
|
---|
4292 | - radv: add RADV_DEBUG=extra_md
|
---|
4293 | - radeonsi: don't use alignment_log2 of imported buffers
|
---|
4294 | - mesa: fix invalid index_bo refcounting
|
---|
4295 | - util/vbuf: clarify indirect draws handling
|
---|
4296 | - util/vbuf: fix index_bo leak
|
---|
4297 | - radeonsi: update test results
|
---|
4298 |
|
---|
4299 | Pino Toscano (1):
|
---|
4300 |
|
---|
4301 | - symbols-check: support OSes based on GNU toolchain
|
---|
4302 |
|
---|
4303 | Qiang Yu (78):
|
---|
4304 |
|
---|
4305 | - radeonsi: implement nir_load_ring_gsvs_amd
|
---|
4306 | - radeonsi: implement nir_load_ring_gs2vs_offset_amd
|
---|
4307 | - radeonsi: lower nir streamout intrinsics in abi
|
---|
4308 | - radeonsi: use nir_print_xfb_info to replace si_dump_streamout
|
---|
4309 | - radeonsi: use ac_nir_lower_legacy_vs to replace si_llvm_vs_build_end
|
---|
4310 | - radeonsi: add nir implementation of gs copy shader generation
|
---|
4311 | - radeonsi: build legacy gs output info when shader compile
|
---|
4312 | - radeonsi: replace llvm gs copy shader generation with nir
|
---|
4313 | - radeonsi: remove llvm gs copy shader generate
|
---|
4314 | - radeonsi: replace llvm legacy gs code with nir lowering
|
---|
4315 | - radeonsi: move gfx10_ngg_export_vertex to si_shader_llvm.c
|
---|
4316 | - gallium/aux: remove nir_helpers
|
---|
4317 | - nir/xfb_info: nir_gather_xfb_info_from_intrinsics update nir xfb_info
|
---|
4318 | - radeonsi: update nir xfb info after medium io lowering
|
---|
4319 | - nir: add nir_export_amd intrinsic
|
---|
4320 | - ac/llvm: implement nir_export_amd
|
---|
4321 | - aco: implement nir_export_amd
|
---|
4322 | - ac/nir: gs and nogs use ac_nir_export_primitive
|
---|
4323 | - ac/nir: add ac_nir_export_position
|
---|
4324 | - ac/nir: add ac_nir_export_parameter
|
---|
4325 | - ac/nir: add force_vrs to ac_nir_export_position
|
---|
4326 | - amd,radeonsi: implement nir_load_force_vrs_rates_amd in driver abi
|
---|
4327 | - radeonsi: clamp vertex color in legacy gs instead of gs copy shader
|
---|
4328 | - radeonsi: update outputs written nir info
|
---|
4329 | - radeonsi: remove the extra handling for VS/TES primitive id
|
---|
4330 | - radeonsi: set nr_pos_exports outside of llvm translation
|
---|
4331 | - ac/nir,radv,radeonsi: legacy vs use ac_nir_export_(position|parameter)
|
---|
4332 | - ac/nir,radv,radeonsi: gs copy shader use ac_nir_export_(position|parameter)
|
---|
4333 | - ac/nir/ngg: fix clip dist culling mask uninitialized
|
---|
4334 | - ac/nir/ngg: change clipdist_neg_mask_var type to uint32
|
---|
4335 | - ac/nir/ngg,radv,radeonsi: nogs use ac_nir_export_(position|parameter)
|
---|
4336 | - ac/nir/ngg: prepare gather_vs_outputs to be used by gs
|
---|
4337 | - ac/nir/ngg: gs use ac_nir_export_(position|parameter)
|
---|
4338 | - ac/nir/ngg,radv: ms use ac_nir_export_(primitive|position|parameter)
|
---|
4339 | - nir,ac/llvm,aco: remove nir_export_primitive_amd
|
---|
4340 | - nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd
|
---|
4341 | - aco: remove early_rast wait insert
|
---|
4342 | - radv: move radv_consider_force_vrs above radv_fill_shader_info
|
---|
4343 | - radv: use amd common force_vrs option
|
---|
4344 | - ac/llvm,radeonsi: lower nir_load_barycentric_at_sample in abi
|
---|
4345 | - radeonsi: add num_component param to load_internal_binding
|
---|
4346 | - ac/llvm,radeonsi: lower fbfetch in abi
|
---|
4347 | - radeonsi: only init llvm output when needed.
|
---|
4348 | - ac/llvm: only init outputs when fragment shader for radv
|
---|
4349 | - aco: only ls and ps use store output now
|
---|
4350 | - aco, radv: Add load_grid_size_from_user_sgpr to aco options.
|
---|
4351 | - aco, radv: Move is_trap_handler_shader to aco info.
|
---|
4352 | - ac/nir: move store_var_components to common place
|
---|
4353 | - ac/nir: tcs write tess factor support pass by reg
|
---|
4354 | - ac/nir: init tess factor location with IO remap
|
---|
4355 | - ac/nir: handle tess factor output missing case
|
---|
4356 | - ac/llvm,radeonsi: lower nir_load_ring_tess_factors_amd
|
---|
4357 | - radeonsi: lower nir_load_ring_tess_factors_offset_amd
|
---|
4358 | - radeonsi: monolithic TCS emit tessfactor in nir directly
|
---|
4359 | - ac/llvm: respect channel_type when ac_build_buffer_load
|
---|
4360 | - ac/llvm: add missing type convert for nir_load_buffer_amd
|
---|
4361 | - nir: pack_(s|u)norm_2x16 support float16 as input
|
---|
4362 | - ac/llvm: implement float16 nir_op_pack_(s|u)norm_2x16
|
---|
4363 | - aco: implement float16 nir_op_pack_(s|u)norm_2x16
|
---|
4364 | - nir,radeonsi: add and implement nir_load_alpha_reference_amd
|
---|
4365 | - nir: add nir_fisnan helper function
|
---|
4366 | - ac/nir: add ac_nir_lower_ps
|
---|
4367 | - radeonsi: monolithic PS emit epilog in nir directly
|
---|
4368 | - radeonsi: expose si_nir_load_internal_binding
|
---|
4369 | - ac/nir: add ac_nir_load_arg_at_offset
|
---|
4370 | - radeonsi: add si_nir_lower_vs_inputs
|
---|
4371 | - ac/llvm: vs_rel_patch_id can also be fixed up
|
---|
4372 | - ac/llvm: move ac_fixup_ls_hs_input_vgprs to amd common
|
---|
4373 | - radeonsi: monolithic VS emit prolog in nir directly
|
---|
4374 | - ac/llvm,radeonsi: remove abi->load_inputs implementation
|
---|
4375 | - ac/llvm: remove ac_build_opencoded_load_format
|
---|
4376 | - radeonsi: fix max scrach lds size calculation when ngg
|
---|
4377 | - ac/nir/ngg: fix gs culling vertex liveness check for odd vertices
|
---|
4378 | - ac/nir/ngg: fix store shared alignment
|
---|
4379 | - ac/llvm: remove some unused code replaced by nir
|
---|
4380 | - ac,aco: move gfx10 ngg prim count zero workaround to nir
|
---|
4381 | - aco: fix nir_f2u64 translation
|
---|
4382 | - ac/nir/cull: fix line position w culling
|
---|
4383 |
|
---|
4384 | Raun (2):
|
---|
4385 |
|
---|
4386 | - dzn: Enable VK_KHR_bind_memory2
|
---|
4387 | - dzn: Enable VK_KHR_get_memory_requirements2
|
---|
4388 |
|
---|
4389 | Rhys Perry (48):
|
---|
4390 |
|
---|
4391 | - radv: implement GS load_ring_gsvs_amd/load_ring_gs2vs_offset_amd
|
---|
4392 | - radv,aco: use ac_nir_lower_legacy_gs
|
---|
4393 | - aco: restore semantic_can_reorder for GS output stores
|
---|
4394 | - ac/nir: use store_buffer_amd's base index
|
---|
4395 | - ac/llvm: add support for fp32 addition atomics
|
---|
4396 | - aco: add support for fp32 addition atomics
|
---|
4397 | - radv: load ssbo_atomic_fadd descriptor
|
---|
4398 | - radv/gfx11: expose shaderBufferFloat32AtomicAdd
|
---|
4399 | - aco/tests: fix assembler.gfx11.vop12c_v128 with LLVM 15
|
---|
4400 | - aco/tests: update assembler tests for latest LLVM 16
|
---|
4401 | - radv: skip creation of null TLAS for null winsys
|
---|
4402 | - aco: set has_color_exports with GPL
|
---|
4403 | - aco: end reduce tmp after control flow, when used within control flow
|
---|
4404 | - aco/tests: add setup_reduce_temp.divergent_if_phi
|
---|
4405 | - aco/spill: always end spill vgpr after control flow
|
---|
4406 | - aco: limit VALUPartialForwardingHazard search
|
---|
4407 | - radv: set state.vbo_misaligned_mask_invalid in radv_bind_vs_input_state
|
---|
4408 | - ac: move ring_offsets to ac_shader_args
|
---|
4409 | - ac/llvm: let ring_offsets be accessed like a normal arg
|
---|
4410 | - radv/llvm: use the ring_offsets shader arg
|
---|
4411 | - aco: fix out-of-bounds access when moving s_mem(real)time across SMEM
|
---|
4412 | - aco: don't modify exec in p_interp_gfx11
|
---|
4413 | - aco: don't apply modifiers through DPP to unsupported instructions
|
---|
4414 | - aco: fix pathological case in LdsDirectVALUHazard
|
---|
4415 | - aco: always update orig_names in get_reg_phi()
|
---|
4416 | - radv: remove is_internal pipeline creation parameter
|
---|
4417 | - aco/tests: add tests for v_fma_f32 with 2 fp16 literals
|
---|
4418 | - aco: make IDSet sparse
|
---|
4419 | - nir/range_analysis: fix vectorized phis and intrinsics
|
---|
4420 | - nir: use xyzw order for precise fdot
|
---|
4421 | - nir: make fdph lowering match fdot
|
---|
4422 | - nir: add nir_lower_alu_width_test.fdot_order
|
---|
4423 | - aco/gfx11: fix RT prolog scratch initialization
|
---|
4424 | - aco: set needs_flat_scr=true for RT
|
---|
4425 | - util/dynarray: allow an initial stack allocation to be used
|
---|
4426 | - nir/range_analysis: add missing masking of shift amounts
|
---|
4427 | - nir/range_analysis: add helpers for limiting stack usage
|
---|
4428 | - nir/range_analysis: use perform_analysis() in nir_unsigned_upper_bound()
|
---|
4429 | - nir/range_analysis: use perform_analysis() in nir_analyze_range()
|
---|
4430 | - radv: fix setting radv_shader_info::user_data_0 with rt
|
---|
4431 | - aco: don't optimize s_or_b64(v_cmp_u_f32(a, b), cmp(a, a))
|
---|
4432 | - aco: fix nir_var_shader_out barriers for task shaders
|
---|
4433 | - radv/gfx11: improve RT scratch allocation
|
---|
4434 | - nir: make nir_fisnan helper exact
|
---|
4435 | - aco: remove SMEM_instruction::prevent_overflow
|
---|
4436 | - ac/nir/ps: fix null export write mask miss set to 0xf
|
---|
4437 | - aco: don't move exec reads around exec writes
|
---|
4438 | - aco: don't move exec writes around exec writes
|
---|
4439 |
|
---|
4440 | Rob Clark (180):
|
---|
4441 |
|
---|
4442 | - freedreno/ci: Switch a630 jobs over to manual
|
---|
4443 | - freedreno/ci: Cleanup a618 yaml
|
---|
4444 | - freedreno/ci: Add a618 egl/skqp/piglit jobs
|
---|
4445 | - Revert "freedreno/ci: Switch also performance a630 job to manual"
|
---|
4446 | - Revert "freedreno/ci: Switch a630 jobs over to manual"
|
---|
4447 | - freedreno/ci: Add an a618 flake
|
---|
4448 | - freedreno/drm: Remove assert
|
---|
4449 | - freedreno: Fix tracking of enabled SSBOs
|
---|
4450 | - freedreno/a6xx: Workaround for no pos/psize
|
---|
4451 | - freedreno: Don't re-install a flushed batch
|
---|
4452 | - freedreno/a6xx: Rework barrier handling
|
---|
4453 | - freedreno/ir3: Stop copying options
|
---|
4454 | - freedreno/ir3: Let driver specify fb-read descriptor
|
---|
4455 | - freedreno: Track image/SSBO usage for all stages
|
---|
4456 | - freedreno/ir3: Add descriptor set lowering
|
---|
4457 | - freedreno/a6xx: Pre-bake IBO descriptor sets
|
---|
4458 | - freedreno/a6xx: Add bindless state
|
---|
4459 | - freedreno/a6xx: Switch over to bindless IBO
|
---|
4460 | - freedreno/a6xx: Remove bindfull IBO state
|
---|
4461 | - freedreno/a6xx: Removing munging of tex state for IBO
|
---|
4462 | - freedreno/a6xx: Remove tex fb_read state
|
---|
4463 | - freedreno/a6xx: Move compute to tex state group
|
---|
4464 | - freedreno/a6xx: Move tex state building
|
---|
4465 | - freedreno/a6xx: Expose SSBO/image for all shader stages
|
---|
4466 | - freedreno: Restore GL_VENDOR string
|
---|
4467 | - gallium/util: Add util_writes_depth() helper
|
---|
4468 | - freedreno/a6xx: Add LRZ perf warn for ztest direction changes
|
---|
4469 | - freedreno/a6xx: Invalidate LRZ on blend+depthwrite
|
---|
4470 | - turnip: Rename lrz force_disable_mask
|
---|
4471 | - turnip: Invalidate LRZ on blend+depthwrite
|
---|
4472 | - util/xmlconfig: Use os_get_option()
|
---|
4473 | - freedreno: Add driconf to disable conservative LRZ
|
---|
4474 | - freedreno/a6xx: Add a few kernel regs/etc
|
---|
4475 | - freedreno/drm: Add some ref/unref debugging
|
---|
4476 | - freedreno/drm: Detect zombie BOs
|
---|
4477 | - freedreno/drm: Remove bo_del_or_recycle()
|
---|
4478 | - freedreno/drm: Split out bo->finalize()
|
---|
4479 | - freedreno/drm: Synchronize handle close and lookup
|
---|
4480 | - freedreno/drm/virtio: Flush before CREATE_BLOB
|
---|
4481 | - freedreno/drm: Restart import on zombie race
|
---|
4482 | - freedreno/gmem: Fix for partial z/s fast-clear
|
---|
4483 | - freedreno/decode: Increase size of offsets table
|
---|
4484 | - freedreno/a6xx: LRZ for MSAA
|
---|
4485 | - freedreno/ir3: Scalarize load_ssbo
|
---|
4486 | - freedreno/a6xx: Add missing CS_BINDLESS mapping
|
---|
4487 | - freedreno/a6xx: Add CS instrlen workaround
|
---|
4488 | - freedreno: nondraw-batch
|
---|
4489 | - freedreno: Skip flush_resource with explicit sync
|
---|
4490 | - freedreno/a6xx: Don't double-write SP_CS_OBJ_START
|
---|
4491 | - freedreno: Don't open-code setting dirty CS state
|
---|
4492 | - freedreno/a6xx: Make shader state independent of grid info
|
---|
4493 | - freedreno/a6xx: Also FLUSH_CACHE on image barrier
|
---|
4494 | - freedreno/a6xx: Remove excess CS flushing
|
---|
4495 | - freedreno+ir3: Move storage_16bit to compiler options
|
---|
4496 | - freedreno/a6xx: Move CS state to PROG state group
|
---|
4497 | - freedreno/drm: Move sa_cpu_prep() to core
|
---|
4498 | - freedreno/drm/virtio: Limit guest handles passed to virtgpu
|
---|
4499 | - ir3: Quiet unused variable warning
|
---|
4500 | - freedreno: Quiet unused variable warnings
|
---|
4501 | - freedreno/a2xx: Move pack_rgba()
|
---|
4502 | - freedreno: Indent fixes
|
---|
4503 | - freedreno/a6xx: Move num_driver_params to program state
|
---|
4504 | - freedreno: Move num_vertices calc to backend
|
---|
4505 | - freedreno: Remove impossible NULL check
|
---|
4506 | - freedreno: Add FD_DIRTY_QUERY
|
---|
4507 | - freedreno: Avoid screen lock when no rsc tracking needed
|
---|
4508 | - freedreno: Account for multi-draw in num_draws
|
---|
4509 | - freedreno: Push num_draws down to backend
|
---|
4510 | - freedreno/a6xx: Drop unused return
|
---|
4511 | - freedreno/a6xx: Split out flush_streamout() helper
|
---|
4512 | - freedreno/a6xx: Multi-draw support
|
---|
4513 | - freedreno/a6xx: Do tex-state invalidates in same ctx
|
---|
4514 | - freedreno/drm: Make rb refcnt non-atomic
|
---|
4515 | - freedreno/a6xx: Remove tex-state refcnting
|
---|
4516 | - freedreno: Move blend out of dirty-rsc tracking
|
---|
4517 | - freedreno: Move FD_MESA_DEBUG cases out of draw_vbo
|
---|
4518 | - freedreno/a6xx: Pre-compute PROG related LRZ state
|
---|
4519 | - freedreno: Avoid taking screen lock
|
---|
4520 | - freedreno/batch: Stop tracking cross-context deps
|
---|
4521 | - freedreno: Drop batch lock
|
---|
4522 | - freedreno: Add seqno helper
|
---|
4523 | - freedreno/drm: Optimize stateobj re-emit
|
---|
4524 | - freedreno/a6xx: Move rsc seqno out of tex cache key
|
---|
4525 | - freedreno/a6xx: Fix set_sampler_views(start != 0)
|
---|
4526 | - freedreno/a6xx: Drop unneed fd6_texture_state() arg
|
---|
4527 | - freedreno/a6xx: Fix sampler view rsc_seqno for X32_S8X24
|
---|
4528 | - freedreno/a6xx: Add a way to assert valid format
|
---|
4529 | - freedreno/a6xx: Remove needs_invalidate flag
|
---|
4530 | - freedreno/a6xx: Small cleanup
|
---|
4531 | - freedreno/a6xx: Static-ify sampler_view_update()
|
---|
4532 | - freedreno/a6xx: Fix view_seqno in tex cache key
|
---|
4533 | - freedreno/cffdec: Fix unitialized count for pkt2
|
---|
4534 | - freedreno/cffdec: Add helper to find next pkt
|
---|
4535 | - freedreno/cffdec: Add helper to parse CP_INDIRECT_BUFFER
|
---|
4536 | - freedreno/cffdec: Fix hang location detection
|
---|
4537 | - freedreno/crashdec: Refactor crashdec tests
|
---|
4538 | - freedreno/crashdec: Add another prefetch test
|
---|
4539 | - freedreno/crashdec: Handle multi-IB prefetching
|
---|
4540 | - freedreno/crashdec: Disable GALLIUM_DUMP_CPU
|
---|
4541 | - vk/runtime: Allow enumerate and try_create_for_drm to coexist
|
---|
4542 | - turnip: Move things to prep for multi-kernel support
|
---|
4543 | - turnip: drm code-motion
|
---|
4544 | - turnip: Split out vfuncs for kernel interface
|
---|
4545 | - turnip: Allow knl backend specific entrypoints
|
---|
4546 | - turnip: Move QueueWaitIdle entrypoint to kgsl
|
---|
4547 | - turnip: Handle kgsl vs drm specifics at runtime
|
---|
4548 | - turnip: Refactor device loading
|
---|
4549 | - tu+meson: Re-work KMD selection
|
---|
4550 | - tu/kgsl: Propagate tu_physical_device_init() errors
|
---|
4551 | - Revert "freedreno: Account for multi-draw in num_draws"
|
---|
4552 | - freedreno/a6xx: Namespace reg/pkt packer vars
|
---|
4553 | - freedreno/a6xx: Convert blitter to OUT_REG()
|
---|
4554 | - freedreno/a6xx: Fix mirror x/y blits
|
---|
4555 | - util: Add a simple no-op libdrm shim
|
---|
4556 | - turnip: Use libdrm shim
|
---|
4557 | - loader: Use libdrm shim
|
---|
4558 | - vk/runtime: Use libdrm shim
|
---|
4559 | - freedreno/common: Replace or_mask() with BitsetEnum<T>
|
---|
4560 | - freedreno: Promote non-drawing batches to sysmem
|
---|
4561 | - freedreno: Nerf strict-aliasing warning for all of gcc
|
---|
4562 | - freedreno/registers: Schema validation for gen_header.py
|
---|
4563 | - freedreno/registers: Add regs for a690
|
---|
4564 | - freedreno: Quiet c++ warning about designated initializers
|
---|
4565 | - freedreno/ir3: Un-inline enums
|
---|
4566 | - freedreno/ir3: Don't use negative opc for meta instructions
|
---|
4567 | - freedreno/ir3: c++-proof the headers
|
---|
4568 | - freedreno/ir3+tu: Calculate subgroup size in ir3
|
---|
4569 | - freedreno/ir3: Add missing driver params
|
---|
4570 | - freedreno: Un-inline buffer-mask enum
|
---|
4571 | - freedreno: c++-proofing
|
---|
4572 | - freedreno/a6xx: Rework texture_clear fallback
|
---|
4573 | - freedreno/a6xx: Add missing "inline"
|
---|
4574 | - freedreno/a6xx: Fix designator initializer order
|
---|
4575 | - freedreno/a6xx: Convert to c++
|
---|
4576 | - freedreno/registers: Fix designator order
|
---|
4577 | - freedreno/registers: Add prefix="variant"
|
---|
4578 | - freedreno/registers: Merge a6xx and a7xx regs
|
---|
4579 | - freedreno/registers: Start adding a7xx pipe/control regs
|
---|
4580 | - freedreno/decode: Start adding a7xx support
|
---|
4581 | - freedreno/registers: Start adding stuff for a7xx
|
---|
4582 | - freedreno/registers: Track varset
|
---|
4583 | - freedreno/registers: Split out regpair builder helper
|
---|
4584 | - freedreno/registers: Add c++ magic for register variants
|
---|
4585 | - freedreno/registers: Fix nameless fields
|
---|
4586 | - freedreno/registers: Define rest of CP_REG_WRITE
|
---|
4587 | - freedreno/a6xx: Simplify iova emit
|
---|
4588 | - mesa: Rework discard_framebuffer()
|
---|
4589 | - driconf: Add ignore_discard_framebuffer option
|
---|
4590 | - driconf: Work around incorrect GI discard/invalidate
|
---|
4591 | - freedreno: Specify GMEM tile alignment per GPU
|
---|
4592 | - freedreno+tu: Big GMEM support
|
---|
4593 | - freedreno+tu: Add a690 support
|
---|
4594 | - freedreno/a6xx: Restore mode
|
---|
4595 | - freedreno/rnn: Fix reg names for regs with variants
|
---|
4596 | - freedreno/afuc: Add raw mode for disasm
|
---|
4597 | - freedreno/registers: Add control reg for zap fw base
|
---|
4598 | - Revert "CI: Disable freedreno"
|
---|
4599 | - dri2/android: Bypass throttling
|
---|
4600 | - freedreno/drm: Fast path for idle check
|
---|
4601 | - freedreno/drm: Stop cleanup at first active BO
|
---|
4602 | - mesa: Add a few more function traces
|
---|
4603 | - freedreno/drm: Make threaded-submit optional
|
---|
4604 | - freedreno/drm: Disable threaded-submit for msm
|
---|
4605 | - freedreno: Optimize repeated finishes
|
---|
4606 | - freedreno: Stop being too clever by half
|
---|
4607 | - freedreno: Hoist dirty vars
|
---|
4608 | - freedreno: Extract out a helper
|
---|
4609 | - freedreno: Inline single-use helpers
|
---|
4610 | - freedreno: Re-work dirty-resource tracking
|
---|
4611 | - freedreno: Avoid looping shader stages if nothing dirty
|
---|
4612 | - freedreno: Move driconf settings into sub-struct
|
---|
4613 | - freedreno: Support the disable_throttling=true driconf option
|
---|
4614 | - util/disk_cache: Split out queue initialization
|
---|
4615 | - util/disk_cache: Add NONE type
|
---|
4616 | - util/disk_cache: Use queue state to skip put
|
---|
4617 | - util/disk_cache: Move blob_put_cb to the async queue
|
---|
4618 | - freedreno/a6xx: Allow z24s8 format casts
|
---|
4619 | - freedreno/a6xx: Fix valid_format_cast logic for newer a6xx
|
---|
4620 | - freedreno: Fix resource tracking vs rebind/invalidate
|
---|
4621 | - dri/android: Fix MSAA resolve
|
---|
4622 |
|
---|
4623 | Rohan Garg (9):
|
---|
4624 |
|
---|
4625 | - iris: Don't flush the render cache for a compute batch
|
---|
4626 | - anv: drop unused headers
|
---|
4627 | - anv: reuse the VK_IMAGE_ASPECT_PLANES_BITS_ANV macro
|
---|
4628 | - isl: fix some documentation
|
---|
4629 | - anv/blorp: use existing function to convert the op to a string
|
---|
4630 | - anv: break out of the loop when the first color attachment is found
|
---|
4631 | - anv,hasvk: cleanup unused enum
|
---|
4632 | - intel/genxml: Add the preferred slm size enum for gen125
|
---|
4633 | - anv,blorp,iris: Set PreferredSLMAllocationSize on gfx125+
|
---|
4634 |
|
---|
4635 | Roland Scheidegger (2):
|
---|
4636 |
|
---|
4637 | - llvmpipe: only use accurate_a0 hack if there are no textures bound
|
---|
4638 | - lavapipe, nir: Fix wrong array index scaling in nir_collect_src_uniforms
|
---|
4639 |
|
---|
4640 | Rose Hudson (4):
|
---|
4641 |
|
---|
4642 | - radeonsi: report 0 block size for Polaris HEVC encoding
|
---|
4643 | - asahi: wire up shader disk cache support
|
---|
4644 | - agx: isolate compiler debug flags
|
---|
4645 | - asahi: disable disk cache in debug runs
|
---|
4646 |
|
---|
4647 | Ruijing Dong (5):
|
---|
4648 |
|
---|
4649 | - frontends/va: revert commit 0b02db30
|
---|
4650 | - raseonsi/vcn: fix a h264 decoding issue
|
---|
4651 | - frontends/va: disable skip_frame_enable in vaapi interface.
|
---|
4652 | - radeonsi/vcn: correct cropping for hevc case
|
---|
4653 | - radeonsi/vcn: fix decoding bs buffer alignement issue.
|
---|
4654 |
|
---|
4655 | Ryan Neph (17):
|
---|
4656 |
|
---|
4657 | - ci: fix directory existence racing in parallel test execution
|
---|
4658 | - util/u_process: add MESA_PROCESS_NAME override to util_get_process_name()
|
---|
4659 | - util/u_process: remove util_get_process_name_may_override()
|
---|
4660 | - util/xmlconfig: add MESA_DRICONF_EXECUTABLE_OVERRIDE
|
---|
4661 | - venus: update venus-protocol headers to partially fix WA1
|
---|
4662 | - venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant
|
---|
4663 | - ci: uprev virglrenderer
|
---|
4664 | - venus: update venus-protocol headers to fix WA1
|
---|
4665 | - Revert "venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant"
|
---|
4666 | - venus: add vn_relax_init/_fini()
|
---|
4667 | - venus: set/check ring status bits independently
|
---|
4668 | - venus: init exp features before ring init again
|
---|
4669 | - venus: update to latest protocol for ringMonitoring
|
---|
4670 | - venus: check and configure new ringMonitoring feature
|
---|
4671 | - venus: re-use VN_DEBUG_NO_ABORT to disable ring monitoring abort()
|
---|
4672 | - virgl: hook new get_fd proc for drm winsys
|
---|
4673 | - i915: hook new get_fd proc for drm winsys
|
---|
4674 |
|
---|
4675 | Sagar Ghuge (10):
|
---|
4676 |
|
---|
4677 | - iris: Stop marking context unconditionally as guilty
|
---|
4678 | - intel/fs: Always stall between the fences on Gen11+
|
---|
4679 | - nir: Handle other variants of image_samples properly while lowering
|
---|
4680 | - intel/compiler: Add swsb_stall debug option
|
---|
4681 | - anv: Implement Wa_14015297576
|
---|
4682 | - iris: Implement Wa_14015297576
|
---|
4683 | - intel/compiler: Add Wa_14014063774 for slm_fence
|
---|
4684 | - intel/decoder: Bump the binding table guess value to 32
|
---|
4685 | - anv: Drop unused param from add_surface_reloc
|
---|
4686 | - anv: Drop dead code that sets the L3BypassDisable field
|
---|
4687 |
|
---|
4688 | Sai Teja Pottumuttu (2):
|
---|
4689 |
|
---|
4690 | - iris: Fix to release BO immediately if not busy
|
---|
4691 | - anv: Fix stride mismatch in mesa and minigbm
|
---|
4692 |
|
---|
4693 | Sajeesh Sidharthan (3):
|
---|
4694 |
|
---|
4695 | - radeonsi/vcn: disable fence for JPEG decoding
|
---|
4696 | - radeonsi/vcn: set bitstream buffer size to encoded bitstream size
|
---|
4697 | - radeonsi/vcn: optimize bitstream buffer resize logic
|
---|
4698 |
|
---|
4699 | Sam Edwards (1):
|
---|
4700 |
|
---|
4701 | - nouveau: Fix null dereference in nouveau_pushbuf_destroy
|
---|
4702 |
|
---|
4703 | Samuel Iglesias Gonsálvez (1):
|
---|
4704 |
|
---|
4705 | - docs/developers: Add Igalia as Mesa consultancy
|
---|
4706 |
|
---|
4707 | Samuel Pitoiset (279):
|
---|
4708 |
|
---|
4709 | - radv: fix missing implementation of creating images from swapchains
|
---|
4710 | - radv: fix hashing pipeline keys if RADV_PERFTEST=ngg_streamout is used
|
---|
4711 | - radv: fix re-emitting RB+ when the non-compacted color format changes
|
---|
4712 | - ac/nir: clear unused components before storing XFB outputs to LDS
|
---|
4713 | - ac: add TC_OP_ATOMIC_SUB_32
|
---|
4714 | - radv: fix setting MAX_MIP for BC views
|
---|
4715 | - radv: fix buffer to image copies with BC views on the graphics queue
|
---|
4716 | - radv: fix creating BC image views when the base layer is > 0
|
---|
4717 | - radv: rename ac_surf_nbc_view::max_mip to num_levels
|
---|
4718 | - radv: move some color blend helpers to radv_private.h
|
---|
4719 | - radv: add a new helper for normalizing blend factors
|
---|
4720 | - radv: add support for dynamic blend equation
|
---|
4721 | - radv: enable compiling PS epilogs on-demand for dynamic color blend equations
|
---|
4722 | - radv: fix detecting that blend is enabled when all CB states are dynamic
|
---|
4723 | - radv: advertise extendedDynamicState3ColorBlendEquation
|
---|
4724 | - radv: remove an old FIXME about a possible bug with TC-compat HTILE
|
---|
4725 | - radv/winsys: fix incorrect PCIID for GFX11 in the null winsys
|
---|
4726 | - radv: print depth image size with RADV_DEBUG=img
|
---|
4727 | - radv: fix RADV_DEBUG=hang with multiple cmdbuffer per submission
|
---|
4728 | - radv/winsys: prefix all error messages with RADV
|
---|
4729 | - radv: fix creating libraries with PS epilog and all CB states as dynamic
|
---|
4730 | - radv: fix ignoring graphics shader stages that don't need to be imported
|
---|
4731 | - radv: add a layer for fixing rendering issues with RAGE2
|
---|
4732 | - radv: simplify VK_PIPELINE_CREATE_FAIL_ON_PIPELINE_COMPILE_REQUIRED
|
---|
4733 | - radv: pass the number of stages to radv_hash_shaders()
|
---|
4734 | - radv: split radv_create_shaders() between graphics and compute shaders
|
---|
4735 | - radv: rename radv_create_shaders() to radv_graphics_pipeline_compile()
|
---|
4736 | - radv: ignore all CB dynamic states when there is no color attachments
|
---|
4737 | - radv: regroup dynamic states initialization
|
---|
4738 | - radv: only initialize non-zero values for the default dynamic state
|
---|
4739 | - radv: stop setting INTERPOLATE_COMP_Z
|
---|
4740 | - radv: fix RB+ for SRGB formats
|
---|
4741 | - radv: adjust ACCUM tessellation fields on GFX11+
|
---|
4742 | - radv: fix GPL fast-linking with libs that have retained NIR shaders
|
---|
4743 | - radv: skip shaders cache for fast-linked pipelines with GPL
|
---|
4744 | - radv: remove useless check about CS in radv_lower_io()
|
---|
4745 | - radv: simplify pipeline_has_ngg during graphics shaders compilation
|
---|
4746 | - radv: add helpers for capturing shaders and statistics
|
---|
4747 | - radv: pass radv_graphics_pipeline to radv_graphics_pipeline_compile()
|
---|
4748 | - radv: move retained shaders info to radv_graphics_pipeline
|
---|
4749 | - radv: pass radv_compute_pipeline to radv_compute_pipeline_compile()
|
---|
4750 | - radv: pass pCreateInfo to radv_graphics_pipeline_compile()
|
---|
4751 | - radv: optimize radv_pipeline_layout_add_set() slightly
|
---|
4752 | - radv: remove redundant zero initialization of pipeline layout
|
---|
4753 | - radv: remove radv_pipeline_stage::spirv::sha1
|
---|
4754 | - radv: allow to create a noop FS in a library with GPL
|
---|
4755 | - radv: remove one unused variable in radv_graphics_lib_pipeline_init()
|
---|
4756 | - radv: pass the lib flags for generating the pipeline key
|
---|
4757 | - radv: return a boolean value in radv_pipeline_needs_dynamic_ps_epilog()
|
---|
4758 | - radv: stop using the graphics pipeline key after compilation
|
---|
4759 | - radv: determine the last VGT API stage earlier
|
---|
4760 | - radv: skip compilation when possible with GPL fast-linking
|
---|
4761 | - radv: simplify an assertion after considering RADV_FORCE_VRS
|
---|
4762 | - radv: do not insert fast-linked libraries to the shaders cache
|
---|
4763 | - radv: fix skipping graphics pipeline compilation when the FS is NULL
|
---|
4764 | - radv: cleanup graphics pipeline library flags uses
|
---|
4765 | - radv: simplify determining when the fragment shader needs an epilog
|
---|
4766 | - radv: regroup PS epilog info when generating the graphics pipeline key
|
---|
4767 | - radv: fix disabling MRT compaction for on-demand PS epilogs
|
---|
4768 | - radv: make sure to disable MRT compaction when compiling a PS epilog with GPL
|
---|
4769 | - radv: simplify creating a FS epilog from a library
|
---|
4770 | - radv: stop skipping the cache for compute/raytracing pipelines with GPL
|
---|
4771 | - radv: stop skipping the cache for monolithic graphics pipelines with GPL
|
---|
4772 | - docs: add missing RADV_PERFTEST=video_decode
|
---|
4773 | - docs: stop reporting RADV_PERFTEST=gpl as experimental/suboptimal
|
---|
4774 | - radv/ci: set RADV_PERFTEST=GPL for all VKCTS jobs
|
---|
4775 | - radv/ci: bump the number of runners to 3 for vkcts-navi21-valve
|
---|
4776 | - radv: restore uploading shaders individually instead of consecutively
|
---|
4777 | - radv: implement graphics shaders relocation for a RGP workaround
|
---|
4778 | - radv: fix importing retained NIR shaders when a lib uses the RETAIN bit
|
---|
4779 | - radv: use last_vgt_api_stage for determining the last stage with XFB
|
---|
4780 | - radv: only initialize shader arguments for the active stages
|
---|
4781 | - radv simplify compiling graphics shaders with a mask of active NIR stages
|
---|
4782 | - radv: disable DCC for mipmaps on GFX11
|
---|
4783 | - radv: ignore registering pipeline libaries with SQTT
|
---|
4784 | - radv/ci: add missing expected failures with RADV_PERFTEST=gpl on GFX1100
|
---|
4785 | - radv: reduce maximum line width to 8.0
|
---|
4786 | - radv: add support for rectangularLines
|
---|
4787 | - Revert "radv: acquire pstate on-demand when capturing with RGP"
|
---|
4788 | - radv/amdgpu: only set a new pstate if the current one is different
|
---|
4789 | - radv: only skip emitting the pipeline blend state if the FS uses an epilog
|
---|
4790 | - radv: stop using a PS epilog when the FS doesn't write any color outputs
|
---|
4791 | - ci: uprev vkd3d-proton
|
---|
4792 | - zink/ci: skip KHR-GL46.texture_swizzle.functional with RADV
|
---|
4793 | - zink/ci: set RADV_PERFTEST=gpl for RADV jobs
|
---|
4794 | - radv/ci: disable vkcts-kabini-valve
|
---|
4795 | - radv/ci: move CI lists for external GPUs in separate folder
|
---|
4796 | - radv: configure SQ_THREAD_TRACE_CTRL.REG_AT_HWM on GFX11
|
---|
4797 | - radv: only enable SQTT for SE0 on GFX11
|
---|
4798 | - radv: make sure to wait for the trace buffer also on GFX11
|
---|
4799 | - radv: implement a workaround for SQTT on GFX11
|
---|
4800 | - radv: disable SPM counters with RGP on GFX11
|
---|
4801 | - radv: enable SQTT tracing on GFX11
|
---|
4802 | - radv: set VS_OUT_MISC_SIDE_BUS_ENA for clip distances on GFX10.3+
|
---|
4803 | - radv/ci: cleanup CI lists for dEQP-VK.memory.* tests that timeout
|
---|
4804 | - ac/nir: add resinfo lowering for sliced storage 3D views
|
---|
4805 | - radv: implement VK_EXT_image_sliced_view_of_3d on GFX10+
|
---|
4806 | - radv: advertise VK_EXT_image_sliced_view_of_3d on GFX10+
|
---|
4807 | - radv: cleanup radv_emit_{conservative,msaa}_state() functions
|
---|
4808 | - radv: stop setting ENABLE_POSTZ_OVERRASTERIZATION to 1
|
---|
4809 | - radv: set MSAA_NUM_SAMPLES to 0 for underestimate rasterization
|
---|
4810 | - radv: enable primitiveUnderestimation on GFX9+
|
---|
4811 | - zink/ci: skip one more test that timeout with RADV
|
---|
4812 | - radv: fix flushing non-coherent images inside secondaries on GFX9+
|
---|
4813 | - radv: fix flushing non-coherent images in EndCommandBuffer()
|
---|
4814 | - radv: fix draw calls with 0-sized index buffers and robustness on NAVI10
|
---|
4815 | - radv: only expose EXT_pipeline_library_group_handles if RT is enabled
|
---|
4816 | - amd,ac/rgp: fix SQTT memory types
|
---|
4817 | - radv: ignore alpha_is_on_msb on GFX11 because the hw ignores it
|
---|
4818 | - radv: use new EVENT_WRITE_ZPASS packet3 on GFX11
|
---|
4819 | - radv: fix DCC decompress on GFX11
|
---|
4820 | - radv: stop allocationg the attr ring BO for compute queues on GFX11
|
---|
4821 | - ci: uprev CTS to 1.3.5.0
|
---|
4822 | - radv/ci: adjust timeouts for Vega10 and Renoir
|
---|
4823 | - radv/ci: stop skipping some graphics pipeline library tests
|
---|
4824 | - radv/ci: update CI lists for CTS 1.3.5.0 on GFX110/POLARIS10/PITCAIRN
|
---|
4825 | - wsi: move an assertion in wsi_xxx_surface_get_capabilities2()
|
---|
4826 | - radv: do not add descriptor BOs on update when the global BO list is used
|
---|
4827 | - radv: fix incorrect stride for primitives generated query with GDS
|
---|
4828 | - radv: fix border color swizzle for stencil-only format on GFX9+
|
---|
4829 | - radv: fix defining RADV_USE_WSI_PLATFORM
|
---|
4830 | - radv: move disabling DCC for VRS rate images in radv_get_surface_flags()
|
---|
4831 | - ac/surface: add RADEON_SURF_VRS_RATE for selecting swizzle mode on GFX11
|
---|
4832 | - radv: add support for VRS attachment on GFX11
|
---|
4833 | - radv: do not emit PA_SC_VRS_OVERRIDE_CNTL from the pipeline on GFX11
|
---|
4834 | - radv: advertise attachmentFragmentShadingRate on GFX11
|
---|
4835 | - radv: enable VK_KHR_fragment_shading_rate on GFX11
|
---|
4836 | - radv: disable DCC with signedness reinterpretation on GFX11
|
---|
4837 | - radv: move instance related code to radv_instance.c
|
---|
4838 | - radv: move physical device related code to radv_physical_device.c
|
---|
4839 | - radv: move queue related code to radv_queue.c
|
---|
4840 | - radv: move sampler related code to radv_sampler.c
|
---|
4841 | - radv: move event related code to radv_event.c
|
---|
4842 | - radv: move buffer related code to radv_buffer.c
|
---|
4843 | - radv: move device memory related code to radv_device_memory.c
|
---|
4844 | - radv: zero-initialize radv_shader_args right before declaring them
|
---|
4845 | - radv: zero-initialize radv_shader_info earlier for graphics pipeline
|
---|
4846 | - radv: fix the error code when the driver fails to create a PS epilog
|
---|
4847 | - radv: determine if a graphics pipeline needs a noop FS earlier
|
---|
4848 | - radv: keep track of the retained NIR shaders sha1 for LTO pipelines
|
---|
4849 | - radv: allow to cache optimized (LTO) pipelines with GPL
|
---|
4850 | - radv: rename RADV_PIPELINE_LIBRARY to RADV_PIPELINE_RAY_TRACING_LIB
|
---|
4851 | - radv: add helpers for destroying various pipeline types
|
---|
4852 | - radv: fix NGG streamout with VS and GPL on GFX11
|
---|
4853 | - spirv: add SpvCapabilityFragmentFullyCoveredEXT
|
---|
4854 | - spirv,nir: add support for SpvBuiltInFullyCoveredEXT
|
---|
4855 | - radv: lower nir_intrinsic_load_fully_covered
|
---|
4856 | - radv: enable SAMPLE_COVERAGE_ENA if the fully covered built-in is used
|
---|
4857 | - radv: implement fullyCoveredFragmentShaderInputVariable
|
---|
4858 | - radv: enable fullyCoveredFragmentShaderInputVariable on GFX9+
|
---|
4859 | - radv: remove set but never used num_preserved_sgprs
|
---|
4860 | - radv: stop storing the binary as part of radv_shader_part
|
---|
4861 | - radv: store spi_shader_col_format to radv_shader_part_binary
|
---|
4862 | - radv: store the total radv_shader_part_binary size
|
---|
4863 | - radv: upload prologs/epilogs as part of radv_shader_part_create()
|
---|
4864 | - radv: allow to return the PS epilog binary to the pipeline
|
---|
4865 | - radv: make radv_shader_part_create() non-static
|
---|
4866 | - radv: add support for caching PS epilogs
|
---|
4867 | - radv: stop using radv_get_shader_shader() for task shaders
|
---|
4868 | - radv: replace radv_lookup_user_sgpr() by radv_get_user_sgpr()
|
---|
4869 | - radv: pass shader/base_reg to radv_emit_descriptor_pointers()
|
---|
4870 | - radv: pass shader/base_reg to radv_emit_inline_push_consts()
|
---|
4871 | - radv: pass shader/base_reg to radv_emit_userdata_address()
|
---|
4872 | - radv: use a separate compute path in radv_flush_constants()
|
---|
4873 | - radv: pass radv_shader to radv_dump_shader_stats()
|
---|
4874 | - radv: rework dumping shaders when a GPU hang is reported
|
---|
4875 | - radv: pass radv_ray_tracing_pipeline to radv_rt_pipeline_compile()
|
---|
4876 | - radv: pass radv_shader to radv_shader_need_indirect_descriptor_sets()
|
---|
4877 | - radv: implement VK_KHR_map_memory2
|
---|
4878 | - radv: use common GetBufferMemoryRequirements2()
|
---|
4879 | - radv: move cs_regalloc_hang_bug to radv_shader_info
|
---|
4880 | - radv: pass a radv_shader to radv_emit_dispatch_packets()
|
---|
4881 | - radv: remove radv_pipeline::device completely
|
---|
4882 | - radv: rework binding shaders to cmdbuf by introducing new helpers
|
---|
4883 | - radv: move radv_meta_* to a new folder
|
---|
4884 | - radv: copy the multisample state to radv_cmd_state
|
---|
4885 | - radv: move uses_user_sample_locations to radv_multisample_state
|
---|
4886 | - radv: separate the sample shading state between FS and graphics pipeline
|
---|
4887 | - radv: add DI_PT_RECTLIST to si_conv_prim_to_gs_out()
|
---|
4888 | - radv: stop checking dynamic states when emitting the guardband state
|
---|
4889 | - radv: rename gfx9_gs_info to radv_legacy_gs_info
|
---|
4890 | - radv: move {esgs,gsvs}_ring_size to radv_legacy_gs_info
|
---|
4891 | - radv/rt: bind the pipeline stack when it's not dynamic
|
---|
4892 | - radv/ci: update CI lists for Polaris10 and Pitcairn
|
---|
4893 | - radv: stop using get_vs_output_info() when emitting VS/NGG shaders
|
---|
4894 | - radv: emit the GS copy shader outside of radv_pipeline_emit_hw_gs()
|
---|
4895 | - radv: add radv_get_last_vgt_shader() helper
|
---|
4896 | - radv: stop using the pipeline for emitting PS inputs
|
---|
4897 | - radv: use the shader info stage to simplify emitting NGG shaders
|
---|
4898 | - radv: use the ES type to apply a workaround for NGG on GFX10
|
---|
4899 | - radv: pass the ES shader to radv_pipeline_emit_hw_ngg()
|
---|
4900 | - radv: stop using the pipeline for emitting shaders
|
---|
4901 | - radv: pass shader/base_reg to radv_emit_view_index_per_stage
|
---|
4902 | - radv: pass a shaders array to radv_get_shader()
|
---|
4903 | - radv: add radv_bind_shader() helper
|
---|
4904 | - radv: add an assertion about shader stage to radv_bind_pre_rast_shader()
|
---|
4905 | - radv: keep track of active stages as part of the cmdbuf state
|
---|
4906 | - radv: determine the last VGT shader at pipeline bind time
|
---|
4907 | - radv: stop using last_vgt_api_stage_{locs} during cmdbuf recording
|
---|
4908 | - radv: move dirtying flags for mesh shading to radv_bind_pre_rast_shader()
|
---|
4909 | - radv: copy bound shaders to the cmdbuf state
|
---|
4910 | - radv: determine and store the next graphics stage to radv_shader_info
|
---|
4911 | - radv: move user_data_0 to the shader info pass
|
---|
4912 | - radv: replace pipeline->is_ngg occurrences during cmdbuf recording
|
---|
4913 | - radv: replace pipeline->force_vrs_per_vertex during cmdbuf recording
|
---|
4914 | - radv: use serialized NIR for graphics libs with the RETAIN flag
|
---|
4915 | - radv: remove radv_graphics_pipeline::use_per_attribute_vb_descs
|
---|
4916 | - radv: remove radv_graphics_pipeline::last_vertex_attrib_bit
|
---|
4917 | - radv: remove radv_graphics_pipeline::next_vertex_stage
|
---|
4918 | - radv: remove radv_graphics_pipeline::can_use_simple_input
|
---|
4919 | - aco: remove unused aco_shader_info::vb_desc_usage_mask
|
---|
4920 | - radv: adjust vb_desc_usage_mask for dynamic VS inputs in the info pass
|
---|
4921 | - radv: remove radv_graphics_pipeline::vb_desc_usage_mask
|
---|
4922 | - radv: remove radv_graphics_pipeline::vb_desc_alloc_size
|
---|
4923 | - radv: rework emitting inner coverage when a fragment shader is bound
|
---|
4924 | - radv: copy custom blend mode to the cmdbuf state
|
---|
4925 | - radv: add a helper that returns the current rasterized primitive
|
---|
4926 | - radv: copy rast_prim to the cmdbuf state
|
---|
4927 | - radv: copy uses_{drawid,baseinstance} to the cmdbuf state
|
---|
4928 | - radv: copy ia_multi_vgt_param to the cmdbuf state
|
---|
4929 | - radv: add a helper to convert a VkPipelineBindPoint
|
---|
4930 | - radv: copy need_indirect_descriptor_sets to radv_cmd_state
|
---|
4931 | - radv: add push constant state to the cmdbuf state
|
---|
4932 | - radv: fix sample shading when a new fragment shader is bound
|
---|
4933 | - vulkan: add dynamic support for rectangles enable/mode
|
---|
4934 | - radv: add dynamic support for rectangles enable/mode
|
---|
4935 | - vulkan: Update XML and headers to 1.3.246
|
---|
4936 | - radv: copy db_render_control to the cmdbuf state
|
---|
4937 | - radv: set PS_ITER_SAMPLE(1) for sample shading during cmdbuf recording
|
---|
4938 | - radv: configure PA_SC_MODE_CNTL_1 during cmdbuf recording
|
---|
4939 | - radv: add the raygen shader BO to the cmdbuf list
|
---|
4940 | - radv: fix binding raytracing/compute pipelines
|
---|
4941 | - zink/ci: remove primitive-id-no-gs-quads from the NAVI10 fail list
|
---|
4942 | - radv/ci: add one more flake
|
---|
4943 | - radv: only copy non-NULL shaders when loaded from the cache
|
---|
4944 | - radv: rely on non-NULL binaries when inserting shaders to the cache
|
---|
4945 | - radv: allow to create/insert PS epilogs from/to the cache for libs
|
---|
4946 | - radv: remove dead code in radv_pipeline_get_nir()
|
---|
4947 | - radv: add VkGraphicsPipelineLibraryFlag to the graphics pipeline key
|
---|
4948 | - radv: ensure to retain NIR shaders for GPL libs found in the cache
|
---|
4949 | - radv: enable shaders cache for libraries with GPL
|
---|
4950 | - radv: fix VS prologs with GPL and static binding stride
|
---|
4951 | - radv: emit the PS epilog after the graphics pipeline
|
---|
4952 | - radv: add a helper for retaining NIR shaders
|
---|
4953 | - radv: move the serialized NIR to radv_graphics_lib_pipeline
|
---|
4954 | - radv: simplify a check when retaining NIR shaders
|
---|
4955 | - radv: do not retain noop FS for libs when a cache hit happened
|
---|
4956 | - radv: import retained NIR shaders later in the compilation process
|
---|
4957 | - radv/rt: stop storing unused hashes/identifiers
|
---|
4958 | - radv: create a helper for copying VkPipelineShaderStageCreateInfo
|
---|
4959 | - radv: copy stages instead of serializing NIR for GPL with the RETAIN flag
|
---|
4960 | - radv: enable VK_EXT_graphics_pipeline_library by default
|
---|
4961 | - radv/ci: update expected failures for PITCAIRN
|
---|
4962 | - radv/ci: remove no longer existing tests for PITCAIRN
|
---|
4963 | - radv/ci: update expected failures with BONAIRE
|
---|
4964 | - docs: add more release notes for RADV
|
---|
4965 | - radv: fix re-emitting vertex user SGPRs when binding a graphics pipeline
|
---|
4966 | - radv/ci: remove one RT test from the expected failures on RDNA3
|
---|
4967 | - radv: split radv_pipeline.c into radv_pipeline_{compute,graphics}.c
|
---|
4968 | - radv: fix pipeline creation feedback with imported graphics libs
|
---|
4969 | - radv: cleanup after splitting radv_pipeline.c
|
---|
4970 | - radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines
|
---|
4971 | - vulkan: ignore rasterizationSamples when the state is dynamic
|
---|
4972 | - radv: try to keep HTILE compressed for READ_ONLY_OPTIMAL layout
|
---|
4973 | - radv: re-emit the guardband state when related PSO are bound
|
---|
4974 | - radv: disable fast-clears with CMASK for 128-bit formats
|
---|
4975 | - radv: do not allow 1D block-compressed images with (extended) storage on GFX6
|
---|
4976 | - radv: fix usage flag for 3D compressed 128 bpp images on GFX9
|
---|
4977 | - radv: update binning settings to work around GPU hangs
|
---|
4978 | - radv/amdgpu: fix adding continue preambles and postambles BOs to the list
|
---|
4979 | - radv: wait for occlusion queries in the resolve query shader
|
---|
4980 | - radv: delay enabling/disabling occlusion queries at draw time
|
---|
4981 | - radv: track DB_COUNT_CONTROL changes to avoid context rolls
|
---|
4982 | - radv: add the perf counters BO to the preambles BO list
|
---|
4983 | - radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+
|
---|
4984 | - ac/nir: fix 8-bit/10-bit PS exports clamping
|
---|
4985 | - radv: fix dynamic depth clamp enable support
|
---|
4986 | - radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS}
|
---|
4987 | - radv: disable RB+ blend optimizations on GFX11 when a2c is enabled
|
---|
4988 |
|
---|
4989 | Sarah Walker (1):
|
---|
4990 |
|
---|
4991 | - pvr: Update FWIF transfer queue register structures
|
---|
4992 |
|
---|
4993 | Sathishkumar S (8):
|
---|
4994 |
|
---|
4995 | - radeonsi/vcn: add register definitions for JPEG 4.0.3
|
---|
4996 | - radeonsi/vcn: use register versions for jpeg
|
---|
4997 | - radeonsi/vcn: add support for picture crop on JPEG 4.0.3
|
---|
4998 | - radeonsi/vcn: support ARGB/RGBA conversion on JPEG 4.0.3
|
---|
4999 | - radeonsi/vcn: set jpeg reg version for gfx940
|
---|
5000 | - radeonsi/vcn: reset to default value when ROI/FC is not used
|
---|
5001 | - frontends/va: support crop region in jpeg decode
|
---|
5002 | - radeonsi/vcn: enable RGBA/ARGB formats on gfx940 jpeg
|
---|
5003 |
|
---|
5004 | Sebastian Wick (1):
|
---|
5005 |
|
---|
5006 | - loader: do not check the mesa DRI_Mesa version if it was not found
|
---|
5007 |
|
---|
5008 | Sergi Blanch Torne (8):
|
---|
5009 |
|
---|
5010 | - ci: disable Collabora's LAVA lab for maintance
|
---|
5011 | - Revert "ci: Collabora's LAVA lab for maintance"
|
---|
5012 | - ci: Uprev kernel to 6.1.7
|
---|
5013 | - ci: disable Collabora's LAVA lab for maintance
|
---|
5014 | - ci: disable Collabora's LAVA lab for maintance
|
---|
5015 | - ci: include setup test environment script in the output artifacts
|
---|
5016 | - Revert "ci: disable Collabora's LAVA lab for maintance"
|
---|
5017 | - ci: disable Collabora's LAVA lab for maintance
|
---|
5018 |
|
---|
5019 | Sidney Just (4):
|
---|
5020 |
|
---|
5021 | - zink: Fix non debug builds failing to compile on
|
---|
5022 | - loader: Add missing brace to fix compile
|
---|
5023 | - zink: add check for samplerMirrorClampToEdge Vulkan 1.2 feature
|
---|
5024 | - zink: Add missing features to the profile file
|
---|
5025 |
|
---|
5026 | Sil Vilerino (15):
|
---|
5027 |
|
---|
5028 | - d3d12: Honor suggested driver profile/level for H264/HEVC encode
|
---|
5029 | - d3d12: Video processing - Fix out of bounds array access
|
---|
5030 | - d3d12: Video Encode - Fix ID3D12CommandAllocator leak
|
---|
5031 | - d3d12: Fix VP9 Decode - Checking 0xFF instead of 0x7F for invalid frame_ref[i].Index7Bits
|
---|
5032 | - frontend/va: Add format support checks for VA_RT_FORMAT_* in VaCreateConfig/VaGetConfigAttributes
|
---|
5033 | - frontend/va: Remove duplicate code in format support checking/reporting.
|
---|
5034 | - frontend/va: Keep track of some VP9 previous frame data for current frame use_prev_in_find_mvs_refs
|
---|
5035 | - d3d12: VP9 Decode - Fix use_prev_in_find_mvs_refs calculation
|
---|
5036 | - d3d12: Fix video decode for interlaced streams with reference only textures required
|
---|
5037 | - d3d12: H264/HEVC Encode - Set both VBV InitialCapacity/Size in CBR Rate Control to same value when requested
|
---|
5038 | - d3d12: Encode H264/HEVC - Do not write PPS unless different from active
|
---|
5039 | - d3d12: Encode - Only upload headers when written headers size is > 0
|
---|
5040 | - nir: Fix use of alloca() without #include c99_alloca.h
|
---|
5041 | - Revert "d3d12: Honor suggested driver profile/level for H264/HEVC encode"
|
---|
5042 | - d3d12: Video processor to only promote resources to permanent residency when there is work to be flushed
|
---|
5043 |
|
---|
5044 | Simon Fels (2):
|
---|
5045 |
|
---|
5046 | - venus: allow vtest socket being specified by env variable
|
---|
5047 | - virgl/vtest: allow socket being specified by env variable
|
---|
5048 |
|
---|
5049 | Simon Perretta (38):
|
---|
5050 |
|
---|
5051 | - pvr: Add new Rogue compiler framework
|
---|
5052 | - pvr: Add support for optional instruction params
|
---|
5053 | - pvr: Support dual-destination ALU instructions
|
---|
5054 | - pvr: Commonise some instruction member defs
|
---|
5055 | - pvr: Drop the ENUM_PACKED macro
|
---|
5056 | - pvr: Keep NIR SSA defs instead of registers
|
---|
5057 | - pvr: Adjust instruction repeat offset
|
---|
5058 | - pvr: Validate instruction repeat and src/dst sizes
|
---|
5059 | - pvr: Add block printing support during validation
|
---|
5060 | - pvr: Clarify unreachable text
|
---|
5061 | - pvr: Add ADD64 support
|
---|
5062 | - pvr: Add memory load support
|
---|
5063 | - pvr: Add bitwise instruction support
|
---|
5064 | - pvr: Additional register subarray support
|
---|
5065 | - pvr: Support loading immediate values
|
---|
5066 | - pvr: Load descriptors from memory
|
---|
5067 | - pvr: Split pvr_private.h
|
---|
5068 | - pvr: Use descriptor/set/table offsets from driver
|
---|
5069 | - pvr: Add NIR pass to lower vars to SSA
|
---|
5070 | - pvr: Amend subarray ownership code
|
---|
5071 | - pvr: Add support for fitr.pixel
|
---|
5072 | - pvr: Add support for sample instructions
|
---|
5073 | - pvr: Add support for validating modifier combos
|
---|
5074 | - pvr: Add support for emitpix
|
---|
5075 | - pvr: Add support for WOP
|
---|
5076 | - pvr: Register allocation improvements
|
---|
5077 | - pvr: Fix descriptor set address calculation
|
---|
5078 | - pvr: Add support for generating per-job EOT program
|
---|
5079 | - pvr: Add support for generating NOP program
|
---|
5080 | - pvr: Add support for IDF
|
---|
5081 | - pvr: Add support for ST
|
---|
5082 | - pvr: Add branch support
|
---|
5083 | - pvr: Add support for TST
|
---|
5084 | - pvr: Add basic support for manual instruction grouping
|
---|
5085 | - pvr: Add support for MOVC
|
---|
5086 | - pvr: Add late op lowering pass and conditional execution
|
---|
5087 | - pvr: Amend definitions for ST and IDF
|
---|
5088 | - pvr: Add encodings for index registers
|
---|
5089 |
|
---|
5090 | Simon Ser (1):
|
---|
5091 |
|
---|
5092 | - egl: fix fd_display_gpu on surfaceless and device platforms
|
---|
5093 |
|
---|
5094 | Sonny Jiang (5):
|
---|
5095 |
|
---|
5096 | - radeonsi: Add NV12 support for AV1
|
---|
5097 | - gallium/pipe: change PIPE_DEFAULT_DECODER_FEEDBACK_TIMEOUT_NS to 1 second
|
---|
5098 | - amd/common: Add gfx940 codec query support
|
---|
5099 | - radeonsi/vcn: Add video capabilities support for gfx940
|
---|
5100 | - radeonsi/vcn: Add decode support for gfx940
|
---|
5101 |
|
---|
5102 | SoroushIMG (31):
|
---|
5103 |
|
---|
5104 | - zink: add pass checking for lod overflow in txf
|
---|
5105 | - zink: add zink_cs_key
|
---|
5106 | - zink: add VK_EXT_image_robustness
|
---|
5107 | - zink: add robust_access field to shader key
|
---|
5108 | - zink: lower LOD-invalid txf when imageRobustAccess2 is missing
|
---|
5109 | - zink: update gl43 profile to allow imageRobustAccess
|
---|
5110 | - zink: fix sparse residency query and minLOD feature checks
|
---|
5111 | - zink: fix cap check for arb sparse texture2
|
---|
5112 | - zink: only save frag const buffers when used by blit
|
---|
5113 | - zink: fix leak when rebinding same image surface
|
---|
5114 | - zink: clear null image surfaces to 0
|
---|
5115 | - zink: fix pointcoord y inversion
|
---|
5116 | - zink: relax bresenhamLines requirement for non-strictLine drivers
|
---|
5117 | - zink: fix compute shader leaks
|
---|
5118 | - zink: allocate program shader caches from the program's mem ctx
|
---|
5119 | - zink: stop creating pipeline library cache for non-optimal_key drivers
|
---|
5120 | - zink: free resource objects' views array during destruction
|
---|
5121 | - zink: fix stale point sprite mode state
|
---|
5122 | - zink: fix shadow mask change logic when binding sampler views
|
---|
5123 | - zink: track shadow swizzle for all shader stages
|
---|
5124 | - zink: minor formatting change
|
---|
5125 | - zink: add needs_zs_shader_swizzle shader key
|
---|
5126 | - zink: extend shadow swizzle pass to all zs textures
|
---|
5127 | - zink: add depth/stencil needs shader swizzle workaround field
|
---|
5128 | - zink: workaround undefined swizzle 1 for z/s textures
|
---|
5129 | - zink: rename shadow key to zs swizzle
|
---|
5130 | - zink: Add driver name and API version to renderer name
|
---|
5131 | - zink: do not emit line stipple dynamic state when emulating
|
---|
5132 | - zink: take location_frac into account in lower_line_smooth_gs
|
---|
5133 | - zink: fix incorrect line mode check for bresenham
|
---|
5134 | - zink: refcount the correct query pool
|
---|
5135 |
|
---|
5136 | Sui Jingfeng (1):
|
---|
5137 |
|
---|
5138 | - meson: add basic support for loongarch
|
---|
5139 |
|
---|
5140 | SureshGuttula (1):
|
---|
5141 |
|
---|
5142 | - radeonsi: Add support for DPB resize
|
---|
5143 |
|
---|
5144 | Sviatoslav Peleshko (9):
|
---|
5145 |
|
---|
5146 | - anv: Handle VkAccelerationStructureBuildRangeInfoKHR::transformOffset
|
---|
5147 | - driconf/anv: Apply limit_trig_input_range WA to Rise of the Tomb Raider
|
---|
5148 | - iris: Avoid creating uncompressed view with unaligned tile offsets on BDW
|
---|
5149 | - anv: Handle all fields in VkAccelerationStructureBuildRangeInfoKHR
|
---|
5150 | - anv: Move WA MEDIA_VFE_STATE after stalling PIPE_CONTROL
|
---|
5151 | - glsl: Fix codegen for constant ir_binop_{l,r}shift with mixed types
|
---|
5152 | - isl: Check all channels in isl_formats_have_same_bits_per_channel
|
---|
5153 | - anv: Handle UNDEFINED format in image format list
|
---|
5154 | - anv: Improve image/view usage bits verification
|
---|
5155 |
|
---|
5156 | Tapani Pälli (42):
|
---|
5157 |
|
---|
5158 | - intel/compiler: add cpp_std=c++17 when building tests
|
---|
5159 | - intel/hasvk: remove some stale comments, wa was removed
|
---|
5160 | - anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
|
---|
5161 | - hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
|
---|
5162 | - iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
|
---|
5163 | - mesa: move component bits queries as GL ES only
|
---|
5164 | - intel/genxml: set unused 3DSTATE_PS_EXTRA field as mbz
|
---|
5165 | - intel: enable existing workaround for ICL platform
|
---|
5166 | - intel/blorp: disable REP16 for gfx12+ with R10G10B10_FLOAT_A2
|
---|
5167 | - iris: disable preemption for 3DPRIMITIVE during streamout
|
---|
5168 | - iris: handle error in iris_resource_from_handle
|
---|
5169 | - spirv: add workaround for Metro Exodus in spirv_to_nir
|
---|
5170 | - radv: revert Metro Exodus workaround which was moved to common code
|
---|
5171 | - mesa/st: refactor st_destroy_texcompress_compute condition
|
---|
5172 | - mesa/st: add astc decoder lookup tables
|
---|
5173 | - mesa/st: initialize resources for ASTC decoding
|
---|
5174 | - mesa: add astc decoder shader template (glsl es version)
|
---|
5175 | - mesa/st: support compute shader decoding of ASTC
|
---|
5176 | - anv: Wa_14016407139, add required pc when SBA programmed
|
---|
5177 | - iris: implement emission of 3DSTATE_HS for Wa_1306463417
|
---|
5178 | - anv: emit 3DSTATE_HS in cmd_buffer_flush_gfx_state
|
---|
5179 | - anv: limit generated draws to pipelines without HS stage
|
---|
5180 | - anv: implement emission of 3DSTATE_HS for Wa_1306463417
|
---|
5181 | - iris: emit 3DSTATE_HS for each primitive on gfx12
|
---|
5182 | - anv: emit 3DSTATE_HS for each primitive on gfx12
|
---|
5183 | - intel/compiler: add comment about workaround on simd width
|
---|
5184 | - anv: fix sends_count_expectation assert on simd32
|
---|
5185 | - intel/isl: disable TILE64 for YCRCB formats
|
---|
5186 | - anv: implement occlusion query related Wa_14017076903
|
---|
5187 | - iris: implement occlusion query related Wa_14017076903
|
---|
5188 | - intel/fs: restore message layout changes for cube array
|
---|
5189 | - anv: use primitive ID override when shader does not supply it
|
---|
5190 | - anv: take primitive ID override to account Wa_14015297576
|
---|
5191 | - anv: check for MESA_SHADER_TESS_CTRL with get_tcs_prog_data
|
---|
5192 | - intel/common: limit the amount of SLM with Wa_14017341140
|
---|
5193 | - intel/fs: use intel_needs_workaround for Wa_22013689345
|
---|
5194 | - intel/compiler: use intel_needs_workaround for Wa_14012437816
|
---|
5195 | - isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125
|
---|
5196 | - iris: implement state cache invalidate for Wa_16013063087
|
---|
5197 | - anv: cleanup bitmask construction for PIPELINE_SELECT
|
---|
5198 | - anv: implement state cache invalidate for Wa_16013063087
|
---|
5199 | - isl: fix layout for comparing surf and view properties
|
---|
5200 |
|
---|
5201 | Tatsuyuki Ishi (22):
|
---|
5202 |
|
---|
5203 | - radv: Fix depth-only-with-discard when epilogs are used.
|
---|
5204 | - radv: Fix emitting tess indirect descriptors twice.
|
---|
5205 | - radv: Loop over shader stages in flush_indirect_descriptor_sets.
|
---|
5206 | - radv: Fix noop FS not getting constructed for GPL pipelines.
|
---|
5207 | - radv: Fix missing rbplus_allowed check for dynamic PS epilogs.
|
---|
5208 | - radv: Assert the hardware support rbplus when emitting rbplus state.
|
---|
5209 | - radv: Keep shader code ptr in a separately allocated buffer.
|
---|
5210 | - radv/sqtt: Use code buffer from radv_shader directly instead of copying.
|
---|
5211 | - radv: Replace radv_trap_handler_shader with radv_shader.
|
---|
5212 | - radeonsi: SDMA v4 size field is size - 1
|
---|
5213 | - radv: SDMA v4 size field is size - 1
|
---|
5214 | - radv: Remove SDMA padding from copy helpers.
|
---|
5215 | - radv: Use common helpers to translate format in SDMA copy.
|
---|
5216 | - radv/rt: Don't upload the prolog twice.
|
---|
5217 | - radv: Use radeon_cmdbuf for sdma_copy_image.
|
---|
5218 | - radv: Introduce sdma_copy_buffer for GFX7+.
|
---|
5219 | - radv: Upload shaders to invisible VRAM on small BAR systems.
|
---|
5220 | - radv: Wait for shader uploads asynchronously.
|
---|
5221 | - radv: Fix missing wait of GS copy shader upload for dmashaders.
|
---|
5222 | - amd: Add radv_foreach_stage to ForEachMacros.
|
---|
5223 | - radv: Pre-compute descriptor set layout hash.
|
---|
5224 | - ci/android: Make armv8's arch aarch64 instead of arm.
|
---|
5225 |
|
---|
5226 | Teng, Jin Chung (2):
|
---|
5227 |
|
---|
5228 | - frontend/va: Add large_scale_tile from VADecPictureParameterBufferAV1
|
---|
5229 | - d3d12: AV1 Dec - Set anchor_frame_idx only when large_scale_tile equals 1
|
---|
5230 |
|
---|
5231 | Thomas H.P. Andersen (4):
|
---|
5232 |
|
---|
5233 | - docs/panvk: VK_KHR_descriptor_update_template
|
---|
5234 | - meson: use summary()
|
---|
5235 | - meson: use sections in summary()
|
---|
5236 | - v3dv: use common code for descriptor update template
|
---|
5237 |
|
---|
5238 | Thong Thai (6):
|
---|
5239 |
|
---|
5240 | - gallium/auxiliary/vl: clean-up progressive shader
|
---|
5241 | - radeonsi/vcn: use encoder/decoder caps reported by kernel
|
---|
5242 | - gallium/auxiliary/vl: add crop to compute shader
|
---|
5243 | - mesa/main: rework locale setup/teardown
|
---|
5244 | - util: check and initialize locale before using it
|
---|
5245 | - tgsi: use locale independent float and double parsing
|
---|
5246 |
|
---|
5247 | Timothy Arceri (15):
|
---|
5248 |
|
---|
5249 | - nir/nir_opt_copy_prop_vars: remove extra loop
|
---|
5250 | - nir/nir_opt_copy_prop_vars: avoid comparison explosion
|
---|
5251 | - nir/nir_opt_copy_prop_vars: reuse hash tables
|
---|
5252 | - nir/nir_opt_copy_prop_vars: reuse dynamic arrays
|
---|
5253 | - nir/nir_opt_copy_prop_vars: reorder clone calls
|
---|
5254 | - nir/nir_opt_copy_prop_vars: don't call memset when cloning
|
---|
5255 | - ci: enable dEQP-VK.ubo.random.all_shared_buffer.48
|
---|
5256 | - glsl: copy prop vars before scalarizing alus
|
---|
5257 | - glsl: add _token_list_prepend() helper to the parser
|
---|
5258 | - glsl: isolate object macro replacments
|
---|
5259 | - glsl: remove do_copy_propagation_elements() optimisation pass
|
---|
5260 | - glsl: allow 64-bit integer on RHS of shift
|
---|
5261 | - util/00-mesa-defaults: add Akka Arrh workaround
|
---|
5262 | - mesa: add _mesa_is_api_gles2() helper
|
---|
5263 | - glsl: move some GL ES checks to the NIR linker
|
---|
5264 |
|
---|
5265 | Timur Kristóf (155):
|
---|
5266 |
|
---|
5267 | - aco/optimizer: Add missing v_lshlrev condition to can_apply_extract.
|
---|
5268 | - aco/optimizer: Optimize p_extract + v_mul_u32_u24 to v_mad_u32_u16.
|
---|
5269 | - radv: Make NGG query emission a dirty flag.
|
---|
5270 | - radv: Get rid of app_shaders_internal.
|
---|
5271 | - radv, aco: Add uses_full_subgroups to compute shader info.
|
---|
5272 | - aco: Enable constant exec mask based optimization on compute shaders.
|
---|
5273 | - radv: Lower dynamic VS inputs in NIR.
|
---|
5274 | - aco: Remove dynamic VS input loads.
|
---|
5275 | - nir: Add pack_half_2x16_rtz_split opcode.
|
---|
5276 | - radv, aco, ac: Implement pack_half_2x16_rtz_split.
|
---|
5277 | - nir: Lower pack_half_2x16_split to RTZ if available.
|
---|
5278 | - nir: Add algebraic optimization for VKD3D-Proton fp32->fp16 conversion.
|
---|
5279 | - ac/gpu_info: Add has_pcie_bandwidth_info.
|
---|
5280 | - radv: Don't place CS in VRAM when bandwidth is too low.
|
---|
5281 | - nir/opt_algebraic: Add optimization for ieq/ine and right-shift.
|
---|
5282 | - radv: Disable NGG culling when conservative overestimation is used.
|
---|
5283 | - ac/nir/cull: Alway remove zero-area triangles in face culling.
|
---|
5284 | - ac/nir/ngg: Include culled primitives in query.
|
---|
5285 | - radv: Don't change LDS_SIZE for NGG culling shaders.
|
---|
5286 | - radv: Move checking primitive topology to radv_get_ngg_culling_settings.
|
---|
5287 | - radv: Use shader code to skip NGG culling in small workgroups.
|
---|
5288 | - radv: Remove NGG culling skip from command buffer.
|
---|
5289 | - radv: Refactor radv_emit_ngg_culling_state so it's based on dirty flags.
|
---|
5290 | - nir: Clarify comment above load_buffer_amd.
|
---|
5291 | - ac: Port ACO's get_fetch_format to ac_get_safe_fetch_size.
|
---|
5292 | - ac/llvm: Remove "structurized" argument and instead check vindex.
|
---|
5293 | - ac/llvm: Fix buffer_load_amd with larger than 32-bit channel sizes.
|
---|
5294 | - ac/llvm: Fix ac_build_buffer_load to work with more than 4 channels.
|
---|
5295 | - ac/llvm: Change ac_build_tbuffer_load to take format and channel type.
|
---|
5296 | - radv: Move VS input lowering to new file: radv_nir_lower_vs_inputs.
|
---|
5297 | - aco: Get rid of redundant load_vmem_mubuf function.
|
---|
5298 | - aco: Don't set scalar offset on buffer load instructions when it's zero.
|
---|
5299 | - aco: Remove MTBUF zero operand.
|
---|
5300 | - radv: Call nir_lower_array_deref_of_vec in radv_lower_io_to_scalar_early.
|
---|
5301 | - aco/optimizer: Change v_cmp with subgroup invocation to constant.
|
---|
5302 | - radv: Emulate VGT_ESGS_ITEMSIZE in shaders on GFX9+.
|
---|
5303 | - util: Add util_format_get_array.
|
---|
5304 | - ac: Add pending_vmem field to args.
|
---|
5305 | - radv: Set pending_vmem on dynamic VS input args.
|
---|
5306 | - aco: Generalize vs_inputs to args_pending_vmem.
|
---|
5307 | - aco, radv: Rename aco_*_key to aco_*_info.
|
---|
5308 | - aco, radv: Move PS epilog and VS prolog args to their info structs.
|
---|
5309 | - aco, radv: Don't use radv_shader_args in aco.
|
---|
5310 | - aco: Don't include headers from radv.
|
---|
5311 | - ac/nir: clear nir_var_shader_out from TCS barriers
|
---|
5312 | - aco: Remove vtx_binding from MUBUF/MTBUF instructions.
|
---|
5313 | - nir: Add load_typed_buffer_amd intrinsic.
|
---|
5314 | - aco: Implement load_typed_buffer_amd.
|
---|
5315 | - ac/llvm: Implement typed buffer load intrinsic.
|
---|
5316 | - radv: Lower non-dynamic VS inputs in NIR.
|
---|
5317 | - radv: Apply swizzle and alpha adjust in radv_nir_lower_vs_inputs.
|
---|
5318 | - aco: Remove VS inputs from visit_load_input.
|
---|
5319 | - aco: Rename visit_load_input to visit_load_fs_input.
|
---|
5320 | - radv: Remove VS inputs code from LLVM backend.
|
---|
5321 | - ac/llvm: Remove unused function ac_build_struct_tbuffer_load.
|
---|
5322 | - aco, radv: Remove VS IO information from ACO.
|
---|
5323 | - aco: Don't add soffset to swizzled MUBUF base.
|
---|
5324 | - aco: Use zero for MUBUF/MTBUF when soffset is undefined.
|
---|
5325 | - aco: Disable MUBUF/MTBUF offsets when they are zero.
|
---|
5326 | - aco: Always enable idxen for swizzled buffer access on GFX11.
|
---|
5327 | - ac/nir/ngg: Remove usused lds_es enum values.
|
---|
5328 | - ac/nir/ngg: Rename saved_uniform to reusable_nondeferred_variable.
|
---|
5329 | - ac/nir/ngg: Split some functions out of save_reusable_variables.
|
---|
5330 | - ac/nir/ngg: Move divergence analysis call to analyze_shader_before_culling.
|
---|
5331 | - ac/nir/ngg: Rename state variables to "s".
|
---|
5332 | - ac/nir/ngg: Remove some superfluous variables.
|
---|
5333 | - ac/nir/ngg: Create separate variable for repacked rel_patch_id.
|
---|
5334 | - ac/nir/ngg: Rename repacked variables to clarify their name.
|
---|
5335 | - ac: Add more defines for mesh shading packets.
|
---|
5336 | - radv: Use new mesh shading packet defines.
|
---|
5337 | - radv: Add per-prim attributes to ring_attr stride.
|
---|
5338 | - radv: Use per-prim params in has_param_exports.
|
---|
5339 | - radv: Add extra offset to per-prim params.
|
---|
5340 | - radv: Use PRIM_ATTR for PS inputs on GFX11.
|
---|
5341 | - radv: Include per-prim params in NUM_INTERP on GFX11.
|
---|
5342 | - radv: Adjust mesh draw packets for GFX11.
|
---|
5343 | - ac/nir/ngg: Clarify mesh shader scratch ring.
|
---|
5344 | - ac/nir/ngg: Use attribute ring for mesh shader params.
|
---|
5345 | - ac/nir/ngg: Split legacy workgroup index function.
|
---|
5346 | - ac/nir/ngg: Fix mesh shader layer on GFX11.
|
---|
5347 | - ac/nir/ngg: Store special MS outputs in attribute ring for PS to read.
|
---|
5348 | - radv: Enable mesh shading on GFX11.
|
---|
5349 | - radv: Fix swizzled VS input loads when some components are unused.
|
---|
5350 | - radv: Don't expose NV_mesh_shader and don't use it in CI.
|
---|
5351 | - radv: Remove NV_mesh_shader API entrypoints.
|
---|
5352 | - radv: Remove first_task and ib_addr/ib_stride.
|
---|
5353 | - radv: Clean up emitting zero mesh shader draw id.
|
---|
5354 | - ac/nir/ngg: Remove NV_mesh_shader support.
|
---|
5355 | - ac/nir: Remove ac_nir_apply_first_task_to_task_shader.
|
---|
5356 | - nir: Remove IB address and stride intrinsics.
|
---|
5357 | - radv: Move radv_nir_* to a new folder.
|
---|
5358 | - radv: Move radv_nir_lower_primitive_shading_rate to new file.
|
---|
5359 | - radv: Move radv_nir_lower_fs_intrinsics to new file.
|
---|
5360 | - radv: Move radv_nir_lower_intrinsics_early to new file.
|
---|
5361 | - radv: Move radv_nir_lower_view_index to new file.
|
---|
5362 | - radv: Move radv_nir_lower_viewport_to_zero to new file.
|
---|
5363 | - radv: Move radv_nir_export_multiview to new file.
|
---|
5364 | - radv, ac/nir: Move sin/cos lowering to a common pass.
|
---|
5365 | - radv: Move I/O lowering functions into a new file.
|
---|
5366 | - radv: Use radv_get_shader to get vertex shader when binding pipeline.
|
---|
5367 | - ac/nir/ngg: Slightly improve attribute ring offset calculation.
|
---|
5368 | - ac/nir: Store only lowest 8 bits for task draw ring DWORD3.
|
---|
5369 | - ac/nir: When task->mesh dispatch Y or Z are 0, also set X to 0.
|
---|
5370 | - aco: Consider p_cbranch_nz as divergent branch too.
|
---|
5371 | - aco: Don't remove exec writes that also write other registers.
|
---|
5372 | - aco: Simplify get_phi_operand using Operand::c32_or_c64.
|
---|
5373 | - aco: Don't verify branch exec read when eliminating exec writes.
|
---|
5374 | - aco: Pop branch operands when targets are same in SSA elimination.
|
---|
5375 | - aco: Call dominator_tree before lower_phis.
|
---|
5376 | - aco: Better phi lowering for merge block when else-side is const.
|
---|
5377 | - nir: Gather compile time constant task->mesh dispatch size.
|
---|
5378 | - radv: Use linear_dispatch info in GFX11 task/mesh draw packet.
|
---|
5379 | - radv/amdgpu: Extract CS chain and unchain functions.
|
---|
5380 | - radv/amdgpu: Expose CS chain and unchain on the winsys.
|
---|
5381 | - radv/amdgpu: Extract radv_amdgpu_add_cs_to_bo_list function.
|
---|
5382 | - radv/amdgpu: Remember which CS the current one is chained to.
|
---|
5383 | - radv/amdgpu: Walk chained CS objects for BO list.
|
---|
5384 | - radv/amdgpu: Unchain CS array in queue code not in winsys.
|
---|
5385 | - radv: Chain cmd buffers in queue code when possible, not in winsys.
|
---|
5386 | - radv/amdgpu: Remove can_patch and chained submit code path.
|
---|
5387 | - ac/llvm: Cover runtime 0 in GFX10 gs_alloc_req workaround.
|
---|
5388 | - aco: Fix optimization of v_cmp with subgroup invocation.
|
---|
5389 | - aco: Don't use nir_selection_control in aco_ir.
|
---|
5390 | - aco: Only include nir.h in instruction selection.
|
---|
5391 | - radv: Don't include nir.h in radv_shader.h
|
---|
5392 | - radv: Create continue preamble on GFX6 even when no shader rings are used.
|
---|
5393 | - ac: Add maximum number of submitted IBs.
|
---|
5394 | - radv/amdgpu: Fix mismatching return type of radv_amdgpu_cs_submit.
|
---|
5395 | - radv/amdgpu: Only allow IB BOs on graphics and compute queues.
|
---|
5396 | - radv/amdgpu: Use correct alignment when creating CS BOs.
|
---|
5397 | - radv/amdgpu: Extract radv_amdgpu_cs_add_old_ib_buffer.
|
---|
5398 | - radv/amdgpu: Add a few assertions during submit.
|
---|
5399 | - radv/amdgpu: Remove hw_can_chain in favour of use_ib.
|
---|
5400 | - radv/amdgpu: Rewrite fallback code path so it can split submissions.
|
---|
5401 | - radv/amdgpu: Allow multiple continue preambles.
|
---|
5402 | - radv/amdgpu: Add continue preambles to fallback submit.
|
---|
5403 | - radv/amdgpu: Add postambles to fallback submit.
|
---|
5404 | - radv/amdgpu: Add ability to submit non-chained CS to fallback.
|
---|
5405 | - radv/amdgpu: Split gang submissions correctly when not chained.
|
---|
5406 | - radv: Fill continue preambles and postambles properly.
|
---|
5407 | - radv: Split submission in winsys instead of radv_queue.
|
---|
5408 | - radv/amdgpu: Use fallback submit for queues that can't use IBs.
|
---|
5409 | - radv/amdgpu: Clean up submission functions.
|
---|
5410 | - radv/amdgpu: Respect maximum number of submitted IBs per IP type.
|
---|
5411 | - radv: Allow task/mesh shaders with RADV_DEBUG=noibs.
|
---|
5412 | - radv/amdgpu: Add bool is_secondary argument to cs_create function.
|
---|
5413 | - radv/amdgpu: Extract radv_amdgpu_cs_bo_create function.
|
---|
5414 | - radv/amdgpu: Place secondary CS without IB2 in non-WC GTT.
|
---|
5415 | - ac, aco, radv: Clarify LDS size on GFX6, and NGG shaders.
|
---|
5416 | - radv: Don't hardcode LDS granularity in gfx9_get_gs_info.
|
---|
5417 | - aco: Remove setup_*_variables and add setup_lds_size instead.
|
---|
5418 | - aco, radv: Remove "key" from aco_compiler_options.
|
---|
5419 | - aco, radv: Remove redundant enable_mrt_output_nan_fixup from PS epilog info.
|
---|
5420 | - ac/nir/ngg: Don't store primitive IDs from culled primitives.
|
---|
5421 | - aco: Disallow constant propagation on SOPP and fixed operands.
|
---|
5422 |
|
---|
5423 | Tomeu Vizoso (7):
|
---|
5424 |
|
---|
5425 | - android: Make libbacktrace optional again
|
---|
5426 | - android: Cleanup unneeded headers from the sync stub
|
---|
5427 | - ci: Build for Android with libbacktrace=false
|
---|
5428 | - ci: Use NDK 25b to build for the Android ABI level 33
|
---|
5429 | - etnaviv: handle missing alu conversion opcodes
|
---|
5430 | - etnaviv: print writemask of store operations
|
---|
5431 | - etnaviv: don't read too much from uniform arrays
|
---|
5432 |
|
---|
5433 | Turo Lamminen (4):
|
---|
5434 |
|
---|
5435 | - radv: Change radeon_cmdbuf counters to uint64_t to make alias analysis optimize radeon_emit better
|
---|
5436 | - radv: Clean up variables in si_get_ia_multi_vgt_param
|
---|
5437 | - radv: Avoid redundant fetch of radv_device
|
---|
5438 | - radv: Optimize emitting prefetches
|
---|
5439 |
|
---|
5440 | Val Packett (1):
|
---|
5441 |
|
---|
5442 | - mailmap: Remap name and email for Val Packett
|
---|
5443 |
|
---|
5444 | Vincent Davis Jr (1):
|
---|
5445 |
|
---|
5446 | - gbm/backend: fix gbm compile without dri
|
---|
5447 |
|
---|
5448 | Vinson Lee (2):
|
---|
5449 |
|
---|
5450 | - radv: Fix memory leak.
|
---|
5451 | - pps: Fix build errors.
|
---|
5452 |
|
---|
5453 | Vitaliy Triang3l Kuzmin (2):
|
---|
5454 |
|
---|
5455 | - radv: Set DB_Z_INFO.NUM_SAMPLES to MSAA_EXPOSED_SAMPLES without Z/S
|
---|
5456 | - r600: Alpha to coverage dithering on Evergreen+
|
---|
5457 |
|
---|
5458 | Väinö Mäkelä (12):
|
---|
5459 |
|
---|
5460 | - intel/vec4: Set the rounding mode
|
---|
5461 | - intel/vec4: Don't optimize multiply by 1.0 away
|
---|
5462 | - hasvk: Don't claim shaderDenormPreserveFloat32 on gfx7
|
---|
5463 | - hasvk: Tell spirv_to_nir float controls are always supported
|
---|
5464 | - hasvk: Enable PixelShaderKillsPixel when omask is used
|
---|
5465 | - hasvk: Mark VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL as stencil write optimal
|
---|
5466 | - hasvk: Handle subpass self-dependencies for stencil shadow copies
|
---|
5467 | - ci/intel: Update hasvk HSW xfails
|
---|
5468 | - hasvk: Disable non-zero fast clears for 8xMSAA images
|
---|
5469 | - intel/ci: Remove outdated hasvk copy_and_blit xfails
|
---|
5470 | - intel/ci: Remove hasvk xfails missing from the CTS
|
---|
5471 | - intel/ci: Remove skipped float_controls tests from hasvk xfails
|
---|
5472 |
|
---|
5473 | X512 (3):
|
---|
5474 |
|
---|
5475 | - hgl: remove
|
---|
5476 | - haiku: fix build
|
---|
5477 | - EGL: implement Haiku driver over Gallium
|
---|
5478 |
|
---|
5479 | Yevhenii Kolesnikov (2):
|
---|
5480 |
|
---|
5481 | - nir/loop_analyze: Track induction variables incremented by more operations
|
---|
5482 | - nir/loop_analyze: Determine iteration counts for more kinds of loops
|
---|
5483 |
|
---|
5484 | Yiwei Zhang (46):
|
---|
5485 |
|
---|
5486 | - venus: log upon device creation
|
---|
5487 | - venus: lazily query and cache gralloc front rendering usage
|
---|
5488 | - venus: disable non AHB external memory bits on Android
|
---|
5489 | - venus: fix formating
|
---|
5490 | - venus: fix tracing init to include instance creation
|
---|
5491 | - venus: render server enforces blob_id_0
|
---|
5492 | - venus: move exp features init back to use ring submit
|
---|
5493 | - venus: further disallow sparse resource
|
---|
5494 | - venus: replace binary search with hardcode for max buffer size
|
---|
5495 | - venus: start requiring all experimental features
|
---|
5496 | - venus: clean up memoryResourceAllocationSize
|
---|
5497 | - venus: clean up globalFencing
|
---|
5498 | - venus: refactor sync fd fence and sempahore features
|
---|
5499 | - venus: tighten up the sync fd requirements for Android wsi
|
---|
5500 | - venus: distinguish external memory from mappable memory support
|
---|
5501 | - venus: fix external buffer creation
|
---|
5502 | - venus: remove redundant abstractions for wsi struct search
|
---|
5503 | - venus: refactor image create info pnext tracking
|
---|
5504 | - venus: simplify ahb image creation
|
---|
5505 | - venus: simplify support for non-AHB external images
|
---|
5506 | - venus: fix external image creation
|
---|
5507 | - venus: fix device memory export alloc info
|
---|
5508 | - venus: fix VK_EXT_image_view_min_lod feature query
|
---|
5509 | - venus: ensure invariance of buffer memory requirement size
|
---|
5510 | - venus: sync to latest protocol for ring status enum
|
---|
5511 | - venus: abort ring submit when ring is in fatal status
|
---|
5512 | - venus: propagate vn_ring to vn_relax
|
---|
5513 | - venus: vn_relax to abort on ring fatal status upon warn order
|
---|
5514 | - venus: revert back the warn order
|
---|
5515 | - venus: sync to latest protocol for asyncRoundtrip
|
---|
5516 | - venus: switch to use 64bit roundtrip seqno
|
---|
5517 | - venus: make vn_instance_wait_roundtrip asynchronous
|
---|
5518 | - venus: let vn_instance_submit_command track ring seqno
|
---|
5519 | - venus: make common wsi bo submission async
|
---|
5520 | - venus: refactor to add vn_sync_payload_external
|
---|
5521 | - venus: make external fence and semaphore export async
|
---|
5522 | - Revert "zink/kopper: Add extra swapchain images for Venus"
|
---|
5523 | - venus: sync latest protocol for layering extensions
|
---|
5524 | - venus: add VK_EXT_load_store_op_none support
|
---|
5525 | - venus: add VK_EXT_rasterization_order_attachment_access support
|
---|
5526 | - venus/docs: sync to latest venus supported extensions
|
---|
5527 | - venus: requires asyncRoundtrip
|
---|
5528 | - venus: requires ringMonitoring
|
---|
5529 | - venus: move exp feature init back to ring and remove unused function
|
---|
5530 | - venus: forward ARM driverVersion for ANGLE workarounds
|
---|
5531 | - radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults
|
---|
5532 |
|
---|
5533 | Yogesh Mohan Marimuthu (18):
|
---|
5534 |
|
---|
5535 | - egl: add render_gpu tag to dri2_dpy->fd and dri2_dpy->dri_screen variable
|
---|
5536 | - loader,glx: add render_gpu tag psc->driScreen and psc->fd
|
---|
5537 | - loader,glx,egl,vl,d3d: loader_get_user_preferred_fd() function to return original_fd
|
---|
5538 | - egl: remove is_different_gpu variable from struct dri2_egl_display
|
---|
5539 | - glx: remove is_different_gpu variable from struct dri_screen
|
---|
5540 | - loader,glx,egl: remove is_different_gpu variable from loader
|
---|
5541 | - ac,radeonsi: move shadow regs create ib preamble function to amd common
|
---|
5542 | - radv: add shadowregs variable to RADV_DEBUG environment variable
|
---|
5543 | - radv: add support for register shadowing
|
---|
5544 | - radv: set preemp flag and pre_ena bit for shadowregs
|
---|
5545 | - radv: INDEX_TYPE and NUM_INSTANCES PKT3 are not shadowed
|
---|
5546 | - radv: fence complete struct is 4 qw size
|
---|
5547 | - radv: allow NULL initial_preamble_cs in radv_amdgpu_winsys_cs_submit_sysmem()
|
---|
5548 | - radeonsi: remove some shadow reg optimization for bf1 game
|
---|
5549 | - wsi/display: check alloc failure in wsi_display_alloc_connector()
|
---|
5550 | - ac/surface: only adjust pitch if surf_pitch was modified
|
---|
5551 | - amd/surface: add RADEON_SURF_NO_TEXTURE flag
|
---|
5552 | - radv: set RADEON_SURF_NO_TEXTURE flag in radv_get_surface_flags()
|
---|
5553 |
|
---|
5554 | Yogesh Mohanmarimuthu (7):
|
---|
5555 |
|
---|
5556 | - egl: add fd_display_gpu to struct dri2_egl_display
|
---|
5557 | - egl,egl/x11: keep display fd open for prime
|
---|
5558 | - egl: create DRI screen for display GPU in case of prime
|
---|
5559 | - loader,glx,egl/x11: init dri_screen_display_gpu in struct loader_dri3_drawable
|
---|
5560 | - egl/wayland: keep display fd open for prime
|
---|
5561 | - loader: make image_format_to_fourcc() non-static
|
---|
5562 | - egl/wayland: for prime, allocate linear_copy from display GPU VRAM
|
---|
5563 |
|
---|
5564 | Yonggang Luo (8):
|
---|
5565 |
|
---|
5566 | - util: Implement util_iround with lrintf unconditionally
|
---|
5567 | - util: Fixes error: no previous prototype for 'mesa_cache_db_entry_remove' Fixes: c92c99481fd ("util/mesa-db: Support removal of cache entries")
|
---|
5568 | - vulkan: Use static_assert for check HWVULKAN_DISPATCH_MAGIC == ICD_LOADER_MAGIC
|
---|
5569 | - meson: Split c_cpp_args from pre_args
|
---|
5570 | - meson: Combine duplicated c_args and cpp_args
|
---|
5571 | - meson: When sse2 enabled, both c and cpp using sse2 options
|
---|
5572 | - meson: Split sse2_arg and sse2_args out of c_cpp_args
|
---|
5573 | - meson: Use sse2_arg and sse2_args to replace usage of c and c_sse2_args
|
---|
5574 |
|
---|
5575 | Yusuf Khan (2):
|
---|
5576 |
|
---|
5577 | - nvc0/nv50: support and enable EXT_memory_object*
|
---|
5578 | - gallium: create query_memory_info implementation for sw drivers
|
---|
5579 |
|
---|
5580 | Yuxuan Shui (1):
|
---|
5581 |
|
---|
5582 | - loader: unregister special event in loader_dri3_drawable_fini
|
---|
5583 |
|
---|
5584 | antonino (54):
|
---|
5585 |
|
---|
5586 | - zink: fix line smooth lowering
|
---|
5587 | - zink: add \`zink_emulate_point_smooth` driconf
|
---|
5588 | - zink: add \`lower_point_smooth` to \`zink_fs_key`
|
---|
5589 | - zink/nir_to_spirv: add support for \`nir_intrinsic_load_point_coord`
|
---|
5590 | - nir: handle output beeing written to deref in \`nir_lower_point_smooth`
|
---|
5591 | - zink: handle point_smooth emulation
|
---|
5592 | - drirc: set \`zink_emulate_point_smooth` for Quake II
|
---|
5593 | - zink: fix stipple pattern in oblique lines
|
---|
5594 | - zink: fix \`final_hash` update in \`zink_gfx_program_update`
|
---|
5595 | - mesa: correctly allocate space for converted primtives
|
---|
5596 | - gallium: decompose quad strips into quads if supported
|
---|
5597 | - zink: handle switching between primitives
|
---|
5598 | - nir: handle primitives with adjacency
|
---|
5599 | - nir: avoid generating conflicting output variables
|
---|
5600 | - nir: calculate number of vertices in nir_create_passthrough_gs
|
---|
5601 | - nir: handle edge flags in nir_create_passthrough_gs
|
---|
5602 | - zink: add \`has_edgeflags` flag to zink_shader and zink_gfx_program
|
---|
5603 | - zink: handle edgeflags
|
---|
5604 | - nir: allow to force line strip out in nir_create_passthrough_gs
|
---|
5605 | - zink: force line strip out when emulating stipple
|
---|
5606 | - zink: filled quad emulation gs generation function
|
---|
5607 | - zink: add \`zink_rast_prim` enum
|
---|
5608 | - zink: handle quads
|
---|
5609 | - zink: fix flat shading on filled quads
|
---|
5610 | - zink: add flags to \`zink_gfx_program` and \`zink_context`
|
---|
5611 | - zink: add \`needs_inlining` to \`zink_shader`
|
---|
5612 | - zink: implement flat shading using inlined uniforms
|
---|
5613 | - nir/zink: handle provoking vertex mode in \`nir_create_passthrough_gs`
|
---|
5614 | - zink: handle provoking vertex mode for filled quads
|
---|
5615 | - nir: keep xfb properties in nir_create_passthrough_gs
|
---|
5616 | - zink: keep xfb properties in quad emulation gs
|
---|
5617 | - zink: advertise support for the quad primitive
|
---|
5618 | - zink: prevent crash when freeing
|
---|
5619 | - zink: unified \`zink_set_primitive_emulation_keys` and \`zink_create_primitive_emulation_gs`
|
---|
5620 | - zink: zink: add \`parent` to \`zink_shader::non_fs`
|
---|
5621 | - zink: improve generated gs unbinding
|
---|
5622 | - zink: unbind generated gs in \`bind_last_vertex_stage`
|
---|
5623 | - zink/ci: remove \`primitive-id-no-gs-quads` from radv-vangogh-fails
|
---|
5624 | - nir: only handle flat interpolation when needed in \`nir_create_passthrough_gs`
|
---|
5625 | - zink: simplify logic to call \`zink_set_primitive_emulation_keys`
|
---|
5626 | - zink: add field to 'zink_gs_key' and enum
|
---|
5627 | - zink: add provoking vertex mode lowering
|
---|
5628 | - zink: always advertize provoking vertex mode support
|
---|
5629 | - zink: update requirements now that pv mode can be emulated
|
---|
5630 | - zink: add \`descriptor_bindless_id` to \`zink_shader_info`
|
---|
5631 | - zink: fix sampler array collision in \`nir_to_spirv`
|
---|
5632 | - zink: don't emulate edgeflags for patches
|
---|
5633 | - zink: use correct primitives for passthrough gs with tess
|
---|
5634 | - zink: fix pv mode lowring index calculation
|
---|
5635 | - zink: use ring buffer to preserve last element
|
---|
5636 | - zink: fix exit condition on pv emulation loop
|
---|
5637 | - zink: fix line strip offsets in pv mode emulation
|
---|
5638 | - zink: fix store subsitution in \`lower_pv_mode_gs_store`
|
---|
5639 | - zink: take location_frac into account in pv emulation
|
---|
5640 |
|
---|
5641 | driver1998 (1):
|
---|
5642 |
|
---|
5643 | - gallium: Use DETECT_OS_WINDOWS instead of 'WIN32'
|
---|
5644 |
|
---|
5645 | osy (1):
|
---|
5646 |
|
---|
5647 | - virgl: enable timer queries only if host supports it
|
---|
5648 |
|
---|
5649 | t0b3 (1):
|
---|
5650 |
|
---|
5651 | - nir/nir_opt_move: fix ALWAYS_INLINE compiler error
|
---|
5652 |
|
---|
5653 | volodymyr.o (1):
|
---|
5654 |
|
---|
5655 | - mesa ctx->API --> _mesa_is_foo(ctx)
|
---|
5656 |
|
---|
5657 | xurui (2):
|
---|
5658 |
|
---|
5659 | - panfrost: Check the return value of drmGetVersion
|
---|
5660 | - zink: bs->dd.push_pool[1].pool should be freed
|
---|