VirtualBox

source: vbox/trunk/src/VBox/Additions/3D/mesa/mesa-24.0.2/docs/relnotes/23.1.0.rst@ 105254

最後變更 在這個檔案從105254是 103996,由 vboxsync 提交於 12 月 前

Additions/3D/mesa: export mesa-24.0.2 to OSE. bugref:10606

檔案大小: 271.4 KB
 
1Mesa 23.1.0 Release Notes / 2023-05-10
2======================================
3
4Mesa 23.1.0 is a new development release. People who are concerned
5with stability and reliability should stick with a previous release or
6wait for Mesa 23.1.1.
7
8Mesa 23.1.0 implements the OpenGL 4.6 API, but the version reported by
9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
11Some drivers don't support all the features required in OpenGL 4.6. OpenGL
124.6 is **only** available if requested at context creation.
13Compatibility contexts may report a lower version depending on each driver.
14
15Mesa 23.1.0 implements the Vulkan 1.3 API, but the version reported by
16the apiVersion property of the VkPhysicalDeviceProperties struct
17depends on the particular driver being used.
18
19SHA256 checksum
20---------------
21
22::
23
24 a9dde3c76571c4806245a05bda1cceee347c3267127e9e549e4f4e225d92e992 mesa-23.1.0.tar.xz
25
26
27New features
28------------
29
30- VK_EXT_pipeline_library_group_handles on RADV
31- VK_EXT_image_sliced_view_of_3d on RADV/GFX10+
32- VK_KHR_map_memory2 on ANV and RADV
33- fullyCoveredFragmentShaderInputVariable on RADV/GFX9+
34- VK_EXT_discard_rectangles version 2 on RADV
35- VK_EXT_graphics_pipeline_library on RADV
36- extendedDynamicState3ColorBlendEquation on RADV
37- primitiveUnderestimation on RADV/GFX9+
38- VK_KHR_fragment_shading_rate on RADV/GFX11
39- VK_EXT_mesh_shader on RADV/GFX11
40- RGP support on RADV/GFX11
41- GL_NV_alpha_to_coverage_dither_control on r600/evergreen+
42
43
44Bug fixes
45---------
46
47- [radeonsi] flickering debug chunk border lines in Minecraft
48- radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues
49- [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@[email protected] (23.1.0-rc4)
50- vulkan/device_select: no way to select between GPUs of the same model due to bugs
51- Intel/anv: Modifier problems running gamescope embedded
52- radv: 7900 XTX hair flickering/rendering issues in VaM
53- radv: cache crashing
54- nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches
55- radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream
56- [regression] iris: unable to use driver as secondary GPU (primary AMD GPU)
57- iris: steam doesn't render on dg2
58- [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist
59- Vulkancts clipping / tesselation tests trigger gpu hang on DG2
60- Swaped fields in picture in vlc and mythtv if hw accel is on
61- WGL: Assert assigns dwThreadId variable
62- nine regression with r600 (bisected)
63- [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround)
64- radv: Jedi Fallen Order flickering & blocky plants
65- nouveau: NV50 (NVAC) broken in latest master
66- rusticl failed to build with rust-bindgen 0.65.0
67- Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation
68- vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830
69- Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton)
70- radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24
71- radv: Gotham Knights GPU hang with Navi 24
72- aco: s_load_dword with negative soffset cause GPU hang
73- piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down
74- overlay layer: unable to launch titles on steam
75- radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer
76- VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?)
77- radv: possibly not setting state dirty bits correctly
78- RADV: VRS attachment not working in specific scenario
79- rusticl: invalid SPIR-V kernel causes panic
80- [RADV] The Last Of Us Part 1: artifacting in the menu (with workaround)
81- AMD va-api outputs corrupt encoding
82- !20673 regressed \`dEQP-VK.wsi.xlib.surface.query_formats`
83- aco: missing dependency on generated header
84- zink: spirv validation errors with spirv 1.6
85- freedreno/a6xx: Assertion \`view->rsc_seqno == rsc->seqno' failed.
86- iris regression in map stride after import with gen9 parts
87- anv: zink ADL failures
88- Vulkan loader \`vk_common_GetPhysicalDeviceFormatProperties` fails to sanitize properties bits.
89- Loading a model in PrusaSlicer 2.6.0-alpha5 crashes GNOME on radeonsi
90- [glx][bisected][regression]Intel HD 3000 failing to create context on applications like Unity
91- v3d: dEQP-GLES3.functional.texture.specification.teximage2d_pbo.rg32f_cube and similar fails when run together with other tests
92- standalone glsl compiler not getting built with mingw
93- !22191 broke test-docs-mr
94- mesa: index buffer leaking
95- RadeonSI: null dereference in amdgpu_cs_add_buffer, potential refcount mismatch, running BeyondAllReason
96- NIR can't unroll any loop from nine
97- Steel Division 2 - radv/gpu hang - bisected
98- Turnip: VKD3D can't run due to lack of memory property flag
99- eglCreateImageKHR, error: EGL_BAD_ALLOC (0x3003), message: "createImageFromDmaBufs failed" on AMD multi-gpu with explicit format modifiers
100- radv: In the game Quake II RTX appeared artifacts at fresh mesa builds
101- radv: Vampire: The Masquerade - Bloodline (Unofficial Patch) regression
102- radeonsi broken for gcn1 card
103- libgrl.a installed but not used?
104- radv: crash compiling UE5 lumen hardware RT shader
105- spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
106- Panfrost T860 - broken system with latest mesa on gnome wayland jammy
107- aco: unused vtmp_in_loop
108- FTBFS: src/amd/llvm/ac_llvm_util.c:248:4: error: implicit declaration of function ‘LLVMAddIPSCCPPass’ (LLVM C interface removed upstream)
109- vulkan: new generated physical_device_feature missing meson dependency
110- Build broken on old-ish Python versions
111- radv: Support fullyCoveredFragmentShaderInputVariable from VK_EXT_conservative_rasterization on RDNA2+
112- radv,nir: dEQP-VK.ray_query.builtin.rayqueryterminate.* failures
113- RFE: Use _mesa_is_foo(ctx) helpers more
114- spec@ext_transform_feedback@builtin-varyings gl_culldistance fail
115- ci: infinite XDG_RUNTIME_DIR spam
116- ci: XDG_RUNTIME_DIR spam
117- [KBL] iris failures with dEQP-GLES3.functional.texture.compressed.astc.void_extent*
118- glsl compiled error when the RHS of operator \`>>` is int64_t by enabling GL_ARB_gpu_shader_int64 extension
119- turnip: inline uniforms regression
120- QPainter fails to render multiple shapes with a brush set since Mesa 23.0
121- eglSwapBuffers blocks in wayland when it's wl_surface_frame event is stolen.
122- plasmashell sometimes hangs with mesa_glthread
123- pps_device.h:23:11: error: ‘uint32_t’ does not name a type
124- Build fails with llvm 17: llvm/ADT/Triple.h: No such file or directory
125- nir: i2f32(i2i32(x@8)) isn't being collapsed to i2f32(x)
126- zink-lvp no longer running tests
127- radv: Immortals Fenyx Rising: Grass Flicker on R9 380X and Steam Deck
128- radv: A Plague Tale: Requiem black "flash" on 7900XTX
129- 7900 XTX: Graphical corruption / artifacts in Cyberpunk
130- radeonsi draws spurious values to depth buffer
131- Commit ccaaf8fe04c956d9f16f98b7f7fa69a2526283bc causes GPU ring timeouts on BONAIRE
132- radv: CmdCopyQueryPoolResults broken for VK_QUERY_TYPE_PRIMITIVES_GENERATED_EXT with queryCount > 1
133- rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
134- rusticl over llvmpipe + ffmpeg's Opencl filter = error -51
135- ci: Remove LAVA Gitlab section handler workaround
136- System freeze when playing some h264 videos with VA-API on Rembrandt
137- OpenGL crashes in X-Plane 11
138- r600/TURKS: NIR Shader related errors on CLI with the game "A Hat in Time" and Gallium Nine
139- agx: shifts aren't sound
140- ci: build logs hidden
141- fatal error: intel/dev/intel_wa.h: No such file or directory
142- [Bisected] Regression: Project Zomboid renders black
143- hasvk: Black pixels with 8xMSAA and fast clears on Intel(R) HD Graphics 4400 (HSW GT2)
144- radv: GTA IV graphical artifacts on 7900XTX
145- radv: Resident Evil Revelations 2 artifacts on 7900XTX with DCC
146- radv: Prototype 2 black textures on RDNA 3 when DCC is enabled
147- Mesa 23.0.0 crashes immediately with indirect rendering
148- virpipe-on-gl: arb_enhanced_layouts\@matching_fp64_types crashes
149- [RADV] Returnal - pistol muzzle flash fills whole screen (graphical artifact)
150- ACO: dEQP-VK.binding_model.descriptor_buffer.multiple.graphics_geom_buffers1_sets3_imm_samplers hangs on NAVI10
151- Build failures with recent lld
152- r600,regression: Glitches on terrain with the NIR backend on Transport Fever 2
153- [radeonsi] Regression with MSAA fix for Unreal / Unreal Tournament 99
154- spirv: Switch Vulkan drivers to use \`deref_buffer_array_length`
155- r600/TURKS: Crash of the game "A Hat in Time" with Gallium Nine and NIR path (third report)
156- [gen9atom] Vulkan tests cause gpu hang: dEQP-VK.memory_model.*
157- GL_SHADER_BINARY_FORMAT_SPIR_V is not added to the list of GL_SHADER_BINARY_FORMATS even if GL_ARB_gl_spirv is supported.
158- mesa: "Fragmented" dynamic lights in IronWail with \`r_fsaamode 1` on
159- [ANV/DG2] Vertex explosion in nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_gltf
160- CUEtools FLACCL hit assert in rusticl
161- Assertion Failed on Intel HD 5500 with Linux / Mesa 22.3.1 / OpenGL
162- Rise of the Tomb Raider's Ambient Occlusion pass misrenders (swimming shadows)
163- vk_enum_to_str: missing VkPipelineCreateFlags
164- [glsl] [spirv] ssbo unsizied array not supported ?
165- Creating a vulkan physical device on an AMD GPU causes following calls to drmModeAddFB to fail with ENOENT
166- Minecraft: spec related compile errors
167- mesa: _mesa_glthread_upload crash
168- glthread: OpenGL submission blocks while swapping buffers
169- glthread: Loading a shader cache in yuzu slows down with mesa_glthread=true
170- Commit ""radeonsi: enable glthread by default"" (d6fabe49cd72fb) causes a regression in gstreamer gtkglsink element
171- llvmpipe: linear rasterizer / depth bug
172- radv: (Using mesh shader) NIR validation failed after nir_lower_io_to_scalar_early
173- panfrost Mali-G31 glamor regression
174- allwinner a64: DRM_IOCTL_MODE_CREATE_DUMB failed: Cannot allocate memory after some time of apps usage
175- turnip: dEQP-VK.ubo.random.all_shared_buffer.48 slow
176- wine + dxvk + Rise of the Tomb Raider crashes in Soviet Installation 20% with VK_ERROR_DEVICE_LOST
177- Sometimes VLC player process gets stuck in memory after closure if video output used is Auto or OpenGL
178- ci: Remove LAVA Gitlab section handler workaround
179- kwin_wayland crashes involving dri2_create_drawable when Plasma starts and the llvmpipe driver from Mesa 23.0-rc3 and 23.0-rc4 is used
180- turnip: no ubwc fast clear for depth on a618
181- anv: VK_ACCESS_2_SHADER_READ_BIT doesn't seem to be handled correctly
182- Vulkan WSI flags leak into NIR, breaking build on BSDs
183- Iris corruptions in zoom calls
184- Sampling with aux enabled with ISL_AUX_STATE_PASS_THROUGH seems broken on Tigerlake+
185- anv: incorrect task shader payload
186- radv: Hi-Fi Rush incorrectly rendering face shadows with DCC on 7900 XTX
187- [iris] isl_calc_min_row_pitch seems incorrect on a750
188- DG2: incorrect rendering in Sascha Willems raytracing callable demo
189- turnip: conditional load/store hurts some workloads
190- Some blackouts / rendering issues with RADV_PERFTEST=gpl in Battlefield 1 (DX11)
191- radv/zink: ACO assert with DOOM2016
192- Registered special XGE not unregistered
193- draw_llvm.c:788:7: error: implicit declaration of function ‘LLVMContextSetOpaquePointers’
194- asahi: Optimize lower_resinfo for cube maps
195- Metro Exodus hits nir validation with a driver supporting raytracing.
196- ANV Gen 9.5 swapchain corruption when using newer \`VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL` layout
197- turnip: optimal bin layout
198- piglit.spec.arb_shader_texture_image_samples.compiler.fs-image-samples_frag regression
199- lavapipe assert fails on Windows
200- zink: itoral-gl-terrain-demo rendering failure
201- asahi: Implement a shader disk cache
202- [ICL] Trine 4 trace causing GPU HANG
203- radv: Segfault during createAccelerationStructure when the backing buffer is not bound to memory
204- 7900 XTX stuck at 'compiling shaders' in Monster Hunter Rise
205- radv: slow GPL (fast) link times
206- libEGL warning: failed to get driver name for fd -1
207- iris: Context incorrectly marked as guilty
208- simple_mtx.h:34:12: fatal error: valgrind.h: No such file or directory
209- [ANV] Commit 4ceaed78 causes misrendering on Cyberpunk 2077
210- adding eglGetMscRateANGLE support for multiple monitors with different refresh rates
211- Performance regression in Chromium WebGL when implement ANGLE_sync_control_rate with egl/x11
212- ci: Ensure that the Intel/Freedreno trace pipelines only show up in relevant MR's
213- anv: Performance issue with Vulkan on Wayland KWin
214- Incorrect format conversion on big endian
215- radv: State of Decay 2 character rendering regression
216- aco_tests assembler.gfx11.vop12c_v128/gfx11 failure
217- r600,regression: Loading of DOOM stuck at 0% with the NIR backend
218- RADV: enabling TC-compat HTILE in GENERAL for compute queues is likely broken
219- Confidential issue #8065
220- VAAPI HEVC encode broken since 22.3
221- GPU HANG: ecode 12:1:859ffffb (Resetting rcs0 for stopped heartbeat on rcs0) - reproducible
222- zink: src/gallium/auxiliary/pipebuffer/pb_slab.c:138: Assertion failed: \`heap < slabs->num_heaps`
223- [zink] Assertion \`heap < slabs->num_heaps' failed on Pascal (bisected)
224- [RADV] Incorrect copies to/from compressed textures with mipmaps
225- mesa_glthread=true and probably ANY id Tech 3 engine games, offroad...
226- radeonsi: VRAM Leak/abnormally high usage in Minecraft mod pack
227- nir/lower_blend: Bogus assert
228- anv-tgl-vk: fails a multiple jobs after changing sharding
229- radv CTS crashes since ebec42d799b22b7b3d06acd710f5687252446a06
230- llvmpipe: dEQP-EGL programs.link failures.
231- libmesa_util depends on gallium
232- EGL report EGL_EXT_create_context_robustness with kms_dri drvier while can't create context with EGL_LOSE_CONTEXT_ON_RESET_EXT attribute.
233- v3d: missing drm format modifier support on Raspberry Pi 4 required for mpv
234- Return To Monkey Island black screen
235- Return To Monkey Island black screen
236- navi22 amdgpu: bo 000000002843d677 va 0x0800000400-0x08000005ff conflict with 0x0800000400-0x0800000600
237- Ryzen 6800H laptop amdgpu: bo 00000000b1eb583a va 0x0800000200-0x08000003ff conflict with 0x0800000200-0x0800000400
238- [RADV] [MISSED PERFORMANCE POTENTIAL] Vulkan not working when Color Depth is set to "16", but Vulkan works when Color Depth is set to "24"
239- v3dv: f2f16_rtz lowering could be improved
240- debug build compilation failed: inlining failed in call to ‘always_inline’ ‘src_is_ssa’: indirect function call with a yet undetermined callee
241- radv: regression: broken UI rendering in Elden Ring
242- radv: Missing implementation of VkImageSwapchainCreateInfoKHR and VkBindImageMemorySwapchainInfoKHR
243
244
245Changes
246-------
247
248Adam Jackson (22):
249
250- glx/dri3: Simplify protocol version tracking
251- glx: Remove glx_context::screen
252- glx: Remove a can't-happen NULL check
253- glx: Remove support for glXGetDriverConfig for old drivers
254- glx: Clean up some funny business from context bind/unbind
255- glx: Reflow MakeContextCurrent a little
256- glx: Check for initial "glX" first in glXGetProcAddress
257- glx: Move 1.2 GLXPixmap code into glx_pbuffer.c
258- glx: Inline a few single-use constant strings into their user
259- glx: Fix drawable type inference in visual/fbconfig setup
260- glx: Harmonize glXCreateGLXPixmap with glXCreatePixmap
261- mesa: Fix extension table formatting
262- mesa: Trivially advertise NV_generate_mipmap_sRGB
263- wsi/x11: Make get_sorted_vk_formats handle varying channel widths
264- wsi/x11: Infer the default surface format from the root window's visual
265- wsi/x11: Support depth 16 visuals
266- glx/dri: Use X/GLX error codes for our create_context_attribs
267- dri: Validate more of the context version in validate_context_version
268- glx/dri: Fix error generation for invalid GLX_RENDER_TYPE
269- glx: Disable the indirect fallback in CreateContextAttribs
270- glx: Fix error handling yet again in CreateContextAttribs
271- mesa: Enable NV_texture_barrier in GLES2+
272
273Adam Stylinski (2):
274
275- glx: fix a macro being invoked with the wrong parameter name
276- mesa: fix out of bounds stack access on big endian
277
278Alan Coopersmith (1):
279
280- util/disk_cache: Handle OS'es without d_type in struct dirent
281
282Alejandro Piñeiro (17):
283
284- vulkan/wsi: check if image info was already freed
285- v3dv/format: remove unused v3dv_get_tex_return_size
286- v3dv/pipeline: rename lower_tex_src_to_offset to lower_tex_src
287- v3dv: pass alignment to v3dv_buffer_init
288- v3dv/image: use 64-byte alingment for linear images if needed
289- v3dv: skip two ycbcr tests
290- broadcom/compiler: v3d_nir_lower_txf_ms doesn't need v3d_compile
291- broadcom/compiler: treat PIPE_FORMAT_NONE as 32-bit formats for output type
292- v3dv: enable shaderStorageImageReadWithoutFormat
293- broadcom/compiler: fix indentation at v3d_nir_lower_image_load_store
294- nir: track if var copies lowering was called
295- radv: use shader_info->var_copies_lowered
296- anv: use shader_info->var_copies_lowered
297- v3d/v3dv: use shader_info->var_copies_lowered
298- v3dv: handle ASPECT_MEMORY_PLANE aspect flags when getting plane number
299- v3dv/debug: add debug option to disable TFU codepaths
300- v3dv/pipeline: use pipeline depth bias enabled to fill up CFG packet
301
302Alexandros Frantzis (2):
303
304- egl/wayland: Fix destruction of event queue with proxies still attached.
305- vulkan/wsi/wayland: Fix destruction of event queue with proxies still attached.
306
307Alyssa Rosenzweig (351):
308
309- nir/peephole_select: Allow load_preamble
310- agx: Peephole select after opt_preamble
311- asahi: Handle sampler->compare_mode
312- panfrost: Don't use AFBC of sRGB luminance-alpha
313- pan/bi: Fix incorrect compilation of fsat(reg.yx)
314- pan/bi: Add a unit test for fsat(reg.yx)
315- panfrost: Enable NV_primitive_restart on Valhall
316- panfrost: Fix logic ops on Bifrost
317- panfrost: Stop testing CAP_INT16
318- panfrost: Remove PAN_MESA_DEBUG=deqp
319- panfrost: Remove unused debug parameter
320- panfrost: Fix clears with conditional rendering
321- panfrost: Document render_condition_check contract
322- nir: Add Midgard-specific fsin/fcos ops
323- nir: Optimize vendored sin/cos the same way
324- pan/mdg: Use special NIR ops for trig scaling
325- pan/mdg: Scalarize LUT instructions in NIR
326- pan/mdg: Remove MSGS debug
327- mesa: Set info.separate_shader for ARB programs
328- nir/lower_blend: Fix alpha=1 for RGBX format
329- nir/lower_blend: Clamp blend factors
330- nir/lower_blend: Fix SNORM logic ops
331- nir/lower_blend: Avoid useless iand with logic ops
332- nir/lower_blend: Don't do logic ops on pure float
333- nir/lower_blend: Handle undefs in stores
334- nir/lower_blend: No-op nir_color_mask if no mask
335- asahi: Omit extra call to clock_gettime
336- nir/opt_preamble: Treat \*size as an input
337- nir/opt_preamble: Consider load_preamble as movable
338- agx: Lower system values in NIR in the driver
339- agx: Bump preamble_storage_size to 512
340- agx: Centralize texture lowering
341- asahi: Use non-UAPI specific BO create flags
342- nir: Add a late texcoord replacement pass
343- asahi: Run nir_lower_fragcolor during preprocessing
344- asahi: Lower texcoords late
345- panfrost: Implement GL_EXT_render_snorm on Bifrost+
346- ail: Add layout->mipmapped_z input
347- ail: Test mipmapped_z behaviour
348- ail: Test 63x63 cube map
349- asahi: Set layout->mipmapped_z for 3D textures
350- asahi: Fix encoding of uniform size
351- asahi: Strengthen agx_usc_uniform contract
352- asahi/nir_lower_sysvals: Split large ranges
353- asahi: Correct alignment for USC Uniform packets
354- agx: Support uniform registers as LODs
355- asahi: Use writeback when it looks beneficial
356- asahi: Make STAGING resources linear
357- asahi: Prefer blit-based texture transfer
358- asahi: Implement nontrivial rasterizer discard
359- asahi: DRY dirty tracking conditions
360- asahi: Remove redundant tri merge disable bit
361- asahi: Merge fragment control XML
362- agx: Keep varyings forwarded to texture as fp32
363- asahi: Don't use 16-bit inputs to 32-bit st_tile
364- docs/asahi: Document clip distance varyings
365- agx: Fix storing to varying arrays
366- agx: Handle constant-offset in address matching
367- asahi: Add XML for custom border colours
368- agx/decode: Add a data parameter to stateful
369- agx/decode: Handle extended samplers
370- asahi: Implement custom border colours
371- asahi: Fix delete_vs_state implementation
372- asahi: Add compute kernel scaffolding
373- asahi: Don't leak shader NIR
374- asahi: Add hooks for SSBO and images
375- asahi: Fake more caps for dEQP-GLES31
376- asahi: Advertise seamless cube maps
377- asahi: Stub out MSAA for dEQP
378- asahi: Bump PIPE_CAP_MAX_TEXTURE_ARRAY_LAYERS
379- asahi: Add compute batches
380- asahi: Implement load_ssbo_address/get_ssbo_size
381- asahi: Identify more compute-related XML
382- agx: Implement compute ID intrinsics
383- agx: Implement barriers
384- nir/print: Extract get_location_str
385- nir/print: Pretty-print I/O semantic locations
386- nir/print: Pretty-print color0/1_interp
387- agx: Allow uniform sources on phis
388- agx: Run DCE twice
389- agx: Lower uniform sources with a dedicated pass
390- agx: Don't scalarize preambles in NIR
391- nir/lower_clip: Only emit 1 discard
392- tu,vulkan: Add common Get*OpaqueCaptureDescriptorDataEXT
393- radv: Use common Get*OpaqueCaptureDescriptorDataEXT
394- agx: Remove unused AGX_MAX_VARYINGS
395- agx: Respect component in frag load_input
396- agx: Fix AGX_MAX_CF_BINDINGS
397- agx: Remove bogus gl_Position assertion
398- agx: Implement load_helper_invocation
399- agx: Write sample mask even with no colour output
400- asahi: Submit batches that don't touch RTs
401- asahi: Add XML for indirect dispatch
402- asahi: Add XML for indirect draws
403- asahi: Add XML for VDM memory barriers
404- panvk: Take lock when tracing
405- panvk: Fix varying linking
406- panvk: Disable SNORM rendering
407- asahi: Remove default=true on index list values
408- asahi: Refactor index buffer upload for indirect
409- asahi: Implement indirect draws
410- panfrost: Fix some fields in v10.xml
411- pan/decode: Add support for decoding CSF
412- asahi: Vectorize background colour load
413- panfrost: Disable CRC by default
414- panfrost: Fix prim restart XML on Valhall
415- nir: Augment raw_output_pan with IO_SEMANTICS+BASE
416- pan/lower_framebuffer: Operate on lowered I/O
417- nir/lower_blend: Don't touch store->dest
418- nir/lower_blend: Don't handle gl_FragColor
419- nir/lower_blend,agx,panfrost: Use lowered I/O
420- asahi: Lower clip distances late
421- asahi: Move agx_preprocess_nir to CSO create
422- agx: Don't treat clip distances specially
423- agx: Do more work in agx_preprocess_nir
424- asahi: Fix rendering into mipmapped framebuffers
425- agx: Lower offsets in NIR
426- agx: Model and pack gathers
427- agx: Implement gathers (nir_texop_tg4)
428- docs/features: Sync Asahi with reality
429- asahi: Advertise ARB_derivative_control
430- asahi: Advertise ARB_texture_barrier
431- agx: Model atomic instructions
432- agx: Model local loads/stores
433- agx: Disallow immediate bases to device_load
434- agx: Pack global atomics
435- agx: Pack local load/store instructions
436- agx: Translate NIR atomics
437- agx: Translate load/store_shared
438- agx: Lower shared memory offsets to 16-bit
439- agx: Pack local atomics
440- agx: Implement b2b32
441- agx: Handle group_memory_barrier
442- agx: Add and use agx_nir_ssa_index helper
443- agx: Handle ssa_undef as zero
444- agx: Add agx_internal_format_supports_mask helper
445- asahi: Implement color masks with masked stores
446- asahi: Make shader-db work again
447- panfrost: Use proper locations in blend shaders
448- nir/lower_blend: Consume dual stores
449- nir: Add nir_texop_lod_bias_agx
450- asahi: Lower lod_bias_agx to uniform registers
451- agx: Lower sampler LOD bias
452- nir/lower_blend: Don't dereference null
453- docs/feature: Mark ARB_sync as done on Asahi
454- asahi/decode: Handle VDM barriers
455- nir: Add nir_lower_helper_writes pass
456- pan/mdg: Use nir_lower_helper_writes
457- asahi: Advertise dual-source blending
458- agx: Mask shifts in the backend
459- agx: Fix 2D MSAA array texture register allocation
460- asahi: Mark PIPE_FORMAT_NONE "supported"
461- agx: Don't write sample mask from preambles
462- agx: Add AGX_MESA_DEBUG=nopreamble option
463- agx: Clean up after lowering address arithmetic
464- agx: Factor out allows_16bit_immediate check
465- agx: Inline 16-bit load/store offsets
466- agx: Constify agx_print
467- agx: Refactor vector creation
468- agx: Use agx_emit_collect for st_tile
469- agx: Don't print pre-optimization shader
470- agx: Only lower int64 late
471- asahi: Bump shader buffers
472- asahi/meta: Use lowered I/O
473- agx: Disable tri merging with side effects
474- agx: Handle fragment shader side effects
475- asahi: Rework system value lowering
476- asahi: Wire up compute kernels
477- nir/lower_tex: Add lower_index_to_offset
478- pan/bi: Use lower_index_to_offset
479- ir3: Use lower_index_to_offset
480- nir/opt_barrier: Generalize to control barriers
481- glsl/nir: Use scoped_barrier for control barrier
482- pan/bi: Drop control_barrier handling
483- pan/mdg: Drop control_barrier handling
484- ir3: Drop non-scoped barrier handling
485- gallivm: Drop non-scoped barrier handling
486- agx/lower_address: Break on match
487- agx/lower_address: Optimize "shift + constant"
488- agx/lower_address: Handle large shifts
489- agx/lower_address: Handle 8-bit load/store
490- agx/lower_address: Fix handling of 64-bit immediates
491- agx/lower_address: Handle 16-bit offsets
492- agx: Assert that memory index is 32-bit reg
493- agx: Fix clang-formatting
494- agx: Pack indirect texture/sampler handles
495- agx: Handle indirect texture/samplers
496- asahi: Don't allow linear depth/stencil buffers
497- asahi, agx: Implement dummy samplers
498- asahi,agx: Implement buffer textures with gnarly NIR
499- panfrost: Remove some unused definitions
500- docs/panfrost: Move description of instancing
501- panfrost: Don't use DECODE_FIXED16 for sample position
502- panfrost: Handle fixed-point packing in GenXML
503- panfrost: Add XML for framebuffer pointers
504- panfrost: Use framebuffer pointer XML
505- panfrost: Remove FBD tag enum from XML
506- panfrost: Inline the last MALI_POSITIVE use
507- panfrost: Remove MALI_POSITIVE macro
508- pan/mdg: Remove reference to removed macro
509- agx: Don't set lower_pack_split
510- agx: Make partial DCE optional
511- agx: Fix subdivision coalescing
512- agx: Implement extract_[ui]16
513- agx: Use nir_lower_mem_access_bit_sizes
514- agx: Switch to scoped_barrier
515- nir/lower_point_size: Use shader_instructions_pass
516- ail: Restructure generated tests
517- agx: Lower discard late
518- util/prim_convert: Don't set index_bounds_valid
519- pan/bi: Ignore signedness in vertex fetch
520- panfrost: Identify "Base vertex offset" signedness
521- panfrost: Assert that we don't see unsupported vertex formats
522- panfrost: Defeature 24-bit textures
523- panfrost: Handle null textures robustly
524- panfrost/ci: Skip draw_buffers_indexed.random.* on Midgard
525- panfrost/ci: Identify some Piglit flakes
526- panfrost/ci: Add some Piglit skips
527- panfrost/ci: Remove fbo-mrt-new-bind fail+flake
528- panfrost: Note glDrawRangeElements underflow
529- asahi: Fix occlusion query lifetime
530- panfrost: Don't round up Midgard polygon list BOs
531- panvk: Use vk_get_physical_device_features
532- asahi: Use a dynarray for writers
533- ci: Add clang-format to the amd64 container
534- ci: Enforce clang-format for asahi
535- gallium: Fix u_stream_outputs_for_vertices with QUADS
536- nir/builder: Add nir_umod_imm helper
537- blorp,anv,hasvk: Use umod_imm
538- v3d,v3dv: Use udiv_imm/umod_imm
539- radv: Use umod_imm
540- ir3: Use umod_imm
541- nir: Add Panfrost intrinsics to lower sample mask
542- nir: Add Mali load_output taking converison
543- panfrost: Use 0/~0 boolean for MSAA sysval
544- pan/bi: Don't duplicate texture op cases
545- pan/bi: Lower sample mask writes in NIR
546- pan/bi: Lower load_output to make sysval explicit
547- pan/bi: Allow specializing bifrost_nir_options by arch
548- pan/bi: Lower gl_VertexID in NIR
549- pan/bi: Remove bi_load_sysval
550- pan/mdg: Use I/O semantics for MRT blend stores
551- panfrost: Remove inputs->blend.rt
552- panfrost: Remove unused inputs.nr_cbufs
553- pan/bi: Only lower once
554- pan/mdg: Only lower once
555- pan/bi: Split out early preprocessing from late
556- pan/mdg: Split out early preprocessing from late
557- pan/lower_framebuffer: Only call for FS
558- pan/lower_framebuffer: Use nir_shader_instructions_pass
559- pan/blit: Lower load_sampler_lod_parameters_pan
560- panfrost: Preprocess shaders in the driver
561- pan/lower_framebuffer: Lower MSAA blend shaders
562- panfrost: Lower clip_fs late
563- panfrost: Lower texcoords late
564- panfrost: Effectively lower gl_FragColor late
565- panfrost: Preprocess shaders at CSO create time
566- panfrost: Remove stale TODO
567- panvk: Lower sysvals in NIR
568- panvk: Don't use vec4 for vertex_instance_offsets
569- panvk: Inline blend constants as syvals
570- panfrost: Add NIR-based sysval lowering pass
571- panfrost: Lower sysvals in GL
572- panfrost: Move sysvals to GL driver struct
573- panvk: Remove unused function
574- panfrost: Move panfrost_sysvals to GL driver
575- pan/bi: Export bifrost_nir_lower_load_output
576- pan/bi: Call pan_nir_lower_zs_store late
577- panvk: Lower blending late
578- panfrost: Remove Midgard RSD fields from Bifrost
579- asahi: Convert to SPDX headers
580- mesa/st: Only set seamless for GLES3
581- mesa/st: Normalize wrap modes for seamless cubes
582- asahi: Don't lie about seamless cube maps
583- panfrost: Print perf debug when flushing everything
584- panfrost: Print perf debug on seqnum overflow
585- panfrost: Don't redundantly call emit_const_buf
586- panfrost: Mark packs as ALWAYS_INLINE
587- panfrost: Don't update access with a single batch
588- panfrost: Add a v9 fast path for no images
589- panfrost: Clean up tiler calculations
590- panfrost: Estimate vertex count for hier mask
591- panfrost: Choose hierarchy masks by vertex count
592- docs: Remove docs about macOS hardware drivers
593- nv50,nvc0: Use u_pipe_screen_get_param_defaults
594- panfrost: Always upload a workaround sampler
595- pan/{mdg,bi}: Always use sampler 0 for txf
596- panfrost: Unset TEXTURE_BUFFER_SAMPLERS
597- gallium: Remove PIPE_CAP_TEXTURE_BUFFER_SAMPLER
598- docs/gallium: Note samplers are not used for txf
599- nir/print: Don't print sampler_index for txf
600- asahi: Support more renderable formats
601- agx: DCE even with noopt
602- agx: Assert that we don't overflow registers
603- agx: Constify agx_{read,write}_registers
604- agx: Don't allow uniform source to local_atomic
605- agx: Don't destroy usub_sat with constant
606- asahi: Add perf debug for generate_mipmap
607- asahi: Add perf debug for shader variants
608- agx: Set loads_varying accurately
609- agx: Add helper for calculating occupancy
610- asahi/decode: Remove agxdecode_dump_bo
611- asahi/decode: Print VDM barriers
612- asahi: Set PIPE_CAP_LOAD_CONSTBUF
613- agx: Coalesce more collects
614- agx: Don't overallocate registers
615- asahi: Honour sampler count
616- asahi: Implement null textures
617- asahi: Lower 1D to 2D
618- asahi: Dirty track depth bias uploads
619- asahi: Clamp texture buffer sizes
620- agx: Tease apart some sample_mask packing magic
621- agx: Rename writeout to wait_pix
622- agx: Make signal_pix instructions explicit
623- vulkan: Add common features2_to_features
624- radv: Use vk_features2_to_features
625- v3dv: Use vk_features2_to_features
626- lavapipe: Use vk_features2_to_features
627- pvr: Use vk_features2_to_features
628- anv,hasvk: Use vk_features2_to_features
629- tu: Use vk_features2_to_features
630- nir: Combine if_uses with instruction uses
631- nir/opt_ray_queries: Don't use list_length
632- nir/opt_loop_unroll: Avoid list_length
633- nir: Remove 2nd argument from nir_before_src
634- nir/validate: Don't treat if-uses specially
635- dxil: Avoid list_length
636- nir: Reduce indirection
637- nir: Factor out nir_src_rewrite_ssa helper
638- nir: Use nir_src_rewrite_ssa
639- dxil: Use nir_src_rewrite_ssa
640- nir: Remove nir_if_rewrite_condition_ssa
641- nir/repair_ssa: Refactor some use handling
642- nir/validate: Only walk uses once
643- mailmap: Update my e-mail
644- panfrost: Symlink gallium .clang-format to common
645- panfrost/winsys: Add .clang-format for winsys folder
646- panfrost/winsys: Clang-format
647- pan/decode: Move comment out of designated initializer
648- panfrost: Re-run clang-format
649- panvk: Clang-format
650- ci: Run clang-format on panfrost
651- mesa/st: Set uses_sample_shading when forcing per-sample
652- nir/lower_blend: Set uses_fbfetch_output conservatively
653- nir/lower_blend: Enable per-sample shading
654- pan/bi: Lower swizzles for 8-bit CSEL
655- pan/bi: Respect swizzles for more vector ops
656- pan/bi: Use nir_lower_mem_access_bit_sizes
657- panfrost: Allocate shared memory in OpenCL
658- pan/decode: Print compute job payloads
659- asahi: Fix disk cache disable with AGX_MESA_DEBUG
660
661Amber (15):
662
663- util/u_trace: pass utrace context to marker functions.
664- freedreno: add support for markers.
665- ir3, isaspec: add raw instruction to assembler/disassembler.
666- ir3: support texture and sampler index with offsets
667- nir: support lowering nir_intrinsic_image_samples to a constant load
668- ir3: use lower_image_samples_to_one
669- intel/compiler: use lower_image_samples_to_one
670- freedreno: make sure depth/stencil layouts are always tiled
671- freedreno: use A6XX_GRAS_SC_CNTL_SINGLE_PRIM_MODE with fb readback
672- gallium: make BlendCoherent usable from gallium drivers
673- freedreno: use blendcoherent to set FLUSH_PER_OVERLAP
674- freedreno: check for conditional rendering in launch_grid
675- nir: allow nir_lower_fb_read to support multiple render targets
676- nir: Add memory coherency information to shaders.
677- freedreno, nir, ir3: implement GL_EXT_shader_framebuffer_fetch
678
679Andres Calderon Jaramillo (1):
680
681- r600: Report multi-plane formats as unsupported
682
683André Almeida (2):
684
685- radv: Implement vk.check_status
686- winsys/amdgpu: Fix amdgpu_cs_query_reset_state2 error log
687
688Antonio Gomes (11):
689
690- rusticl: Enabling reading/writing for images created from buffers
691- rusticl: Enabling image fill for images created from buffers
692- rusticl: Enable copy for images created from buffers
693- rusticl: Enable mapImage for images created from buffers
694- gallium, rusticl: Add tex2d_from_buf in image_view and sampler_view
695- mesa/st, nine, nouveau: Fix uninitialized pipe_sampler_view structs
696- lvmpipe/cs: Add support for 2d images created from buffers
697- gallium: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
698- rusticl: Implement spec for cl_khr_image2d_from_buffer
699- llvmpipe: Add new caps PIPE_CAP_LINEAR_IMAGE_(PITCH_ALIGNMENT|BASE_ADDRESS_ALIGNMENT)
700- iris: Add support for 2d images created from buffers
701
702Anuj Phogat (3):
703
704- anv: implement TES distribution mode WA 22012785325
705- iris: implement TES distribution mode WA 22012785325
706- intel/genxml/125: Add preferred SLM allocation size field
707
708Asahi Lina (43):
709
710- asahi: Split off common BO code into its own file
711- asahi: Split off macOS support into its own file
712- asahi: Refuse to transfer out-of-bounds mip levels
713- meson: Fix Asahi build on macOS
714- asahi: Fix shader key cloning overreads
715- asahi: Do not use memctx for pools / meta cache
716- asahi: Drop agx_device.memctx
717- asahi: Only apply FS lowerings to fragment shaders
718- asahi: Add BO_SHAREABLE flag
719- asahi: Add readonly BO flag
720- asahi: Identify USC cache invalidate
721- asahi: Flush USC caches on the first draw
722- asahi: Drop macOS backend
723- asahi: Add nocluster,sync,stats debug flags
724- asahi: Align device submission API with upcoming UAPI
725- asahi: Implement Linux driver scaffolding, sans UAPI
726- asahi: Add APIs for DMA-BUF sync file import/export
727- asahi: Add agx_debug_fault() helper
728- asahi: Add result buffer to context/batches
729- asahi: Add agx_bo_mmap() calls to transfer path
730- asahi: Pull device name from device struct
731- asahi: Do not overread user index buffers
732- asahi: Fix scissor culling check when out of bounds for FB/viewport
733- asahi: Fix device fd leak in agx_close_device
734- asahi: Destroy the renderonly context on screen destroy
735- asahi: clang-format the world again
736- asahi: Assert on TIB strides > 64
737- asahi: Support importing sync objects on BO export
738- asahi: Make agx_flush_resource reallocate non-shareable resources
739- asahi: Extend batch tracking for explicit sync
740- Revert "asahi: Advertise dual-source blending"
741- asahi: Make agx_alloc_staging() take a screen instead of a context
742- asahi: Enable glthread
743- asahi: Locate low VA BOs correctly
744- asahi: Fix style nits
745- asahi: Implement valid buffer range tracking
746- asahi: Make BO import path failures more robust
747- asahi: Add a helper macro for debug/error messages
748- asahi: Add resource debugging
749- asahi: Print reasons why compression is disabled
750- asahi: Fix compressed ZS support
751- asahi: Flip kmsro around to allocate on the GPU
752- asahi: Allow explicit non-LINEAR modifiers for scanout
753
754Axel Davy (1):
755
756- frontend/nine: Fix num_textures count
757
758Bas Nieuwenhuizen (26):
759
760- aco: Pass correct number of coords to Vega 1D LOD instruction.
761- radv: Strictly limit alignment needed within a descriptor set.
762- radv: Reduce descriptor pool allocation for alignment.
763- radv: Set FDCC_CONTROL SAMPLE_MASK_TRACKER_WATERMARK
764- radv: Shift left the tile swizzle more on GFX11.
765- nir: Apply a maximum stack depth to avoid stack overflows.
766- radv: Add helper to hash stages.
767- radv: Hash group handles as part of RT pipeline key.
768- radv: Use provided handles for switch cases in RT shaders.
769- radv: Use group handles based on shader hashes.
770- radv: Implement & expose VK_EXT_pipeline_library_group_handles.
771- Update my mailmap aliases
772- ac/surface,radv: Avoid pitch weirdness if image not used for rendertarget.
773- ac/surface: Only allow stencil pitch adjustment for mipmaps.
774- ac/surface,radv: Opt out of stencil adjust.
775- util: Add aligned int64_t types for x86(non 64).
776- util/disk_cache: Align atomic size.
777- radv: Align atomic values.
778- radv: Reserve space in framebuffer emission.
779- radv: Reserve space in various streamout functions.
780- radv: Reserve space in conditional rendering functions.
781- radv: Reserve space in si_cs_emit_cache_flush.
782- radv: Reserve space for updating DCC metadata.
783- radv: Reserve space for fast clear related writes.
784- radv: Reserve space for indirect descriptor set address writes.
785- radv: Move all the dirty flags from TES binding to TCS binding.
786
787Benjamin Cheng (1):
788
789- radv: initialize cmd_buffer upload list earlier
790
791Boyuan Zhang (6):
792
793- radeonsi/vcn: check fence before destroying dpb
794- radeonsi/vcn: check fence before destroying decoder
795- radeonsi/vcn: validate fence handle before using it
796- virgl/video: disable decoder fence
797- virgl: add more formats to conv table
798- frontends/va: check decoder in va surface call
799
800Brian Paul (9):
801
802- anv: add a third memory type for LLC configuration
803- llvmpipe: do additional checks in lp_state_fs_analysis.c for linear shaders
804- llvmpipe: remove debug printf spam in lp_setup_wait_empty_scene()
805- gallium/xlib: call fence_finish() in XMesaSwapBuffers()
806- llvmpipe: fix ps invocations query bug
807- llvmpipe: rename some vars related to occlusion query and ps invocations
808- llvmpipe: s/tabs/spaces/
809- llvmpipe: s/unsigned/enum pipe_query_type/
810- llvmpipe: clean-up llvmpipe_get_query_result()
811
812Błażej Szczygieł (1):
813
814- glx: Fix glXGetFBConfigFromVisualSGIX
815
816Caio Oliveira (26):
817
818- glsl: Account for unsized arrays in NIR linker
819- hasvk: Update driver name in debug information
820- intel: Add extra zeros at the end of debug identifiers
821- iris, crocus: Align workaround address to 32B
822- anv, hasvk: Align workaround address to 32B
823- nir: Add nir_intrinsic_rotate
824- nir/lower_subgroups: Add option lower_rotate_to_shuffle
825- spirv: Implement SPV_KHR_subgroup_rotate
826- nir: Support use_scoped_barrier in nir_lower_atomics_to_ssbo
827- microsoft/compiler: Handle scoped barrier in Tess splitting
828- gallivm: Fix handling of nir_intrinsic_scoped_barrier
829- glsl: Implement use_scoped_barrier option for lowering memory barriers
830- intel/compiler: Mark various memory barriers intrinsics unreachable
831- pan/compiler: Fix handling of nir_intrinsic_scoped_barrier
832- pan/midgard: Handle nir_intrinsic_scoped_barrier in Midgard compiler
833- panfrost: Use NIR scoped barriers instead of memory barriers
834- spirv: Don't specify nir_var_uniform or nir_var_mem_ubo in barriers
835- spirv/tests: Subclass spirv_test helper to namespace the tests
836- spirv/tests: Add script to generate C array from SPIR-V source
837- spirv/tests: Parametrize stage in get_nir() helper
838- spirv/tests: Add some basic control flow tests
839- spirv: Add skip_os_break_in_debug_build option to use in unit tests
840- intel/fs: Handle scoped barriers with execution scope
841- intel/vec4: Handle scoped barriers with execution scope
842- intel/compiler: Drop brw_nir_lower_scoped_barriers
843- intel/compiler: Drop non-scoped barrier handling
844
845Caleb Cornett (6):
846
847- d3d12: Lower minimum supported Shader Model to 6.0
848- futex: Change INT_MAX to INT32_MAX.
849- util: Add #ifdefs for Xbox GDK support.
850- dxil_validator: Add support for Xbox GDK.
851- wgl: Add support for Xbox GDK.
852- d3d12: Add support for Xbox GDK.
853
854Charlie Birks (1):
855
856- docs: add a few vulkan extensions supported by multiple drivers
857
858Charmaine Lee (5):
859
860- svga: fix resource_get_handle from resource created without SHARED bind flag
861- svga: fix compatible formats for shareable surfaces
862- svga: use upload buffer if texture has pending changes
863- translate: do not clamp element index in generic_run
864- svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device
865
866Chia-I Wu (38):
867
868- turnip: replace TU_DEBUG_DONT_CARE_AS_LOAD by a bool
869- turnip: make debug_flags a global variable
870- freedreno: add has_implicit_modifier helper
871- freedreno: support UBWC scanout
872- turnip: add a comment to tu_format_for_aspect
873- turnip: move a comment about FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8
874- turnip: remove tu_native_format::tile_mode
875- turnip: make tu6_format_*_supported static
876- turnip: let tu6_format_vtx* take pipe format
877- turnip: add blit_format_texture
878- turnip: add blit_format_color and blit_base_format
879- turnip: handle ubwc in blit_base_format
880- turnip: reorder tu6_format_*
881- freedreno/registers: correct WFM bit in CP_REG_TEST
882- turnip: add a comment to tu_render_pass_cond_config
883- turnip: skip unnecessary CP_REG_TEST for cond load/store
884- freedreno/registers: document more bits of CP_REG_TEST
885- freedreno: avoid conditional ib in fd6_emit_tile
886- radv: fix a hang with binning on CHIP_RENOIR
887- turnip: fix a major leak with GPL LTO
888- turnip: fix a null descriptor set dereference
889- turnip: avoid FMT6_Z24_UNORM_S8_UINT_AS_R8G8B8A8 for event blits
890- radv: add a size check in radv_create_buffer for Android
891- util/log: refactor mesa_log
892- util/log: allow multiple loggers
893- util/log: improve logger_file newline handling
894- util/log: improve logger_android
895- util/log: add logger_syslog
896- util/log: add support for MESA_LOG_FILE
897- util/log: add logger_windbg
898- mesa: add missing newlines for _mesa_debug/_mesa_log callers
899- mesa: use mesa_log from output_if_debug
900- anv: process utrace payloads on queue submission
901- ci/radv: remove dEQP-VK.image.sample_texture.* fails/flakes
902- radv: set RADEON_FLAG_GTT_WC for external mem on vram
903- radv: rework radv_layout_fmask_compressed
904- radv: add RADV_FMASK_COMPRESSION_PARTIAL
905- radv: disable tc_compatible_cmask on GFX9 in some cases
906
907Christian Gmeiner (1):
908
909- etnaviv: nir: use lower_fround_even
910
911Collabora's Gfx CI Team (3):
912
913- Uprev Piglit to 60e7f0586bac0cfcfcb5871046e31ca2057a5117
914- Uprev Piglit to 2391a83d1639a7ab7bbea02853b922878687b0e5
915- Uprev Piglit to 355ad6bcb2cb3d9e030b7c6eef2b076b0dfb4d63
916
917Connor Abbott (12):
918
919- freedreno/a6xx: Rename CP_CSQ_IB*_STAT
920- freedreno/a6xx: Add CP_ROQ_*_STAT
921- freedreno/a6xx: Fix CP_ROQ_THRESHOLDS_1
922- freedreno/a6xx: Fill in ROQ status registers
923- freedreno/crashdec: Fix apparent off-by-one with ROQ size
924- freedreno/crashdec: Add prefetch test
925- tu: Fix tile_align_h on a650
926- freedreno: Fix or/and'ing two BitmaskEnums
927- tu: Use vk_pipeline_get_renderpass_flags()
928- vk/render_pass: Support VK_EXT_fragment_density_map
929- nir, spirv: Add support for VK_EXT_fragment_density_map
930- tu: Don't override depth for GMEM
931
932Constantine Shablya (12):
933
934- anv: handle ATTACHMENT_OPTIMAL layout
935- anv: use Vulkan runtime's robust buffer access
936- hasvk: use Vulkan runtime's robust buffer access
937- anv,hasvk: flush what UNIFORM_READ flushes on SHADER_READ
938- vulkan: relocate rmv to its correct home
939- vulkan: tidy up vk_physical_device_features
940- vulkan: delete trailing namespace
941- vulkan: add hepler for vkGetPhysicalDeviceFeatures2
942- vulkan: use vk_features for vk_device::enabled_features
943- anv: use vk_get_physical_device_features
944- vulkan: fix building with python3.8
945- vulkan: depend idep_vulkan_runtime_headers on vk_physical_device_features.h
946
947Corentin Noël (12):
948
949- ci/venus: Remove failure now passing
950- kopper: Do not free the given screen in initScreen implementation
951- ci: uprev virglrenderer
952- ci/venus: Skip tests risking out of memory issues
953- ci: uprev virglrenderer and crosvm
954- ci: Setup XDG_RUNTIME_DIR in crosvm-init
955- ci: Allow to use crosvm-runner before deqp-runner
956- ci: Uprev crosvm and virglrenderer
957- venus/ci: Only run one crosvm instance
958- mesa: OpenGL ES 3.0 requires EXT_instanced_arrays
959- glapi: Make EXT_draw_instanced functions available for GLES 2.0
960- mesa: Add EXT_instanced_arrays support
961
962Daniel Schürmann (82):
963
964- radv: CSE ray_launch_{size|id}
965- radv: rename shader_info->cs.uses_sbt -> shader_info->cs.is_rt_shader
966- radv: unconditionally enable scratch for RT shaders
967- radv/rt: introduce and set rt_pipeline->stack_size
968- radv/rt: use dynamic_callable_stack_base also for static stack_sizes
969- radv/rt: don't hash maxPipelineRayRecursionDepth
970- nir: add Continue Construct to nir_loop
971- nir: add assertions that loops don't have a Continue Construct
972- nir: create nir_push_continue() and related helpers
973- nir: add lowering for Loop Continue Constructs
974- spirv: use Loop Continue Construct to emit SPIR-V loops and lower after parsing
975- nir/lower_continue_constructs: special-case Continue Constructs with zero or one predecessors
976- nir/lower_continue_targets: only repair SSA when necessary
977- nir: simplify nir_block_cf_tree_{next|prev}
978- radv/rt: rename library_pipeline->groups to library_pipeline->group_infos
979- radv/rt: defer library_pipeline allocation
980- radv/rt: introduce struct radv_ray_tracing_module
981- radv/rt: move stack_sizes into radv_ray_tracing_module
982- radv/rt: only reserve stack_sizes after rt_case insertion
983- radv: expose radv_postprocess_nir()
984- radv: expose radv_pipeline_capture_shaders()
985- radv/rt: introduce and use radv_rt_pipeline_compile()
986- radv: remove unused parameters from radv_compute_pipeline_compile()
987- radv/rt: move radv_pipeline_key from rt_variables to traversal_data
988- nir/gather_info: allow terminate() in non-PS
989- aco: fix NIR infinite loops
990- radv/rt: use terminate() when returning from raygen shaders
991- aco/dominance: set immediate dominator for any BB without predecessors
992- aco/value_numbering: clear hashmap between disconnected CFGs
993- aco/dead_code_analysis: don't add artificial uses to p_startpgm
994- aco/insert_exec_mask: allow for disconnected CFG
995- aco/spill: allow for disconnected CFG
996- radv/rt: place any-hit scratch vars after intersection scratch vars
997- radv/rt: Fix any_hit scratch variables.
998- mesa: add gl_shader_stage_is_rt()
999- radv: add RT shader args
1000- radv: handle RT stages in radv_nir_shader_info_pass()
1001- radv: add RT stages to radv_get_shader_name()
1002- radv: add RT shader handling to radv_postprocess_config
1003- aco: add RT stage enums
1004- aco: don't set private_segment_buffer/scratch_offset on GFX9+
1005- aco: move rt_dynamic_callable_stack_base_amd to VGPR
1006- aco: implement load_ray_launch_{id|size}
1007- aco: create hw_init_scratch() function for p_init_scratch lowering
1008- aco: implement select_rt_prolog()
1009- radv: add radv_create_rt_prolog()
1010- radv: compile rt_prolog
1011- radv/rt: use prolog for raytracing shaders
1012- aco: remove aco::rt_stack variable
1013- radv: remove unused parameter from radv_open_rtld_binary()
1014- radv: separate radv_postprocess_binary_config() from radv_shader_create()
1015- radv: remove unnecessary copy of binary->config
1016- radv: inline radv_postprocess_config()
1017- radv: separate radv_capture_shader_executable_info() from radv_shader_create()
1018- radv: move gl_shader_stage from radv_binary to radv_shader_info
1019- radv: remove radv_create_gs_copy_shader()
1020- radv: refactor shader_compile()
1021- radv: skip pipeline caching with RADV_DEBUG=shaders
1022- radv: fix radv_shader_binary member fields to 32 bit.
1023- radv/rt: Fix VK_KHR_pipeline_executable_properties
1024- aco: split ps_epilog args before exporting them
1025- aco/ra: adjust_max_used_regs() for fixed Operands
1026- aco: don't use shared VGPRs for shaders consisting of multiple binaries
1027- radv: update PS num_vgprs in case of epilogs rather than overallocating VGPRs
1028- vulkan/pipeline_cache: remove vk_device from vk_pipeline_cache_object
1029- vulkan/pipeline_cache: Don't re-insert disk-cache hits into disk-cache
1030- vulkan/pipeline_cache: implement vk_pipeline_cache_create_and_insert_object()
1031- vulkan/pipeline_cache: use vk_pipeline_cache_create_and_insert_object() during vk_pipeline_cache_load()
1032- vulkan/pipeline_cache: add cache parameter to deserialize() function
1033- vulkan/pipeline_cache: move vk_log on failed deserialization to vk_pipeline_cache_load()
1034- radv: derive struct radv_shader from vk_pipeline_cache_object
1035- radv: unconditionally store the binary code in radv_shader
1036- radv: add radv_shader_serialize() and radv_shader_deserialize() functions
1037- radv: add struct radv_pipeline_cache_object
1038- radv: implement radv_shader_create_cached()
1039- radv: use vk_pipeline_cache
1040- radv: clean up pipeline-cache interface
1041- radv/ci: add 2 more Flakes for Navi21
1042- radv/rt: fix total stack size computation
1043- radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error
1044- vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects
1045- radv: add padding to radv_shader_binary_legacy
1046
1047Daniel Stone (18):
1048
1049- ci/fdno: Only run full tests on a limited subset of machines
1050- ci/radv: Skip vkCreateInstance memory-fail test
1051- ci/anv: Temporarily halve TGL testing load
1052- intel/isl: Don't scream FINISHME into logs for 3D vs. CCS
1053- ci/radv: Drop raven quick_shader load
1054- ci/fdno: Add a618 Vulkan flakes
1055- ci/zink: Add flake seen in the wild
1056- ci/radv: Lower stoney CTS load
1057- ci/android: Use a more aggressive timeout for the job
1058- ci: Actually run Piglit on LAVA
1059- ci: Disable Collabora LAVA farm
1060- Revert "ci: Disable Collabora LAVA farm"
1061- CI: Disable Windows runners
1062- CI: Disable mingw job
1063- ci/panfrost: Add texturesize flake seen in the wild
1064- CI: Disable freedreno
1065- ci/radeonsi: sort and dedup stoney skips
1066- ci/radeonsi: Skip really slow tests on stoney
1067
1068Danylo Piliaiev (52):
1069
1070- tu/kgsl: do not use kgsl_command_object::offset
1071- tu: Prevent using stale value of RB_UNKNOWN_88D0 on BLIT
1072- tu: Prevent using stale value of GRAS_SC_CNTL in sysmem clear
1073- freedreno: Document A6XX_GRAS_SC_CNTL::rotation field
1074- turnip: Ensure that there is no renderpass rotation in binning
1075- turnip: Disable draw states after dyn renderpass in all cases
1076- ir3: Consider dst type in ubo_vec4 to ldc lowering
1077- tu: Don't expose KHR_present_id,KHR_present_wait without KHR_swapchain
1078- turnip: Add debug option to find usage of stale reg values
1079- docs/freedreno: Add info about stale reg stomper dbg option
1080- ci/tu: Add 1/200 pass to test for stale reg usage
1081- ir3: Add cat5/cat7 cache related instructions
1082- ir3: Add cat7 sleep instruction
1083- freedreno/register: Define chip enum values
1084- util/perf: C++-proof util/perf
1085- util/format: Make format_table compatible with C++
1086- spirv: sort spirv_supported_capabilities
1087- vk/vk_extension_gen: Make table struct initializable in C++ on older gcc
1088- vk/wsi: C++-proof wsi_common_drm.h
1089- vk/util: remove (void \*) casts from vk_foreach_multi_draw macros
1090- vk/util: Generate defines to help casting structs with vk_find_struct
1091- freedreno/common: C++-proof freedreno_uuid.h
1092- ir3: C++-proofing
1093- tu: C++-proofing: fix offsetof with dynamic array index
1094- tu: C++-proofing: fix struct initializers
1095- tu: C++-proofing: various enum fixes
1096- tu: C++-proof: do not goto over variables initialization
1097- tu: C++-proofing: fix designator initializer order
1098- tu: C++-proofing: fix extension table initialization
1099- tu: C++-proofing: Initialize tu_reg_value in-order by pack funcs
1100- tu: C++-proofing: fix casting from void * fpermissive warnings
1101- tu: C++-proofing: ease access to global bo struct
1102- tu: C++-proofing: prevent taking address from rvalue
1103- tu: C++-proofing: cast result when extracting field from reg value
1104- tu: C++-proofing: misc fixes
1105- freedreno/msm: Rename drm_msm_gem_submit_reloc::or in C++ code
1106- tu: compile as C++
1107- vk/entry_points: Add option to generate template entrypoints
1108- freedreno/regs: Include assert.h in generated headers
1109- tu: Generate entrypoints for each gen
1110- turnip: add cached and cached-coherent memory types
1111- tu/drm: Support cached non-coherent memory
1112- freedreno/registers: Document new CP_EVENT_WRITE::SEQNO
1113- freedreno/registers: More a7xx regs
1114- freedreno/computerator: C++ proofing
1115- freedreno: C++ fixes for computerator to compile
1116- freedreno/computerator: Convert to C++
1117- freedreno: Move fd6_pack.h to common code accessible by computerator
1118- freedreno: Add dummy a730/a740 definition
1119- freedreno/computerator: Templatize a6xx backend
1120- freedreno/computerator: Add support for a7xx
1121- vulkan: Sanitize pSampleMask in CmdSetSampleMaskEXT
1122
1123Dave Airlie (37):
1124
1125- ci: bump vk cts to 1.3.3.1 + and a crash fix.
1126- vulkan/video: add common h264/h265 parameter set management code.
1127- vulkan/format: add a 10-bit video format
1128- radv: remove the status query mark it unsupported.
1129- radv: add new upload alloc aligned api
1130- ac: add name to codec info struct
1131- radv: adding video decode queue support
1132- radv: add video decoder register setup.
1133- radv/video: add initial frameworking.
1134- radv/video: add initial h264 decoder for VCN
1135- radv/video: add h264 support for uvd
1136- radv: add vcn h265 decode.
1137- radv/video: add h265 decode UVD support
1138- radv/vcn: enable dynamic dpb tier 2 for h264/h265 on navi21+
1139- anv: add video engine support in various places
1140- anv: set Y/4 tiling for video decode images
1141- anv: add video format features for the one supported video output format
1142- anv/format: handle video extensions structs by ignoring them
1143- intel/genxml: align some of the fields with the media driver
1144- intel/genxml: add missing power well control bits
1145- anv/image: allocate some memory for mv storage after video images.
1146- anv: add initial video decode support for h264.
1147- anv/query: add query status report
1148- anv: enable video decode extensions.
1149- anv/video: fix video memory bindings.
1150- crocus: disable Y tiling for render targets properly.
1151- crocus: switch gen4/5 tiling flags to follow suggestions.
1152- llvmpipe: fix compute address bits to return native pointer size.
1153- anv: always pick graphics queue to execute prime blits on.
1154- radv: add video format support to format probing.
1155- anv/video: fix chroma qp to be a integer value.
1156- anv/video: disable picture id reampping.
1157- anv: fix image height for field pictures.
1158- radv/video: fix h264 frame heights when field images are in use
1159- radv/video: fix used for reference flags.
1160- radv/video: fix h265 decoding sizes.
1161- radv/trace: don't attempt to emit trace on non-graphics/compute queues
1162
1163David (Ming Qiang) Wu (1):
1164
1165- radeonsi/vcn: add an exception of field case for h264 decoding
1166
1167David Heidelberg (73):
1168
1169- ci/zink: Penumbra is now fixed.
1170- freedreno/ci: Switch also performance a630 job to manual
1171- ci/anv: add multiple fails uncovered by change of sharding
1172- ci/intel: fully utilize asus-cx9400-volteer
1173- ci/piglit: explicitely define we want GLX tests
1174- ci: migrate from wget to curl
1175- ci/piglit: 2023-01-19 uprev
1176- ci: bump ci-fairy with session support (robust downloads)
1177- ci: Sir trace has small invisible change in rendering
1178- ci: bump Mold to the 1.10.0
1179- ci: uprev piglit (etag md5 checksumming support)
1180- ci/lavapipe: use dxvk for the traces
1181- ci: revert download of git cache to the wget
1182- ci/llvmpipe: add flake timeout for rusticl program@execute@[email protected]
1183- util/process_test: make the error variable static
1184- intel: enable -mfpmath=sse on x86
1185- intel: use c_see2_arg instead of explicit -msse2
1186- ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_dispatch
1187- meson: print c_cpp_args
1188- intel/vulkan: add missing dependency on generated headers
1189- ci/freedreno: add flaking KHR-GL45.buffer_storage.map_persistent_flush
1190- ci/alpine: keep the curl inside the image
1191- ci: alpine: install bash and coreutils for date -d
1192- ci: implement unified sections
1193- ci: make meson build and test uncollapsed
1194- ci: deqp-runner: drop already unused function
1195- ci: Retry, retry, retry... No one likes to trigger Marge more than once.
1196- ci/zink: add skip for the Single-GL46.enhanced_layouts.ssb_member_align_non_power_of_2
1197- ci/lavapipe: add recent occasional flake
1198- ci/freedreno: rare flake KHR-GL45.sample_variables.mask.rgba8i.samples_4.mask_3
1199- crocus/meson: add dependency on libintel_dev also for versioned static libraries
1200- ci/ci_run_n_monitor: while we usually disable many jobs, print them inline
1201- ci: do not exit when an error happens inside the section
1202- ci/lavapipe: fixes typo
1203- ci/zink: fixup the zink-lvp job
1204- ci: disable mesa-swrast runner jobs
1205- ci/lava: implement the priority
1206- ci/weston: before testing, verify that XWayland is really running
1207- ci/weston: add background PID
1208- ci: add and utilize dalboz devices
1209- ci/amd: move skqp and va jobs on raven from XOrg to the XWayland
1210- ci/panfrost: correct the job name, as it runs on gles2
1211- ci/lava: every LAVA job doesn't want to run gles2 deqp, drop it
1212- ci: build Wayland support for the amd64
1213- ci/iris: update apl and glk expectations, after enabling Wayland support
1214- ci/clover: disable the jobs
1215- ci/traces: disable nheko trace with zink since it flakes
1216- ci/freedreno: add recent occasional flakes
1217- ci/traces: add two skips due to flakes
1218- ci/intel: add dEQP-EGL.functional.wide_color.window_fp16_default_colorspace flake
1219- ci: distribute XDG_RUNTIME_DIR with setup-test-env script
1220- ci: disable weston session timeout for llvmpipe
1221- meson: implement quirk for the compilation under armv7 GCC with LTO
1222- aco: drop leftover variable
1223- ci: bump Alpine to 3.17 (again)
1224- ci/freedreno: do not build tools executables without explicitly enabling them
1225- freedreno/decode: fix possible overflow
1226- ci: rename .lava-test to .lava-test-deqp to describe it correctly
1227- ci: create lava-test without deqp HWCI_TEST_SCRIPT
1228- ci: remove deqp from lava piglit and traces runs
1229- ci/freedreno: split deqp from other jobs
1230- ci/freedreno: define Google farm specific includes
1231- ci/freedreno: Make traces work on LAVA caching proxy
1232- ci/broadcom: test occasionally fails, but typically passes
1233- ci: disable lima farm, currently out-of-space, needs to be fixed
1234- ci: implement sections for cuttlefish
1235- ci/v3d: add flaking spec@ext_framebuffer_blit@fbo-sys-blit
1236- Revert "mesa: Enable NV_texture_barrier in GLES2+"
1237- ci/amd: update device status
1238- ci/amd: raven is currently downgraded to 2 machines only, adapt
1239- ci/amd: add draw.dynamic_rendering flake
1240- ci/freedreno: fix the a530_piglit job and switch to Weston
1241- panvk: clear dangling pointers
1242
1243David Redondo (1):
1244
1245- egl/wayland: fix oob buffer access during buffer_fds clean up
1246
1247David Rosca (2):
1248
1249- frontents/va: Use PIPE_USAGE_STAGING for coded buffer
1250- frontends/va: Map VAEncCodedBufferType buffer as PIPE_MAP_READ
1251
1252Dmitry Baryshkov (2):
1253
1254- freedreno/a5xx: reorder GPMU registers
1255- freedreno/a5xx: add SP clock control register
1256
1257Dmitry Osipenko (6):
1258
1259- util/cache_test: Unset env vars left after Cache.List test
1260- util/mesa-db: Don't account header size
1261- util/mesa-db: Support removal of cache entries
1262- util/cache_test: Remove dummy cache entry added by cache_exists()
1263- util/mesa-db: Introduce multipart mesa-db cache
1264- util/disk_cache: Switch to multipart mesa-db cache
1265
1266Dylan Baker (38):
1267
1268- VERSION: bump to 23.1.0-devel for further development
1269- docs: reset new_features.txt
1270- meson: bump minimum required version to meson 0.59
1271- meson: replace has_exe_wrapper with can_run_host_binaries
1272- meson: replace uses of ExternalProgram.path with .full_path
1273- meson: drop meson < 0.54 workaround
1274- meson: use a feature option for dri3
1275- meson: use a feature option for gallium-vdpau
1276- meson: use a feature option for gallium-va
1277- meson: use a feature option for gallium-xa
1278- meson: use a feature option for shader_cache
1279- meson: use a feature option for shared-glapi
1280- meson: use a feature option for gles1
1281- meson: use a feature option for gles2
1282- meson: use a feature option for gbm
1283- meson: use a feature option for llvm
1284- meson: use a feature option for valgrind
1285- meson: use a feature option for libunwind
1286- meson: use a feature option for lmsensors
1287- meson: use a feature option for power8
1288- meson: use a feature option for xlib-lease
1289- meson: use a feature option for zstd
1290- meson: use a feature option for egl
1291- meson: use a feature option for shared-llvm
1292- meson: Use feature option methods for xmlconfig
1293- meson: remove version checks for < 0.59
1294- meson: use builtin support for reading version from a file
1295- meson: use [] instead of 'lib for !windows name_prefix
1296- meson: use the same workaround for setting 'lib' on windows
1297- meson: combine checks for linker --gc-sections support
1298- util: rzalloc and free hash_table_u64
1299- iris: consider bufmgr creation to have failed if \`dup`ing of the fd fails
1300- intel/mi: use 64bit constant for bitshift
1301- intel/dev: create a helper dependency for libintel_dev
1302- docs: Add calendar entries for 23.0 release.
1303- docs: add release notes for 23.0.0
1304- docs: Add sha256 sum for 23.0.0
1305- docs/relnotes: add 23.0.0 to relnotes.rst
1306
1307Ella Stanforth (1):
1308
1309- v3dv: add support for multi-planar formats, enable YCbCr
1310
1311Emma Anholt (211):
1312
1313- dri2: Fix exposing robustness with swkms.
1314- ci/llvmpipe: Drop dEQP-EGL.functional.sharing.*.link.7 flakes.
1315- ci/iris: Add known flakes for skqp.
1316- ci/iris: Generalize the 8888_pbuffer EGL known flakes and share with GLK.
1317- ci/zink: Add more blit conversion xfails for a618.
1318- freedreno: Skip CPU/GPU timestamp sync when not supported.
1319- ci/freedreno: Add glx-swap-event-async as a flake.
1320- freedreno/pps: Fix a signed/unsigned complaint.
1321- ci: Enable building the testing drivers with perfetto.
1322- ci: Add some new folks to the restricted-traces access list.
1323- Revert "nouveau/ci: temporary disable gk20a-gles"
1324- ci/virgl: Disable iris traces for now while it's unstable.
1325- ci: Drop windowoverlap xfails, since it's always skipped.
1326- ci/zink: Drop xfail for copy-sub-buffer.
1327- ci/zink: Drop glx-swap-copy xfails.
1328- ci/zink: Clear issue #7781 flakes.
1329- ci/freedreno: Switch the piglit job to using a deqp-runner suite.
1330- ci: Move PIGLIT_PLATFORM settings out of the .tomls.
1331- ci/piglit: Add some common piglit skips for Mesa CI's testing of glx.
1332- ci/piglit: Exclude swapbuffers front-readback tests with PIGLIT_PLATFORM=gbm.
1333- zink: Fatal error if requesting validation and we fail to load the layer.
1334- zink: Add missing Flat decorations on some inputs.
1335- zink: Fix validation failure for maxLod < minLod.
1336- zink: Fix up mismatches of memory model vs addressing model.
1337- zink: Re-emit the SpvBuiltInSampleMask access chain each load.
1338- ci/zink: Add coverage using the vulkan validation layer on lvp.
1339- ci/zink: Update TGL full-run xfails.
1340- ci/zink: Update radv xfails for the recent shadow fixes.
1341- ci/freedreno: Mark max-texture-size as a flake.
1342- ci: Move the performance jobs' allow_failure:true to the gl rules.
1343- ci: Add manual rules variations to disable irrelevant driver jobs.
1344- freedreno: Don't sync timestamps while perfetto isn't running.
1345- ci/zink: Disable Amnesia trace until the linked issue gets fixed.
1346- ci/zink: Move the zink-anv-tgl manual full run to custom manual deps.
1347- ci: Run our manual jobs during the nightly scheduled run.
1348- ci: Fix perf jobs blocking Marge pipelines.
1349- ci: Fix perf job condition.
1350- ci: Drop the itoral-gl-terrain demo from traces.
1351- tu: Mark tiling impossible if we couldn't lay out gmem in the first place.
1352- turnip: Optimize tile sizes to reduce the number of bins.
1353- tu: Only emit the conditional gmem subpass resolves when gmem is possible.
1354- turnip: Make the tiling-impossible case have an impossible tile layout.
1355- gallivm: Optimize emit_read_invocation's first-invocation loop.
1356- gallivm: Refactor out a shared "get the first active invocation" loop.
1357- gallivm: Return 0 first_active_invocation when we know that up front.
1358- gallivm: Use cttz instead of a loop for first_active_invocation().
1359- gallivm: Use first active invocation in some image/ssbo accesses.
1360- ci/lvp: Drop the subgroupbroadcast skips.
1361- llvmpipe: Enable LP_DEBUG on normal builds.
1362- gallivm: Enable GALLIVM_DEBUG (mostly) on non-DEBUG builds.
1363- gallivm: Fix the type of array nir_registers.
1364- gallivm: Fix codegen performance for constant-index register array stores.
1365- gallivm: Do the same codegen improvement for constant-index array loads.
1366- ci/swrast: Drop skips for tests whose perf had been fixed.
1367- ci/llvmpipe: Drop skip of InteractionFunctionCalls2.
1368- ci/freedreno: Don't forget to report flakes on a618, too.
1369- u_trace: Add an interface for checking trace enablement outside a context.
1370- zink: Add tracing of blit operations.
1371- ci: Disable systems in my farm that haven't recovered.
1372- ci/zink: Update TGL full-run xfails.
1373- ci/freedreno: Disable the a306_piglit_gl job.
1374- ci/freedreno: Update a530 manual-run xfails.
1375- ci/freedreno: Add an xfail for a618 VK full run.
1376- ci/freedreno: Update a3xx piglit_shader xfails.
1377- ci/nouveau: Disable the gm20b jobs entirely.
1378- ci/radv: Update navi21 llvm xfails.
1379- ci/crocus: Update HSW expectations.
1380- ci/freedreno: Update manual-run xfails for a530.
1381- Revert "freedreno/a5xx: Fix clip_mask"
1382- ci/radv: Add a skip for navi21-llvm for a test that consistently timeouts.
1383- ci/etnaviv: Drop stale xfails from gc7000.
1384- ci/etnaviv: Update deqp xfails for gc2000.
1385- egl/kopper: Add assert for no kopper in dri2_copy_region.
1386- egl: Add a note explaining the swapBuffers badness in dri2_x11_copy_buffers().
1387- egl/kopper: Use the kopper private interface for swapBuffers.
1388- egl/kopper: Pass ancillary invalidate flush flags down to gallium.
1389- ci: Add a manual full and 1/10th hasvk CTS runs.
1390- hasvk: Silence conformance warning in CI.
1391- hasvk: Fix SPIR-V warning about TF unsupported on gen7.
1392- anv: Fix gfx8/9 VB range > 32bits workaround detection.
1393- hasvk: Fix gfx8/9 VB range > 32bits workaround detection.
1394- glsl: Drop the (v.x + v.y + v.z + v.w) -> dot(v, 1.0) optimization.
1395- ci/etnaviv: Drop one more gc7000 xfail.
1396- ci/freedreno: Drop a530 piglit_gl coverage.
1397- ci/turnip: Drop the #8219 xfail.
1398- ci/zink+turnip: Disable flaky minetest trace.
1399- ci/hasvk: Add a synchronization flake.
1400- ci: Fix stage of etnaviv manual runs.
1401- ci/zink: Add a glx flake on anv
1402- ci/crocus: Add new tess xfails and a link to the regression bug report.
1403- ci/crocus: Mark unvanquished as flaky.
1404- anv: Skip the RT flush when doing depth-only rendering.
1405- anv: Skip BTI RT flush if we're doing an op that doesn't use render targets.
1406- glsl/opt_algebraic: Drop ~~x == x transformation.
1407- glsl/opt_algebraic: Drop log(exp(x)) -> x and exp(log(x)) -> x optimisations.
1408- glsl/opt_algebraic: Drop pow-recognizer.
1409- glsl/opt_algebraic: Drop abs(-x) -> abs(x) and abs(abs(x)) -> abs(x).
1410- glsl/opt_algebraic: Drop -(-x) -> x optimization.
1411- glsl/opt_algebraic: Drop f2i(trunc(x)) -> f2i(x) optimization.
1412- glsl/opt_algebraic: drop fsat(fadd(b2f(x), b2f(y))) -> b2f(ior(x, y)) opt.
1413- glsl/opt_algebraic: Drop shifts of 0 optimizations.
1414- glsl/opt_algebraic: Drop pow optimizations.
1415- glsl/opt_algebraic: Drop rcp optimizations.
1416- glsl/opt_algebraic: Drop and/or/xor optimizations.
1417- glsl/opt_algebraic: Drop fdiv(1,x) -> frcp(x) and fdiv(x,1) -> x optimizations.
1418- glsl/opt_algebraic: Drop add/sub with 0 optimizations.
1419- glsl/opt_algebraic: Drop x + -x -> 0 optimization.
1420- glsl/opt_algebraic: Drop csel(true/false, x, y) optimization.
1421- nir: Add optimization for fdot(x, 0) -> 0.
1422- glsl/opt_algebraic: Drop fdot 0-channel optimizations.
1423- glsl/opt_algebraic: Drop scalar all_eq/any_neq -> eq/neq opt.
1424- glsl/opt_algebraic: Drop the eq/neq add-removal optimization.
1425- glsl/opt_algebraic: Drop no-op pack/unpack optimization.
1426- glsl/opt_algebraic: Drop the flrp/ffma simplifiers.
1427- glsl/opt_algebraic: Drop some fmul simplifications.
1428- nir: Port a floor->truncate algebraic opt pattern from GLSL.
1429- glsl/opt_algebraic: Drop the ftrunc pattern recognizer.
1430- glsl/opt_algebraic: Drop the flrp recognizer.
1431- glsl: Remove unused as_rvalue_to_saturate().
1432- ci: Update traces expectations for gutting glsl opt_algebraic.
1433- panfrost/midgard: Fix handling of csel with a vector constant condition.
1434- panfrost/midgard: Drop redundant arg to emit_explicit_constant.
1435- glsl: Move lower_vector_insert to GLSL-to-NIR.
1436- nir/split_64bit_vec3_and_vec4: Handle 64-bit matrix types.
1437- gallivm: Return 0 for first active invocation when no invocations are active.
1438- gallivm: Use first_active_invocation for ubo/kernel memory loads.
1439- gallivm: Use first_active_invocation for scalar SSBO loads.
1440- gallivm: Add some notes about other invocation_0_must_be_active usages.
1441- ci: Add some xfail updates from VKCTS 1.3.5.0 for the manual jobs.
1442- ci/etnaviv: Drop the dEQP-GLES2.functional.uniform_api.random.94 xfail.
1443- anv+hasvk: Use driconf to disable 16-bit for zink.
1444- zink: Pass the cmdbuf to the end of the marker, too.
1445- Revert "ci: disable mesa-swrast runner jobs"
1446- ci: Re-enable some swrast testing using fd.o's shared runners for now.
1447- glsl/nir: Include early glsl-to-nir output in NIR_DEBUG=print.
1448- glsl_to_nir: Use a variable's constant_value if it wasn't const-propped out.
1449- glsl: Delete constant propagation pass.
1450- glsl: Delete constant folding pass.
1451- glsl: Delete constant-variables pass.
1452- ci: Update trace expectations for GLSL constant prop removal.
1453- ci/zink: Update TGL xfails/flakes based on the last nightly pipelines.
1454- ci/turnip: Extend a630 vk full timeout to 3 hours.
1455- ci/iris: Add skips for slow tests on APL.
1456- turnip: Don't push inline uniform buffer contents outside constlen.
1457- ci/turnip: Clear out stale xfails.
1458- ci/turnip: Disable dEQP-VK.image.queue_transfer.* for now.
1459- ci/turnip: Move some more of the 1.3.5 new xfails under links.
1460- glsl: Simplify vector constructors from scalars.
1461- glsl/lower_precision: Add a unit test that I thought we might fail at.
1462- glsl/lower_precision: Add a cut-down testcase for #8124
1463- glsl: Set the precisions of builtin function arguments and returns.
1464- glsl: Handle highp promotion of builtin function args in the builtins.
1465- glsl: Set the precision of function return value temporaries.
1466- glsl/lower_precision: Drop most special-casing of builtin arg precision.
1467- glsl: Fix the precision of atomic counter builtin function args.
1468- glsl/lower_precision: Add actual spec quotes for "check_parameters"
1469- nir/lower_mediump: Fix assertion about copy_deref lowering matching.
1470- ci/iris: Update more manual job xfails from the Wayland build change.
1471- ci/crocus: Update expectations from VK CTS 1.3.5.0.
1472- ci/hasvk: Update some xfails from the 8-sample fast clear disable.
1473- ci/etnaviv: Get the gc2000_piglit manual job mostly working.
1474- glsl/standalone: Pull program create/destroy out to a public function.
1475- glsl/standalone: Pull out a helper function for adding GLSL source shaders.
1476- glsl/standalone: Make all standalone contexts have NewProgram set.
1477- glsl: Write a new test for GLSL and NIR mediump lowering.
1478- ci/crocus: Fix 1.3.5.0 xfails.
1479- ci/etnaviv: Polish the gc2000 xfails a bit.
1480- ci/zink: Update the tgl manual run xfails.
1481- gallivm: Skip loads/stores that are definitely outside of compact vars.
1482- nir/lower_sysvals: Add support for un-lowered tess_level_inner/outer.
1483- nir_to_tgsi: Handle stores to compact outputs.
1484- glsl: Delete the lower_tess_level pass.
1485- glsl: Remove the TessLevel lowering special case from xfb.
1486- glsl: Drop dead prototype.
1487- ci/freedreno: Flake KHR-GL45.shader_image_load_store.basic-allTargets-store
1488- ci/broadcom: Skip another texelfetch case.
1489- perfetto: Add a .clang-format for the directory.
1490- intel/perfetto: Drop unused "pipelined" field.
1491- perfetto: Make a MesaRenderpassDataSource with common setup/start/stop.
1492- perfetto: Deduplicate clock sync packet emit from renderstage sources.
1493- perfetto: Move intel's cmdbuf/queue annotation code to the shared util.
1494- ci/zink: Drop validation exception for leaks at device destroy.
1495- ci/zink: Disable godot-tps-gles3 on a630.
1496- docs: Update Vulkan renderpass docs for !22191
1497- ci: Add missing dependency on doxygen sources for docs-generation jobs.
1498- docs: Claim less functionality for glsl_compiler.
1499- glsl: Move ForceGLSLAbsSqrt handling to glsl-to-nir.
1500- zink: Add mapping for nir_op_ldexp, but disable it for 64-bit's sake.
1501- glsl: Retire ldexp lowering in favor of the nir lowering flag.
1502- glsl/softfp64: GC the temp vars after we lower them to SSA.
1503- glsl/softfp64: Add fisfinite lowering.
1504- state_tracker: Lower frexp before lowering doubles.
1505- intel: Always call nir_lower_frexp.
1506- ir3: Move turnip's nir_lower_frexp to the shared compiler.
1507- nouveau: Add missing nir_opt_algebraic_late.
1508- nouveau: Enable frexp lowering in the backend.
1509- zink: Enable nir_lower_frexp.
1510- v3d: Lower frexp in the GL compiler like we do in Vulkan.
1511- agx: Enable nir_lower_frexp.
1512- panfrost/midgard: Enable nir_lower_frexp.
1513- nir_to_tgsi: Always lower frexp_exp/sig.
1514- glsl: Drop frontend lowering of 32-bit frexp.
1515- glsl: Drop PIPE_SHADER_CAP_DFRACEXP_DLDEXP_SUPPORTED.
1516- tgsi: Drop TGSI_OPCODE_DFRACEXP.
1517- ci/zink: Disable a630 portal-2-v2 due to kernel OOMs.
1518- etnaviv: Fix regression from if_uses change.
1519- blob: Don't valgrind assert for defined memory if we aren't writing.
1520- util/log: Fix log messages over 1024 characters.
1521- vulkan: Handle alignment failure in the pipeline cache.
1522- vulkan: Actually increment the count of objects in GetPipelineCacheData.
1523- ci/radeonsi: Mark glx-make-current as flaky.
1524
1525EmperorPenguin18 (1):
1526
1527- v3d: expose more drm formats with SAND128 modifier
1528
1529Eric Engestrom (172):
1530
1531- bin/ci: add gitlab_gql.py.cache to the .gitignore
1532- mesa/st: drop unused param
1533- ci/bare-metal: add more timestamps to help debugging issues
1534- ci: be explicit about the \`meson setup` subcommand
1535- docs: add release notes for 22.3.4
1536- docs/relnotes: add sha256sum for 22.3.4
1537- docs: update calendar for 22.3.4
1538- meson: turn android-libbacktrace into a feature option
1539- v3dv: mark dEQP-VK.api.command_buffers.record_many_draws_secondary_2 as flaky
1540- ci/android: move common config to common job
1541- ci/android: move virgl-specific gpu_mode to virgl-defined variables
1542- ci/android: move virgl-specific fails/flakes/skips lists to virgl-defined variables
1543- ci/android: move virgl-specific deqp suite to virgl-defined variables
1544- ci/android: move virgl-specific so lib name to virgl-defined variables
1545- ci/android: add missing line terminator at the end of the file
1546- docs: add release notes for 22.3.5
1547- docs: update calendar for 22.3.5
1548- panfrost: drop no-longer-needed libglsl
1549- gallium/u_screen.h: add missing stdint.h include
1550- util: avoid calling kcmp on Android
1551- etnaviv: use simple_mtx to avoid breaking windows in the next commit
1552- gallium: move etnaviv screen_lookup_or_create function to common code
1553- freedreno: replace custom code with u_pipe_screen_lookup_or_create()
1554- lima: replace custom code with u_pipe_screen_lookup_or_create()
1555- v3d: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
1556- vc4: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
1557- panfrost: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
1558- asahi: use u_pipe_screen_lookup_or_create() to keep track of and reuse screens
1559- u_pipe_screen_lookup_or_create: avoid re-querying the fd to have a consistent hash key
1560- broadcom/ci: mark test as flaky
1561- vk/util: keep track of extension requirements
1562- vk/runtime: keep track of supported instance extensions
1563- vk/runtime: turn vk.xml extension requirements into asserts
1564- meson: move float64_glsl_file one meson.build up
1565- meson: only build mapi when needed
1566- meson: only build the loader when needed
1567- meson: only build libglsl_util when needed
1568- meson: only build glsl when needed
1569- meson: drop \`TODO: opengl`, it's done
1570- ci: simplify adding & removing deqp patches
1571- ci: remove no-op sed
1572- ci: fix grouping of image tags
1573- ci: bump tags of deqp images
1574- docs: add 23.1 branchpoint & rc dates
1575- meson: make GLX require OpenGL
1576- meson/windows: only build libgl-gdi for desktop gl
1577- meson: allow building GLES without GL
1578- mesa: add _mesa_is_desktop_gl_compat() and _mesa_is_desktop_gl_core() helpers
1579- mesa: make use of the new _mesa_is_desktop_gl_compat() helper
1580- mesa: make use of the new _mesa_is_desktop_gl_core() helper
1581- mesa: make more use of the existing _mesa_is_gles* helpers
1582- mesa: add & use new _mesa_is_gles1() & _mesa_is_gles2() helpers
1583- mesa: make more use of the new _mesa_is_gles1() helper
1584- mesa: make more use of the new _mesa_is_gles2() helper
1585- mesa: optimize out _mesa_is_desktop_gl*() and _mesa_is_gles*() calls when not built
1586- ci: stop watching for changes in removed script
1587- meson: improve formatting of options file
1588- broadcom/ci: refactor a bit
1589- broadcom/ci: fold .vc4-rpi3-piglit:armhf into its only user
1590- broadcom/ci: use deqp-runner to run piglit tests
1591- docs/release-calendar: drop the last 22.2.x, it won't happen
1592- broadcom/ci: group x11 and wayland variant of the same test failing
1593- broadcom/ci: use weston's xwayland instead of starting X as well
1594- broadcom/ci: add x11- prefix to x11 EGL tests
1595- broadcom/ci: drop create_pixmap_surface from the fails; it passes now
1596- broadcom/ci: skip buffer_age.no_preserve and swap_buffers_with_damage on wayland
1597- broadcom/ci: add two known failures
1598- broadcom/ci: re-enable egl on wayland
1599- docs: include explicit \`setup` in instructions
1600- docs: add release notes for 22.3.6
1601- docs/relnotes: add sha256sum for 22.3.6
1602- docs: update calendar for 22.3.6
1603- v3d: update supertuxkart reference after 1c028a4d5b623e73bdf5
1604- docs: mention the meson summary
1605- docs: mention \`meson configure` and drop broken workaround script
1606- meson: reuse vulkan_wsi_list for defining vk_wsi_args
1607- meson: replace vk_wsi_args with dependencies to let meson take care of transitivity
1608- egl: include directly the useful vulkan header, instead of including everything
1609- glx: include directly the useful vulkan header, instead of including everything
1610- gbm: drop unnecessary vulkan dependency
1611- radv: split linker script for android since it requires different symbols
1612- glsl: align definition of _mesa_problem with the one in main/error.h
1613- glapi/meson: drop duplicate line in deps
1614- meson: allow checking for null pointers even if they're supposed to be non-null
1615- panfrost/ci: add EGL tests
1616- asahi/winsys: add .clang-format
1617- vk: move radv's linker symbols scripts for use in all drivers
1618- v3dv: add linker script to fix android symbols
1619- tu: add linker script to fix android symbols
1620- anv: add linker script to fix android symbols
1621- vn: add linker script to fix android symbols
1622- android/vk: drop unnecessary symbols
1623- vk: be stricter about symbols check between android and other platforms
1624- v3d/ci: add dEQP-GLES3.functional.texture.specification.teximage2d_pbo.*_cube flakes
1625- osmesa: add exported symbols check
1626- docs: add release notes for 22.3.7
1627- docs/relnotes: add sha256sum for 22.3.7
1628- docs: update calendar for 22.3.7
1629- v3dv/ci: add a test to the known failures
1630- meson: bump minimum version to 0.60
1631- meson: allow feature options to take true/false to mean enabled/disabled
1632- meson: inline gtest_test_protocol now that it's always 'gtest'
1633- v3dv: split out broadcom_shader_stage_to_gl() calls to improve readability
1634- ci: take valve farm offline
1635- ci: disable weston session timeout
1636- broadcom/ci: no need to skip the tests that swap buffers anymore
1637- ci/broadcom: move rare failure to the flakes
1638- ci: drop redundant .no_scheduled_pipelines-rules + .core-rules since the latter already includes it
1639- ci/rustfmt: simplify getting all the rust files
1640- ci/rustfmt: print which files are checked
1641- ci: group RESULT logic in a single place
1642- v3dv/ci: fix test name (\`,Fail` is not part of the test name)
1643- asahi: replace copies of .clang-format with symlinks
1644- asahi: fix a few typos
1645- v3d: fix \`dirty` bitset being too small to accept V3D_DIRTY_SSBO
1646- v3dv: use common GetPhysicalDeviceFeatures
1647- v3dv: reorder features as 1.0, 1.1, 1.2, 1.3
1648- v3dv: use vk_get_physical_device_features
1649- v3d/ci: add another depthstencil-default_fb-drawpixels-* to the flakes
1650- v3d/ci: group dEQP-GLES3.functional.texture.specification.teximage2d_pbo.* flakes and add another one
1651- ci: centralize detection of ccache in link-werror wrapper
1652- ci: add linker wrapper for clang
1653- ci: always use the -Werror wrapper
1654- ci: deduplicate compiler wrappers
1655- ci/docs: start documenting ci_run_n_monitor.py
1656- v3d: add link to issue investigating failure
1657- asahi: change create_renderonly signature to uniformize it
1658- etnaviv: change create_renderonly signature to uniformize it
1659- freedreno: change create_renderonly signature to uniformize it
1660- lima: change create_renderonly signature to uniformize it
1661- panfrost: change create_renderonly signature to uniformize it
1662- v3d: change create_renderonly signature to uniformize it
1663- vc4: change create_renderonly signature to uniformize it
1664- kmsro: uniformize renderonly creation
1665- kmsro: sort drivers alphabetically
1666- ci/broadcom: consolidate vc4-rpi3* jobs into a single vc4-rpi3-gl:armhf
1667- ci/broadcom: consolidate v3d-rpi4* jobs into a single v3d-rpi4-gl:armhf
1668- ci/broadcom: slightly increase coverage of vk tests
1669- vc4/ci: add arm64 failure to flakes as it works on armhf
1670- broadcom/ci: run gl jobs on arm64, just like vk
1671- vc4/ci: add another sync flake
1672- panfrost: assign the correct create_for_resource from the start
1673- Revert "broadcom/ci: run gl jobs on arm64, just like vk"
1674- v3dv/ci: mark known dEQP-VK.wsi.xlib.surface.query_formats failure
1675- ci/rustfmt: make sure to only check each file once
1676- v3d: disable GL_NV_conditional_render
1677- VERSION: bump for 23.1.0-rc1
1678- .pick_status.json: Update to 8ebc5cbe2b828f34b9bfb32c528d3514ead59798
1679- v3dv/ci: drop fixed failure from fails.txt
1680- .pick_status.json: Update to 0d7912d239dac5bf3c8b07f2a6ca467f760d6aa6
1681- .pick_status.json: Update to 543b6ca7c4b00c4bfff5668ba0a0643d565db201
1682- amd: fix buggy usage of unreachable()
1683- compiler: fix buggy usage of unreachable()
1684- pvr: fix buggy usage of unreachable()
1685- vk/util: fix buggy usage of unreachable()
1686- v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit
1687- VERSION: bump for 23.1.0-rc2
1688- .pick_status.json: Update to 3017d01c9ded9c9fd097b600081b1bbe86e90fb8
1689- .pick_status.json: Update to a18a51a708a86f51e0a5ab031b379f65bc84fb49
1690- .pick_status.json: Update to c060b649c5a866f42e5df73f41c6e2809cf30e99
1691- ci: rework vulkan validation layer build script
1692- .pick_status.json: Update to 3f14fd8578549e34db2f564396f300819b2ff10f
1693- VERSION: bump for 23.1.0-rc3
1694- .pick_status.json: Update to 040aeb5a23e5cc8a71a352e55282d514dd2ab64f
1695- .pick_status.json: Update to 9f522ac0c65ceae11ad1a4e84ec9f32a9393a25c
1696- .pick_status.json: Update to efc94390f716b70ac1d5b09c6f949f938aeadcac
1697- VERSION: bump for 23.1.0-rc4
1698- .pick_status.json: Update to 6d84b34359dcbad477209adb9f9d0592c5a71bb9
1699- .pick_status.json: Update to cb4e4fc5de48886758a26ff19d322947b5abfcec
1700- dzn: fix pointer type mismatch
1701- .pick_status.json: Update to 57afa7c0b12d6d0c9013368853080dfea5b50d07
1702- .pick_status.json: Update to 31e6d15801a9904089aa2913c8eb5a31b79c7dfc
1703
1704Erico Nunes (5):
1705
1706- lima/ci: Add more piglit unsupported tests to skip
1707- Revert "CI: Lima farm is offline"
1708- lima: don't use resource_from_handle while creating scanout
1709- lima/ci: restore swap buffers egl tests
1710- Revert "ci: disable lima farm, currently out-of-space, needs to be fixed"
1711
1712Erik Faye-Lund (54):
1713
1714- zink: whitespace fixup
1715- zink: fix depth-clip disable cap
1716- zink: remove depth_clip_control_missing workaround
1717- radeonsi: respect smoothing_enabled
1718- meson: remove dupliace add_devenv call
1719- meson: remove deprecated osmesa-bits option
1720- meson: remove deprecated dri-drivers option
1721- meson: avoid using deprecated build_root() method
1722- meson: use files() instead of joining paths
1723- freedreno/meson: simplify script-path logic
1724- meson: do not reconstruct ICD paths
1725- anv, hasvk: remove stale TODO-files
1726- zink: correct companies in requirements
1727- zink: remove incorrect trailing comma
1728- meson: remove unused USE_FOO_ASM defines
1729- vulkan: prefer vulkan_core.h over vulkan.h
1730- meson: don't pass vk wsi args where they don't belong
1731- Revert "meson: Fix Asahi build on macOS"
1732- zink: prefer vulkan_core.h over vulkan.h
1733- zink: get rid of needless dependency
1734- ci: correct typo in name of linkcheck job
1735- docs: update link to intel optimization reference manual
1736- nir: add a print_internal debug-flag
1737- docs: implement new vk-feat role
1738- docs/zink: use vk-feat role for features
1739- docs/zink: remove some trailing spaces
1740- docs/zink: fixup wording of the GL 4.6 requirements
1741- meson: correct typo in comment
1742- ci: move docs-stuff out of root .gitlab-ci.yml
1743- docs: fixup broken envvar-role syntax
1744- docs: escape a few more strings
1745- docs: fixup broken indentation
1746- docs/zink: mention vk1.2 mirror-clamp feature option
1747- docs/zink: clean up requirements-language
1748- docs: move developers article to main website
1749- docs: remove old thanks-article
1750- docs: prefer http-links over ftp
1751- docs/freedreno: fix turnip-heading level
1752- docs: drop reference to modindex
1753- docs: move old relnotes to _extra directory
1754- docs: use version-number as toctree-title for relnotes
1755- zink: emit terminate for spir-v 1.6
1756- zink: use demote from spir-v 1.6 when possible
1757- zink: use spir-v 1.6 local-size when needed
1758- zink: enable spir-v 1.6 for vulkan 1.3
1759- docs: format code-block as ini
1760- docs: format code-block as toml
1761- docs: make code-block indents consistent
1762- ci: move virgl-rules after intel-rules
1763- virgl/ci: clean up manual rules for virgl
1764- ci: remove unused rules
1765- zink: do not use sampled-image for buffers
1766- nir: fix constant-folding of 64-bit fpow
1767- llvmpipe: fixup refactor copypasta
1768
1769Faith Ekstrand (99):
1770
1771- nir: Add more opcodes to nir_tex_instr_is_query()
1772- nir/builder: Add some texture helpers
1773- radv: Use the new NIR builder tex helpers for meta
1774- anv: Refactor Android externalFormat handling in CreateYcbcrConversion
1775- anv/android: Use VkFormat for externalFormat
1776- util/format: YUYV and UYVY have 4 8-bit channels
1777- vulkan/formats: Add YCbCr format information
1778- vulkan: Add a common vk_ycbcr_conversion struct
1779- anv: Use the common vk_ycbcr_conversion object
1780- anv: Use the YCbCr format info from common code
1781- nir: Add copyright and include guards to nir_vulkan.h
1782- anv,nir: Move the ANV YCbCr lowering pass to common code
1783- gallium,util: Pull u_indices and u_primconvert back into gallium
1784- mailmap: Remap e-mail addresses for Faith Ekstrand
1785- vtn: Set alignment on initial UBO/SSBO casts
1786- anv: Let spirv_to_nir() set UBO/SSBO base cast alignments
1787- hasvk: Let spirv_to_nir() set UBO/SSBO base cast alignments
1788- intel/compiler: Document wm_prog_key::persample_interp
1789- intel/nir: Lower barycentrics to per-sample in a dedicated pass
1790- nir: Remove nir_lower_io_force_sample_interpolation
1791- intel/compiler: Use SHADER_OPCODE_SEND for PI messages
1792- intel/fs: Return early in a couple builtin setup helpers
1793- intel/compiler: Convert brw_wm_aa_enable to brw_sometimes
1794- intel/fs: Make per-sample and coarse dispatch tri-state
1795- intel/compiler: Convert wm_prog_key::persample_interp to a tri-state
1796- intel/compiler: Convert wm_prog_key::multisample_fbo to a tri-state
1797- intel/fs/validate: Assert SEND [extended] descriptors are uniform
1798- intel/fs: Break out yet another FB write helper
1799- intel/fs: Rework dynamic coarse handling
1800- nir/deref: Preserve alignments in opt_remove_cast_cast()
1801- nir/from_ssa: Use more helpers in resolve_parallel_copies
1802- nir/from_ssa: Only re-locate values that are destinations
1803- nir/from_ssa: Move the loop bounds check in resolve_parallel_copy
1804- nir: Add a load/store bit size lowering pass
1805- intel/nir: Use nir_lower_mem_access_bit_sizes()
1806- Revert "vk/runtime: turn vk.xml extension requirements into asserts"
1807- Revert "vk/util: keep track of extension requirements"
1808- vulkan: Remove unused fields from Extension and ApiVersion
1809- vulkan: Improve extension parsing
1810- vulkan: Parse the platform in Extensions.from_xml()
1811- vulkan: Add a get_all_required() helper
1812- vulkan: Properly filter entrypoints
1813- vulkan: Properly filter by api in enum_to_str
1814- Vulkan: Properly filter structs in vk_cmd_queue_gen
1815- vulkan: Filter out provisional extensions
1816- vulkan: Move the features generator to vulkan/util
1817- vulkan: Properly filter structs in vk_physical_device_features
1818- vulkan/layers: Use PUBLIC instead of VK_LAYER_EXPORT
1819- vulkan/device-select-layer: Include vulkan.h
1820- vulkan: Update the XML and headers to 1.3.241
1821- nir/lower_io: Handle buffer_array_length for more address modes
1822- anv: Drop our manual SSBO size handling
1823- hasvk: Drop our manual SSBO size handling
1824- panvk: Drop our manual SSBO size handling
1825- turnip: Set spirv_options::use_deref_buffer_array_length
1826- lavapipe: Set spirv_options::use_deref_buffer_array_length
1827- v3dv: Set spirv_options::use_deref_buffer_array_length
1828- spirv: Always emit deref_buffer_array_length intrinsics
1829- nir: Check against combined alignment in nir_lower_mem_access_bit_sizes
1830- nir: Add mode filtering to lower_mem_access_bit_sizes
1831- nir: Add UBO support to nir_lower_mem_access_bit_sizes
1832- nir: Add a combined alignment helper
1833- nir: Rename align to whole_align in lower_mem_load
1834- nir: Rename nir_mem_access_size_align::align_mul to align
1835- nir: Make chunk_align_offset const in lower_mem_load()
1836- nir: Handle wider unaligned loads in lower_mem_access_bit_size
1837- intel/nir: Limit unaligned loads to vec4
1838- vulkan/runtime: Rename and document storage image Z range
1839- intel/blorp: Set array_len for 3D images properly
1840- isl: Set Depth to array len for 3D storage images
1841- intel: Use nir_lower_tex_options::lower_index_to_offset
1842- vulkan: Update XML and headers to 1.3.244
1843- vulkan: Provide wrappers for VK_EXT_map_memory2 functions
1844- anv: Limit memory maps to the client-allocated size
1845- anv: Implement VK_KHR_map_memory2
1846- intel/isl: Support Yf/Ys/Tile-64 in isl_surf_get_image_offset_sa
1847- intel/blorp: Drop the TODO file
1848- docs: Fix Faith's name in relnotes
1849- nir: Drop a bunch of Authors tags
1850- spirv: Drop a bunch of Authors tags
1851- intel: Drop some author comments and update Faith's name
1852- util,mesa,panfrost: Drop some author tags
1853- vulkan: vk_android.c should be copyright Intel
1854- util: Update some copyright tags
1855- CODEOWNERS: s/jekstrand/gfxstrand
1856- vulkan,anv,hasvk,radv: Add a common vk_image_usage_to_ahb_usage helper
1857- vulkan/android: Fix hardware buffer usage flags
1858- vulkan: Add an ahardware_buffer_format field to vk_image
1859- anv,hasvk: Set vk_image.ahardware_buffer_format
1860- radv: Set vk_image.ahardware_buffer_format
1861- vulkan,anv,hasvk,radv: Unify Android hardware buffer creation
1862- vulkan: Add a vk_device_memory base struct
1863- anv: Use the new vk_device_memory base struct
1864- vulkan: Record pipeline flags in the render pass
1865- vulkan: Plumb rendering flags through vk_graphics_pipeline_state
1866- anv/pipeline: Use feedback loop flags for self-dependencies
1867- hasvk/pipeline: Use feedback loop flags for self-dependencies
1868- vulkan: Drop vk_render_pass_state::\*self_dependenc*
1869- vulkan: Drop VkRenderingSelfDependencyInfoMESA
1870
1871Felix DeGrood (10):
1872
1873- intel/perf: Hide extended metrics by default
1874- anv: cs_stall during compute state flush on < gen12.5
1875- anv: only emit CFE_STATE when scratch space increases
1876- anv: set CFE_STATE.OverDispatchControl to default
1877- iris: report draw count for perfetto
1878- anv/blorp: support surf generation for addresses
1879- anv/blorp: implement anv_cmd_buffer_fill_area
1880- anv/blorp: add flush reasons to RT flushes
1881- anv: reset query pools using blorp
1882- anv: disable reset query pools using blorp opt on MTL
1883
1884Filip Gawin (2):
1885
1886- crocus: don't quantize the clear value
1887- nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked
1888
1889Francisco Jerez (11):
1890
1891- intel/fs/gfx12: Ensure that prior reads have executed before barrier with acquire semantics.
1892- intel/disasm/gfx12+: Use helper instead of hardcoded bit access for 64-bit immediates.
1893- intel/disasm/gfx12+: Fix print out of non-existing condmod field with 64-bit immediate.
1894- intel/eu/gfx12+: Implement decoding of 64-bit immediates.
1895- intel/fs/gfx12+: Drop redundant handling of SHADER_OPCODE_BROADCAST in exec pipe inference.
1896- intel/fs: Fix src and dst types of LOAD_PAYLOAD ACP entries during copy propagation.
1897- intel/eu/gfx8-9: Fix execution with all channels disabled due to HW bug #220160235.
1898- intel/rt: Fix L3 bank performance bottlenecks due to SW stack stride alignment.
1899- intel/fs: Track force_writemask_all behavior of copy propagation ACP entries.
1900- intel/fs: Fix copy propagation dataflow analysis in presence of force_writemask_all ACP overwrites.
1901- intel/fs: Fix register coalesce in presence of force_writemask_all copy source writes.
1902
1903Frank Binns (7):
1904
1905- pvr: small cleanups
1906- pvr: remove start/stop transfer flags
1907- pvr: stop restricting the compiler to the Sascha Willems triangle demo
1908- pvr: remove duplicate define
1909- pvr: initialise size for placeholder "zeroed" shaders
1910- pvr: replace nop binary shader with run-time compiled shader
1911- pvr: fix clang-format issue
1912
1913Friedrich Vock (26):
1914
1915- radv/rt: Divide by the correct workgroup size
1916- radv/bvh: Prevent NANs when computing node cost
1917- radv/rmv: Also check the other pid field
1918- radv/rmv: Avoid more CPU unmap deadlocks
1919- radv/rmv: Log bo destruction before freeing it
1920- radv/rmv: Correct timestamp shifting
1921- vulkan/rmv: Use the timestamp divisor instead of a hardcoded value
1922- vulkan/rmv: Remove delta parameter from dump helpers
1923- mesa: Report GL_SHADER_BINARY_FORMAT_SPIR_V as supported
1924- docs: Fix formatting for RMV tracing docs
1925- radv: Extend hit attribute lowering for LDS
1926- radv: Use LDS for closest-hit hit attributes
1927- radv: Emit RT shader VA user SGPR
1928- radv/rt: Add shader config combination/postprocessing utils
1929- radv: Add RT shader stage names for executable properties
1930- aco: Swap operands for v_and_b32 in RT prolog
1931- radv/rt: Also adjust the SGPR count in postprocess_rt_config
1932- aco: Un-swap addressable VGPRs/SGPRs in RT prolog
1933- radv: Work around use-after-free compiler errors
1934- radv: Add RT stages to radv_mesa_to_rgp_shader_stages
1935- radv/rmv: Fix creating RT pipelines
1936- radv/rmv: Fix import memory
1937- radv/rt: Plug some memory leaks during shader creation
1938- radv: Don't leak the RT prolog binary
1939- radv: Always call si_emit_cache_flush before writing timestamps
1940- radv: Add driconf to always drain waves before writing timestamps
1941
1942GH Cao (1):
1943
1944- gallium: Add MCJIT target triplet for Windows ARM64
1945
1946Ganesh Belgur Ramachandra (1):
1947
1948- ac/nir: fix CDNA image lowering for array textures
1949
1950Georg Lehmann (81):
1951
1952- Revert "aco: Combine v_cvt_u32_f32 with insert to v_cvt_pk_u8_f32."
1953- aco: use s_bfm_64 for constant copies
1954- aco: use s_pack_ll_b32_b16 for constant copies
1955- aco: Improve wave64 cycle estimates.
1956- aco: fix imod/omod for gfx11 VOP3 opcodes
1957- aco: add mov/cndmask opcodes to does_fp_op_flush_denorms
1958- aco: don't allow output modifiers for v_cvt_pkrtz_f16_f32
1959- aco: allow output modifiers for ldexp_f16
1960- aco: don't list imod/omod support v_fmaak_f32/v_fmamk_f32
1961- aco: support omod/imod for v_fmac_f16
1962- aco: remove stale TODOs about v_interp opsel
1963- aco: new 16bit VOP3 opcodes can use opsel
1964- aco: Don't use vcmpx with DPP.
1965- aco: combine a ^ ~b and ~(a ^ b) to v_xnor_b32
1966- amd,nir: remove byte_permute_amd intrinsic
1967- nir: change 16bit image dest folding option to per type
1968- amd: don't use d16 for integer loads
1969- amd: d16 uses rtz conversion for 32bit float
1970- aco: use v_permlane(x)16_b32 for masked swizzle
1971- aco/gfx11: use dpp_row_xmask and dpp_row_share
1972- aco: use and swizzle mask in dpp quad perm
1973- aco/optimizer_postRA: assume all registers are untrackable in loop headers
1974- nir/opt_algebraic: add patterns for iand/ior of feq/fneu with 0
1975- aco: mark mad definition as precise if the mul/add were precise
1976- aco: use v_fma_mix_f32 for v_fma_f32 with 2 fp16 representable, different literals
1977- nir/lower_mediump: don't use fp16 for constants if the result is denormal
1978- aco: treat VINTERP_INREG as VALU
1979- aco/ir: rework IR to have one common valu instruction struct
1980- aco/ra: set opsel_hi to zero when converting to VOP2
1981- aco: validate VALU modifiers
1982- aco/print_ir: simplify using VALU instruction
1983- aco/optimizer: simplify using VALU instruction
1984- aco: remove VOP[123C]P? structs
1985- aco: add bitfield array helper classes
1986- aco: use bitfield array helpers for valu modifiers
1987- aco/assembler/gfx11: simplify 16bit VOP12C promotion to VOP3
1988- aco/optimizer: don't reallocate instruction when converting to VOP3
1989- aco: don't reallocate fma{mk,ak,_mix} instruction
1990- aco: copy abs/neg with assignment
1991- aco: use integer access for neg_lo/neg_hi
1992- aco: use array indexing for opsel/opsel_lo/opsel_hi
1993- aco: access neg/abs as int in usesModifiers
1994- aco: use bitfield_array for temporary neg/abs/opsel
1995- nir: optimize i2f(f2i(fsign))
1996- aco: remove duplicates from .clang-format
1997- amd: remove duplicate from .clang-format
1998- aco: don't check usesModifiers for pseudo instructions
1999- aco: fix p_interp_gfx11 comment
2000- aco: make .clang-format usable with tests
2001- aco/ir: fix copy paste bug in convert_to_SDWA
2002- aco/util: override default assignment operator for bitfield helpers
2003- aco: clean up to_mad_mix
2004- aco/ra: don't reallocate VOP3 instruction for non-vcc lane mask
2005- aco/vn: hash opsel for VOP12C
2006- aco/assembler: support VOP12C opsel
2007- aco: validate VOP12C opsel
2008- aco/to_hw_instr: use VOP1 opsel for v_mov_b16
2009- aco/ra: prepare for VOP12C opsel
2010- aco/optimizer: preserve opsel when fusing fma
2011- aco: handle opsel in combine_comparison_ordering
2012- aco: handle opsel in combine_ordering_test
2013- aco: handle opsel in combine_constant_comparison_ordering
2014- aco: update match_op3_for_vop3 for VOP12C opsel
2015- aco: support v_cvt_f32_f16 with opsel in combine_mad_mix
2016- aco: support neg(mul)/abs(mul) optimization in more cases
2017- aco: return true in usesModifiers for VOP12C with opsel
2018- aco: swap opsel when swapping VOP2/C operands
2019- aco/ir: copy opsel when converting to DPP
2020- aco: don't label mul with opsel as abs/neg
2021- aco/gfx11: allow opsel for VOP12C
2022- aco/optimizer: use opsel for VOP12C
2023- aco: keep label_mul/usedef/minmax in apply_extract
2024- aco/optimizer: remove to_SDWA
2025- aco: add tests for fma with opsel
2026- aco: add tests for dpp with opsel
2027- aco: add tests for swap operand with opsel
2028- aco: add tests for cmp ordering with opsel
2029- aco: add test for min/max combining with opsel
2030- aco/tests: run optimize.mad_mix.input_conv.modifiers on gfx11
2031- aco: add tests for neg(mul) with opsel
2032- aco/tests: add missing dependency on generated header
2033
2034Gert Wollny (49):
2035
2036- glsl/nir: only set uses_sample_shading when the output is a fbfetch
2037- nir: Add possibility to store image var offset in range_base
2038- nir: Add range_base to atomic_counter and an option to use it
2039- ntt: handle the image intrinsic range_base when translating to TGSI
2040- ntt: Make use of the range_base offset when translating atomics in NTT
2041- virgl: lower image variable offsets into the intrinsic range_base value
2042- virgl: Request setting the atomic offset in the range_base
2043- virgl: drop the separable flag for cases that can't be handled
2044- r600/sfn: Fix readport check
2045- r600/sfn: Do a bit of cleanup with the secondary read port validation
2046- r600/sfn: Fix opcode and result dest slot mask for variable size dot
2047- r600/sfn: Fix splitting of multislot alu ops
2048- virgl: remove unused virgl_encoder_inline_write
2049- r600/sfn: Use range_base for atomics and images
2050- r600/sfn: Work around dependency issue when splitting op to group
2051- r600/sfn: drop useless instr use count
2052- r600/sfn: Fix a typo
2053- r600/sfn: Silence warnings about unused parameters
2054- r600/sfn: Don't copy propagate indirect loads to more than one dest
2055- r600/sfn: Stop try scheduling in t-slot with empty related v-slot
2056- r600/sfn: rename texture coordinate offset for clarity
2057- r600/sfn: address use in group only if instr can be added
2058- r600/sfn: Forward setting the block ID and index
2059- r600: Don't start new CF for every fetch through tex clause
2060- r600/sfn: Fix handling of fetch through texture clause
2061- r600/sfn: Fix alu trans op flag setup
2062- r600/sfn: Fix Cayman trans from string and add test for copy prop
2063- vulkan/wsi: Take Xwayland into account for x11_min_image_count
2064- zink/kopper: Add extra swapchain images for Venus
2065- r600/sfn: be more conservative with channel use in multi-slot ops
2066- r600/sfn: Fix readport cylce map
2067- r600/sfn: Fix minimum required registers
2068- r600/sfn: Add AluGroup method to update readport validation from scratch
2069- r600/sfn: Split AluInstr replace_source into test and actual replace
2070- r600/sfn: Add method to AluGroup to replace sources
2071- r600/sfn: Add print method to AluReadportValidation
2072- r600/sfn: redirect copy propagation to alu parent group
2073- r600/sfn/tests: Add a test for the copy prop into a group
2074- r600/sfn: Fix atomic lowering
2075- virgl: Enable AMD_vertex_shader_(layer|viewport_index) when host supports it
2076- virgl: Don't try to do re-alloc or readback by transfer for blob resources
2077- ntt: add option to lower SSBO bindings to buffer index
2078- virgl: Lower binding start into buffer indices
2079- r600/sfn: fix container allocators
2080- r600/sfn: Lower tess levels to vectors in TCS
2081- r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats
2082- r600/sfn: assign window_space_position in shader state
2083- r600/sfn: Ass support for image_samples
2084- r600/sfn: fix cube to array lowering for LOD
2085
2086Giancarlo Devich (25):
2087
2088- d3d12: Use varying comparison function for TESS stage key compare
2089- d3d12: Add unions to encompass shader key stage vars, use in hashing
2090- nir: Check sampler_binding is valid when lowering tex shadow
2091- d3d12: Don't clear d3d12_shader_key
2092- d3d12: Move d3d12_context_state_table_entry to d3d12_resource_state.h
2093- d3d12: Assign up to 16 simultaneously active contexts unique IDs
2094- d3d12: Track up to 16 active context resource states locally in d3d12_bo
2095- d3d12: Don't recompute has_flat_varyings or missing_dual_src_outputs
2096- d3d12: Track max varying slot, set and compare less bytes
2097- d3d12: Don't unnecessarily zero out gs/tcs keys
2098- d3d12: Don't memcmp gs/tcs keys
2099- d3d12: Create varying structures as necessary, reference them
2100- d3d12: Don't loop in update_draw_indirect_with_sysvals
2101- d3d12: Compare shader keys with a switch, instead of cascading if's
2102- d3d12: Compare shader keys with union-encompassing fields all at once
2103- d3d12: Compare shader key common parts with memcmp, instead of if's
2104- d3d12: Cache varying info to reduce compare/copy cost
2105- d3d12: Use memcmp for full tcs/gs variant keys
2106- d3d12: Track up to 16 contexts worth of pending barriers locally in bos
2107- d3d12: Don't unnecessarily recompute manual_depth_range
2108- d3d12: Use context-level sampler_state array for filling shader keys
2109- d3d12: Use short circuit in shader key compare; update key hash
2110- d3d12: Reduce gs variant key init cost; unnecessary validate gs calls
2111- d3d12: Unroll shader variant selection loop
2112- d3d12: Track up to 16 contexts worth of batch references locally in bos
2113
2114Guilherme Gallo (24):
2115
2116- radeonsi/ci: Update stoney test expectations
2117- radeonsi/ci: Skip slow traces on raven
2118- Revert "ci: disable Collabora's LAVA lab for maintance"
2119- ci/lava: Move LAVA dependencies to pip
2120- ci/lava: Add LavaFarm class to find LAVA farm from runner tag
2121- ci/lava: Fix LAVA logs issues for Collabora jobs
2122- ci: Upload debian-release artifact to S3
2123- ci: Create debian-arm64-release job
2124- ci: Use release builds in perf jobs
2125- ci: Use workflow to make CI aware of performance jobs
2126- ci: Reuse MESA_CI_PERFORMANCE_ENABLED in performance-rules
2127- ci: Handle carriage return characters in LAVA logs
2128- ci: Fix release build use for performance jobs
2129- ci/baremetal: Wrap artifact download curl with xtrace
2130- ci: Improve piglit-traces "no-perf" filter
2131- ci: Fix freedreno-rules-performance
2132- ci: Add piglit traces hidden jobs
2133- ci/freedreno: create a618-traces and perf jobs
2134- ci/zink: Add zink-a618 trace jobs
2135- ci/zink: Add zink-turnip-manual-rules
2136- ci/zink: Add zink-tu-a618-traces-performance job
2137- ci/zink: Fix zink-tu-a618-traces perf job rules
2138- ci/zink: Reduce zink-tu-a618-traces parallelism
2139- Revert "ci: disable Collabora's LAVA lab for maintance"
2140
2141Hampus Linander (4):
2142
2143- nir: Add extr_agx opcode
2144- agx: Add extr instruction to AGX backend
2145- agx: Use AGX extr for tex lowering
2146- agx: Optimize lower_resinfo for cube maps
2147
2148Hans-Kristian Arntzen (13):
2149
2150- radv: Fix invalid 64-bit shift.
2151- radv: Fix missing VK_ACCESS_2_SHADER_SAMPLED_READ_BIT.
2152- radv: Implement VK_ACCESS_2_DESCRIPTOR_BUFFER_READ_BIT_EXT.
2153- wsi/common: Add common implementation of vkReleaseSwapchainImagesEXT.
2154- wsi/x11: Implement EXT_swapchain_maintenance1.
2155- wsi/common: Implement swapchain present fence.
2156- wsi/common: Add comment about DEFERRED_ALLOCATION_BIT_EXT.
2157- wsi/common: Add function to modify present mode.
2158- wsi/wayland: Implement EXT_swapchain_maintenance1.
2159- wsi/display: Implement EXT_swapchain_maintenance1.
2160- wsi/win32: Implement VK_EXT_swapchain_maintenance1.
2161- radv: Expose VK_EXT_swapchain_maintenance1.
2162- wsi/x11: Fix present ID signal when IDLE comes before COMPLETE.
2163
2164Harri Nieminen (6):
2165
2166- docs/specs: Fix typos
2167- docs/gallium: Fix typos
2168- docs/freedreno: Fix typos
2169- docs/panfrost: Fix typo
2170- docs/svga3d: Fix typo
2171- bin: Fix typos
2172
2173Helen Koike (12):
2174
2175- ci/debian-android: move pkgconfig paths to the cross file
2176- ci: move patches to patches directory
2177- android: allow system = 'android' on cross file
2178- ci/android: move sdk version and ndk to a job variable
2179- ci: compile deqp for android
2180- ci: compile deqp-runner for android
2181- ci: debian-android compile virgl
2182- ci: export artifacts from debian-android
2183- ci/android: add android to the ci
2184- android/ci: fix removal of inexistent file
2185- android/ci: Fix call to adb
2186- android/ci: raise error on script when not related to the tests
2187
2188Hyunjun Ko (1):
2189
2190- vulkan/runtime: match the spec when taking pipeline subsets.
2191
2192Iago Toral Quiroga (20):
2193
2194- broadcom/compiler: produce better code for f2f16 with RTZ rounding
2195- v3dv: add paths to handle partial copies of linear images
2196- v3dv: drop unused field from v3dv_cmd_buffer
2197- v3dv: increase BO allocation size when growing CLs
2198- v3dv: ensure we allocate at least the requested space for a CL
2199- v3dv: add a cl_advance_and_end helper
2200- v3dv: ensure at least V3D_CL_MAX_INSTR_SIZE bytes in last CL instruction
2201- v3dv: ensure we apply binning syncs to secondary command buffers
2202- v3dv: fix stencil view aspect selection of depth/stencil image
2203- v3d: support r{g,gba}16f formats for vertex buffers
2204- broadcom/compiler: track pending ldtmu count with each TMU lookup
2205- v3dv: pause occlusion queries during vkCmdClearAttachments
2206- v3dv: fix format swizzle for buffer views
2207- v3dv: drop unused parameter
2208- v3dv: always acquire display device before checking if we can present
2209- vulkan/wsi/display: set pDisplay to NULL on error
2210- v3d,v3dv: stop trying to force 16-bit TMU output for shadow comparisons
2211- broadcom/compiler: fix v3d_qpu_uses_sfu
2212- broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper
2213- broadcom/compiler: fix incorrect check for SFU op
2214
2215Ian Romanick (60):
2216
2217- ntt: Add support for fcsel_gt and fcsel_ge opcodes
2218- nir/lower_int_to_float: Add support for i32csel opcodes
2219- r300: Enable generation of fcsel_gt and fcsel_ge opcodes
2220- i915: Enable generation of fcsel_gt and fcsel_ge opcodes
2221- gallium/draw: Enable aapoint NIR helpers to generate bool1, bool32, or float32 Booleans
2222- gallium/draw: Enable polygon stipple NIR helpers to generate bool1 or bool32 Booleans
2223- nir/builder: Eliminate nir_f2b helper (and use of nir_f2b32 helper)
2224- nir/builder: Handle f2b conversions specially in nir_type_convert
2225- nir: Eliminate nir_op_f2b
2226- lavapipe: Fix bad array index scale factor in lvp_inline_uniforms pass
2227- lavapipe: Only check NULL pointers in one place in src_only_uses_uniforms
2228- nir/inline_uniforms: Change num_offsets type to uint8_t
2229- nir/inline_uniforms: Pass max_num_bo and max_offset around as parameters
2230- nir/inline_uniforms: Allow possibility of more than one UBO
2231- nir/inline_uniforms: Allow possibility of uni_offsets and num_offsets being NULL
2232- nir/inline_uniforms: Make src_only_uses_uniforms public, change name
2233- nir/inline_uniforms: Make add_inlinable_uniforms public
2234- nir/inline_uniforms: Add inot condition support
2235- nir/tests: Don't unconditionally log shaders from this one CF test
2236- nir/tests: Refactor creation of loops for loop_analyze test cases
2237- nir/tests: Add tests for "inverted" loops
2238- nir/tests: Add tests for nir_loop_info::induction_vars tracking
2239- nir/loop_analyze: Track induction variables with uniform increments
2240- nir/loop_analyze: Use nir_loop_variable::update_src instead of nir_basic_induction_var::alu
2241- nir/loop_analyze: Use nir_loop_variable::init_src instead of nir_basic_induction_var::def_outside_loop
2242- nir/loop_analyze: Eliminate nir_basic_induction_var
2243- nir/loop_analyze: Track induction variables with uniform initializer
2244- nir/loop_analyze: Simplify some logic in compute_induction_information
2245- nir: ifind_msb_rev can only have int32 sources
2246- intel/compiler: Lower find_lsb in NIR
2247- nir: intel/compiler: Move ifind_msb lowering to NIR
2248- intel/compiler: Tighter src and dest size bounds checking for some opcodes
2249- nir/algebraic: Only lower ufind_msb with 32-bit sources
2250- nir: intel/compiler: Move ufind_msb lowering to NIR
2251- nir/builder: Do not generate 8- or 16-bit find_msb
2252- nir/algebraic: Do not generate 8- or 16-bit find_msb
2253- nir: Restrict ufind_msb and ufind_msb_rev to 32- or 64-bit sources
2254- nir/algebraic: Optimize some ifind_msb to ufind_msb
2255- nir/lower_int64: Optionally lower ufind_msb using uadd_sat
2256- intel/fs: Don't copy propagate from saturate to sel
2257- nir/algebraic: Undistribute fsat from fmax
2258- intel/fs: Output opt_combine_constants debug to stderr
2259- intel/fs: Refactor part of opt_combine_constants to a separate function
2260- intel/fs: Rework the loop of opt_combine_constants that collects constants
2261- intel/compiler: Remove one overload of backend_instruction::insert_before
2262- intel/compiler: Use NIR_PASS instead of NIR_PASS_V
2263- intel/compiler: Micro optimize inst_is_in_block
2264- intel/fs: Use specialized version of regions_overlap in opt_copy_propagation
2265- intel/compiler: Micro optimize regions_overlap
2266- intel/fs: Linked list micro optimizations in brw_nir_move_interpolation_to_top
2267- intel/fs: Preserve meta data more often in brw_nir_move_interpolation_to_top
2268- intel/fs: White space fixes
2269- nir/tests: Add many loop analysis tests for induction vars updated by shifts
2270- nir/tests: Add more loop analysis tests for induction vars updated by shifts
2271- nir/tests: Add many loop analysis tests for induction variables modified by imul
2272- nir/loop_analyze: Add a function to evaluate an ALU as constant
2273- nir/loop_analyze: Track induction variable basis information
2274- nir/loop_analyze: Change invert_cond instead of changing the condition
2275- nir/loop_analyze: Use try_eval_const_alu and induction variable basis info
2276- nir/tests: Port almost all loop_analyze tests to new macro-based infastructure
2277
2278Ikshwaku Chauhan (1):
2279
2280- radeonsi: Fix distortion for yuv422 format for GFX10.
2281
2282Illia Abernikhin (1):
2283
2284- util: Extend vk_enum_to_str with bitmasks vk_enum_to_str only generates literals for enums with type: @type="enum", but many enums have type: @type="bitmask" and were not taken into account here.
2285
2286Illia Polishchuk (4):
2287
2288- ANV: Add extra memory types for ANV driver instead of a single one
2289- hasvk: Add extra memory types for hasvk driver instead of a single one
2290- nir: Add sha1 hash for nir shaders converted from spir-v
2291- glx: fix indirect initialization crash
2292
2293Ilya K (1):
2294
2295- intel/vk/grl: don't install libgrl.a
2296
2297Isaac Bosompem (1):
2298
2299- tool/pps: Fix 32-bit build issue with format string
2300
2301Isabella Basso (5):
2302
2303- nir/algebraic: insert patterns inside optimizations list
2304- nir/algebraic: extend mediump patterns
2305- nir/algebraic: extend lowering patterns for conversions on smaller bit sizes
2306- nir/algebraic: make patterns for float conversion lowerings imprecise
2307- nir/algebraic: remove duplicate bool conversion lowerings
2308
2309Italo Nicola (10):
2310
2311- panfrost: fix off-by-one when exporting format modifiers
2312- panfrost: fix tiny sample_positions BO memory leak
2313- hud: use defines for default scale/rotation/visibility values
2314- hud: add GALLIUM_HUD_OPACITY envvar
2315- panfrost: fix strict-aliasing violations when packing fb ptrs
2316- etnaviv: abort() instead of assert(0) on compiler error
2317- etnaviv: use stderr for compiler error logging
2318- etnaviv: add default clear_buffer and clear_texture APIS
2319- etnaviv: lower (un)pack_{2x16,2x32}_split and extract_{byte,word}
2320- etnaviv: implement nir_op_uclz and lower find_{msb,lsb} to uclz
2321
2322Iván Briano (7):
2323
2324- anv: uncompressed views of compressed 3d images are now valid
2325- vulkan: track the right value on CmdSetColorWriteMasks
2326- anv: fix testing for dynamic color blend bits
2327- anv: stop tracking color blend state in the pipeline
2328- anv: use the parameter passed to the macro
2329- intel/fs: handle interpolation modes for at_sample and at_offset too
2330- vulkan/wsi/display: do not dereference a NULL pointer
2331
2332Jakub Kulík (1):
2333
2334- mesa: Fix format transform on big endian platforms
2335
2336Jan Beich (1):
2337
2338- util/u_process: implement util_get_command_line for BSDs
2339
2340Janne Grunau (1):
2341
2342- asahi: Fix typo in debug/error message helper macro
2343
2344Jarred Davies (16):
2345
2346- pvr: Use common queue submit implementation
2347- pvr: Add support for VK_KHR_timeline_semaphore
2348- pvr: Enable threaded submit when supported
2349- pvr: Clear wait syncs after job submission
2350- pvr: Don't update fragment signal sync when fragment stage is disabled
2351- pvr: Fix segfaults when pDepthStencilAttachment is NULL
2352- pvr: Generate EOT program at runtime
2353- pvr: Generate dummy emit for renders without any emits
2354- pvr: Add support for multiple emits from EOT program
2355- pvr: Select a single aspect format for the texture state of DS image views
2356- pvr: Add initial support for VK_FORMAT_S8_UINT
2357- pvr: Don't allocate/upload 0 size coeff programs
2358- pvr: Always mark robustBufferAccess as supported
2359- pvr: Rename pvr_xgl_pds.c to pvr_pipeline_pds.c
2360- pvr: Add robustness buffer support
2361- pvr: Mark all normalized formats as supporting with_packed_usc_channel
2362
2363Jesse Natalie (224):
2364
2365- ci/windows: Download updated WARP 1.0.4 package
2366- dzn/ci: Remove flakes/fails that don't hit anymore
2367- dzn/ci: Add image test group, which is all passing now
2368- dzn: Fix clear bind flag logic
2369- microsoft/compiler: Lower pack_[u/s]norm_2x16
2370- microsoft/compiler: Implement texture sample count query
2371- microsoft/compiler: Remove arrays when testing for structs in I/O
2372- microsoft/compiler: Always emit float types in the I/O signature for structs
2373- microsoft/compiler: Re-work the logic for adding SV_SampleIndex to force sample-rate
2374- microsoft/compiler: Use nir info.fs.uses_sample_shading to force sample-rate
2375- microsoft/compiler: Set num_components to 4 when updating pos write instructions
2376- spirv2dxil: For removing unused vars, consider the whole I/O var size
2377- spirv2dxil: When removing unused inputs, make sure they're actually inputs
2378- spirv2dxil: Allow killing position as an undef varying
2379- spirv2dxil: Replace not-provided inputs with zero instead of undef
2380- dzn: Get options13
2381- dzn: Support alpha blend factor
2382- dzn: When changing root signature, dirty descriptors too
2383- dzn: Use R24G8_TYPELESS for 24/8 depth resources
2384- dzn: Support int border colors
2385- dzn: Storage buffer sizes need to be 4-byte-aligned
2386- dzn: Set MultisampleEnable to enable MSAA lines
2387- dzn: Use typeless format for creation of depth-only or stencil-only D24S8
2388- dzn: Define a symbol that was present in older D3D headers
2389- dzn: Support root signature 1.2
2390- dzn: Support unnormalized coordinate samplers
2391- dzn: Always align cached pipeline header size to input element align
2392- dzn: Add a zeroed zsa state when depth or raster is disabled
2393- dzn: Disable depth when the rasterizer is disabled due to no position output
2394- dzn: Fix format support checks for storage/uniform texel buffers
2395- dzn: Remove cmdbuf query 'wait' list
2396- microsoft/compiler: Delete incorrect implementation for load_layer_id
2397- microsoft/compiler: Subpass textures are supposed to be arrays
2398- microsoft/compiler: Delete stale TODO comment
2399- microsoft/compiler: Support view instancing
2400- spirv2dxil: Pass runtime conf struct to lower_shader_system_values
2401- spirv2dxil: Implement lowering for multiview
2402- spirv2dxil: Claim multiview support
2403- dzn: Put nir compilation options in a struct
2404- dzn: Handle multiview pipeline creation
2405- dzn: Handle draws and clears for multiview rendering
2406- dzn: Implement multiview queries
2407- dzn: Enable multiview
2408- dzn: Enable independent blending
2409- dzn: Delete an unnecessary assert
2410- dzn: Rework meta blit VS
2411- microsoft/compiler: Add an overload param to unary function helpers
2412- microsoft/compiler: Implement a few basic wave/subgroup intrinsics
2413- microsoft/compiler: Add lowering passes for basic subgroup vars
2414- spirv2dxil: Use 32-bit shared offsets
2415- spirv2dxil: Support basic subgroups
2416- dzn: Support basic subgroups
2417- microsoft/compiler: Fix atomic image umax
2418- microsoft/compiler: Lower device index to zero
2419- spirv2dxil: Support dispatches with base group indices
2420- dzn: Support vkCmdDispatchBase
2421- dzn: Use common physical device list/enumeration helpers
2422- dzn: Respect suspending/resuming flags to omit clears/resolves
2423- dzn: Set dynamic rendering caps
2424- dzn: When rendering to 3D, don't treat layers as subresources for barriers
2425- dzn: Move patched vertex buffer capability check up a level
2426- dzn: Use SHADER_LOAD to indicate SAMPLED_IMAGE support
2427- dzn: Usage image view usage instead of image usage
2428- dzn: Support EXTENDED_USAGE bit
2429- dzn: Usage MULTISAMPLE_LOAD support instead of RT/DS support for MSAA
2430- dzn: Descriptor limits are based on binding tier, not heap tier
2431- dzn: A single sampler descriptor set needs to support 1024 samplers
2432- dzn: Don't expose variable pointers
2433- dzn: Fix independent blend check
2434- dzn: Enable Vulkan 1.1
2435- microsoft/compiler: Don't emit threadgroup barriers for graphics shaders
2436- microsoft/compiler: Handle i2i1 and u2u1
2437- microsoft/compiler: Handle i1 overloads
2438- microsoft/compiler: Implement more wave/quad ops
2439- microsoft/compiler: Support emitting the SM6.6 wave size tag
2440- spirv2dxil: Lower some wave op properties
2441- spirv2dxil: Support subgroup SPIR-V caps
2442- dzn: Support more subgroup/quad ops
2443- dzn: Implement subgroup size control extension
2444- dzn: Use core feature matching logic instead of rolling our own
2445- microsoft/compiler: Support float controls
2446- dzn: Fix dynamic rendering clear load op for non-multiview
2447- dzn: Handle separate stencil usage
2448- dzn: Cache GPUVA for buffers
2449- dzn: Support float control
2450- dzn: Always do clears with copies on non-graphics queues
2451- dzn: Enhanced barriers fixes/workarounds
2452- dzn: Ensure we don't mix DSV+simultaneous-access
2453- dzn: Support Vulkan 1.2
2454- dzn: Fix Windows WSI
2455- dzn: Don't recursively lock the physical device enum mutex
2456- dzn: Report as a software device for non-Windows
2457- CI/windows: Don't limit deqp-runner to 4 jobs
2458- CI/windows: Apply CI_FDO_CONCURRENT to piglit too
2459- dzn: Consider linked shaders when computing DXIL hash
2460- wsi/win32: Always use non-SRGB formats for DXGI
2461- wsi/win32: Use app-provided timeout instead of arbitrary hardcoded value
2462- CI: Lima farm is offline
2463- dzn, driconf: Add a driconf entry for NMS to claim wide line support
2464- vulkan/wsi: Add a wsi_device param to get_present_modes
2465- vulkan/wsi/win32: Support tearing (immediate) and VSync (FIFO) present modes
2466- wsi/win32: Don't require buffer blits for software drivers
2467- wsi/win32: We don't need a window DC for DXGI
2468- clc: Include opencl-c-base.h with LLVM 15 (using builtins)
2469- microsoft/clc: Set features that are used by CL tests
2470- ci/windows: Update LLVM to 15
2471- nir: Add alignment to load_push_constant
2472- nir_lower_fp16_casts: Allow opting out of lowering certain rounding modes
2473- microsoft/compiler: Handle struct consts in DXIL module dumper
2474- microsoft/compiler: Handle frcp for float16/float64
2475- microsoft/compiler: Ensure native_low_precision is set for 16-bit bitcasts/stores
2476- microsoft/compiler: Handle undef-rounding f2f16 as rtz
2477- microsoft/compiler: Move unaligned load/store pass from CL
2478- microsoft/compiler: Pass deref modes to unaligned pass and handle push const
2479- microsoft/compiler: Simplify bitpacking for load/store lowering with nir_extract_bits
2480- microsoft/compiler: Pass an alignment to constant buffer load lowering
2481- microsoft/compiler: Handle 48-bit stores to SSBO/shared
2482- microsoft/compiler: Support raw buffer load/store intrinsics with 16bit alignment
2483- microsoft/compiler: Support lowering SSBO accesses to 16bit vectors
2484- spirv2dxil: Set min UBO/SSBO alignments
2485- spirv2dxil: Lower unaligned loads and stores
2486- spirv2dxil: Move shader model into runtime conf struct
2487- spirv2dxil: Support 16bit types
2488- dzn: Enable get_surface_capabilities2
2489- dzn: Delete unused extensions table
2490- dzn: Get options4
2491- dzn: Enable 16bit types when supported
2492- dzn: Enable KHR_storage_buffer_storage_class
2493- vulkan/wsi: Fix Windows build
2494- radv: Fix returning an expression from a void function
2495- Revert "CI: Disable Windows runners"
2496- nir: Propagate alignment when rematerializing cast derefs
2497- microsoft/compiler: Implement wave reduce/exclusive scan ops that are supported
2498- microsoft/compiler: Add a lowering pass for scan ops that aren't supported
2499- spirv2dxil: Handle arithmetic subgroup ops
2500- dzn: Claim the arithmetic subgroup bit
2501- ci/windows: Update warp to 1.0.5
2502- microsoft/compiler: Handle writable buffer UAV size queries
2503- d3d12: Report correct texel buffer max size
2504- d3d12: Fix buffer SRV/UAV creation
2505- d3d12: Remove now-unused UAV format from shader info
2506- microsoft/compiler: Fix setting bit 31 in feature flags
2507- microsoft/compiler: Only set typed UAV load feature bit for multi-comp loads
2508- microsoft/compiler: Refactor type -> resource kind helper
2509- microsoft/compiler: Add helpers for getting res_props structs
2510- microsoft/compiler: Split handle annotation into two parts
2511- microsoft/compiler: Handle "bindless" image/tex sources as heap indices
2512- microsoft/compiler: Support descriptor heap indexing for UBO/SSBO
2513- microsoft/compiler: Use store_dest instead of store_dest_value more
2514- microsoft/compiler: Update header docs for binding modes supported by compiler
2515- spirv2dxil: Add a pass to lower deref tex/image and vulkan ubo/ssbo to bindless
2516- spirv2dxil: Only lower readonly images to SRVs when the option is set
2517- spirv2dxil: Support descriptor indexing capabilities
2518- dzn: Remove device pointers from descriptor heaps
2519- dzn: Remove descriptor heap type from descriptor heap wrapper
2520- dzn: Fix a leak in descriptor set layout creation
2521- dzn: Add some docs around descriptor sets and remove redundant/unused data
2522- dzn: Put UAVs first for storage images/buffers in descriptor tables
2523- dzn: Consistently order depth formats before stencil
2524- dzn: Don't use plane slice 1 for depth+stencil SRVs
2525- dzn: Set up SRV descs for 3D textures correctly
2526- dzn: Skip setting up UAVs for depth resources
2527- dzn: Add initial bindless infrastructure
2528- dzn: When bindless, only allocate one descriptor per layout entry
2529- dzn: Remove defragmenting of descriptor pools
2530- dzn: Delete unused function
2531- dzn: Allocate descriptor sets in buffers for bindless mode
2532- dzn: Don't dirty bindings if root signature doesn't change
2533- dzn: Use separate dirty bits for descriptor sets/dynamic buffers
2534- dzn: Bind buffers for bindless descriptor sets
2535- dzn: Add a binding classification in the pipeline layout remapping
2536- dzn: When binding a bindless root signature, bind descriptor heaps first
2537- dzn: Ensure root signatures are re-bound after a meta op
2538- dzn: Only bind descriptor sets up to the used amount of the current layout
2539- dzn: Apply bindless lowering when compiling pipelines
2540- dzn: Add a debug option for enabling bindless mode
2541- dzn: Support descriptor indexing via bindless
2542- dzn: Enable variable size bindings
2543- dzn: Use mesa_loge for DXIL validation errors
2544- microsoft/clc: Add shader model / validator to compiler API
2545- d3d12: Move forward-front-face pass to common DXIL code
2546- spirv2dxil: Expose yz flip pass to external callers
2547- dzn: Add a helper to generate triangle->point GS
2548- dzn: Handle polygon point mode
2549- dzn: Claim fillModeNonSolid
2550- CI/windows: Update headers and Agility redist to 1.710.0-preview
2551- CI/windows: Increase timeout for build container job
2552- microsoft/compiler: Fix 8-bit loads and stores when supporting 16-bit DXIL
2553- microsoft/compiler: Fix barrier for wave ID computation
2554- microsoft/compiler: Assign 1D wave IDs based on local thread ID
2555- microsoft/compiler: Fix large shifts
2556- spirv2dxil: Add some more supported caps
2557- dzn: Add a driconf entry for enabling 8bit loads and stores
2558- dzn: Add a driconf option for enabling subgroup ops in VS/GS
2559- dzn: Fix SRV barrier state on compute command lists
2560- dzn: Raise max number of descriptor sets to 8
2561- dzn: Report some more caps correctly that are supported
2562- dzn: Align descriptor sets in the bindless buffer
2563- dzn: Ensure pipeline variants are used for dynamic stencil masks
2564- dzn: Don't use write-combine memory for cache-coherent UMA
2565- dzn: Ensure buffer offsets are aligned
2566- dzn: Attempt to force depth write states for depth access in LAYOUT_GENERIC
2567- dzn: Don't do initial-layout barriers for simultaneous-access resources
2568- dzn: Batch command lists together
2569- dzn: Fix bindless descriptor sets with multiple dynamic buffers that need custom descriptors
2570- dzn: Early-out on no-op barriers
2571- dzn: Clean up ABI helpers now that we require DirectX-Headers 606
2572- dzn: Use GetResourceAllocationInfo3 for castable formats
2573- dzn: Don't leave deleted physical devices in the instance pdev list
2574- dzn: Remove skips now that WARP is faster
2575- dzn: Support >2K samplers with bindless
2576- dzn: Remove xfail for test that passes (if run)
2577- microsoft/compiler: Don't split loads/stores that will be split by lower_explicit_io
2578- dzn: Changes to descriptor set dirty flag handling
2579- dzn: Use a linear allocator for upload data on command buffers
2580- dzn: Ignore unnormalized sampling flag if driver doesn't support it
2581- dzn: Never set STATE_RENDER_TARGET on a compute command list
2582- dzn: Don't enable bindless by default
2583- d3d12: Support creating PSOs with no attachments with MSAA without TIR
2584- d3d12: Fix buffer reference leak for SO count staging buffer
2585- dzn: Handle mismatches in bound descriptor set vs pipeline layout
2586- d3d12: Respect buffer offsets for sampler views
2587- dzn: Hook up subgroup size to compute shader compilation
2588- dzn: Delete queue-level event waits
2589
2590Jonathan Gray (3):
2591
2592- egl/dri2: avoid undefined unlocks
2593- intel/dev: remove invalid EHL pci id
2594- intel/dev: Add another EHL pci id
2595
2596Jonathan Marek (1):
2597
2598- turnip: fix use of align() instead of util_align_npot() with tile_align_w
2599
2600Jordan Justen (8):
2601
2602- intel/vk/grl: Don't include anv_private.h in genX_grl.h
2603- intel/vk/grl: Allow genX_grl.h to be included by C++ files
2604- intel/vk/grl: Allow grl/grl_cl_kernel.h to be included by C++ files
2605- intel/vk/grl: genX-ify genX_grl_uuid.cpp
2606- intel/vk/grl: genX-ify grl_cl_kernel_name()
2607- intel/dev: Enable MTL PCI ids
2608- intel/compiler: Support fmul_fsign opt for fp64 when int64 isn't supported
2609- intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops
2610
2611Joshua Peisach (1):
2612
2613- gallum/asahi: fix memory leak in agx_resource_from_handle
2614
2615José Fonseca (4):
2616
2617- llvmpipe: Ensure floating point SSE state is reset regardless of the write mask.
2618- llvmpipe: Honor zero sample_mask when multisample is disabled.
2619- trace: Don't use italic escape code.
2620- wgl: Fix unintentional assignment on assert.
2621
2622José Roberto de Souza (107):
2623
2624- anv: Start to move i915 specific code from anv_device to i915/anv_device
2625- anv: Export anv_exec_batch_debug() and chain_command_buffers()
2626- anv: Split i915 code from anv_batch_chain.c
2627- anv: Move anv_device_check_status() code to i915/anv_device.c
2628- intel/dev: Export functions that will be used by different kernel drivers
2629- intel/dev: Move i915 code to i915/intel_device_info.c
2630- intel/dev: Split hwconfig i915 specific code
2631- intel/dev: Detect what is the kernel mode driver loaded
2632- intel: Add intel_kmd_type parameter to intel_engine_get_info()
2633- intel: Add kmd_type parameter to necessary intel_gem.h functions
2634- anv: Nuke anv_queue:index_in_family
2635- hasvk: Nuke anv_queue:index_in_family
2636- intel/ds: Nuke intel_ds_queue::queue_id
2637- intel/ds: Fix crash when allocating more intel_ds_queues than u_vector was initialized
2638- intel/genxml/gen125: Add walker configuration fields to 3DSTATE_WM
2639- intel/genxml/gen125: Tune 3DSTATE_WM Walker direction
2640- intel: Add intel_memory_class_instance
2641- anv: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance
2642- anv: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels
2643- anv: Add basic KMD backend infrastructure
2644- anv: Start to move anv_gem_stubs.c to kmd backend
2645- anv: Remove remaining bits of anv_i915_query()
2646- hasvk: Remove remaining bits of anv_i915_query()
2647- anv: Add gem_close to kmd backend
2648- anv: Add gem_mmap to kmd backend
2649- anv: Move execute_simple_batch() and queue_exec_locked() to kmd backend
2650- intel/common: Move i915 files to i915 folder
2651- iris: Export batch debug functions
2652- iris: Export update_batch_syncobjs()
2653- iris: Export num_fences()
2654- intel: Make gen12 URB space reservation dependent on compute engine presence
2655- intel/blorp: Allocate only necessary amount of VERTEX_BUFFER_STATE
2656- intel: Pull in xe_drm.h
2657- intel: Add Meson parameter to enable Xe KMD support
2658- intel/dev: Add INTEL_KMD_TYPE_XE
2659- intel/dev: Implement Xe functions to fill intel_device_info
2660- intel/dev: Implement Xe functions to handle hwconfig
2661- intel/dev: Query and compute hardware topology for Xe
2662- iris: Convert drm_i915_gem_memory_class_instance to intel_memory_class_instance
2663- iris/bufmgr: Add i915_gem_set_domain()
2664- iris: Use DRM_IOCTL_I915_GEM_CREATE_EXT in all supported kernels
2665- iris: Add initial skeleton of kmd backend
2666- iris: Move iris_bo_madvise() to i915/iris_bufmgr.c
2667- iris: Add iris_bo_set_caching()
2668- intel/common: Implement the Xe functions for intel_engine
2669- intel/common: Implement the Xe functions for intel_gem
2670- iris: Move bo_madvise to kmd backend
2671- iris: Move bo_set_caching to kmd backend
2672- iris: Move iris_bo_busy_gem() to i915/iris_bufmgr.c
2673- iris: Move iris_bo_wait_gem() to i915/iris_bufmgr.c
2674- iris: Don't mark protected bo as reusable
2675- intel/perf: Disable it for Xe KMD
2676- build: Block build of HASVK, Crocus and i915 in non-x86 architectures
2677- iris: Add gem_mmap() to kmd backend
2678- iris: Add batch_check_for_reset() to kmd backend
2679- iris: Move i915 submit_batch() to i915 backend
2680- anv: Implement gem_create for Xe backend
2681- anv: Implement Xe functions to create and destroy VM
2682- anv: Implement gem close and mmap for Xe backend
2683- anv: Add gem VM bind and unbind to backend
2684- anv: Integrate gem vm bind and unbind kmd backend functions
2685- iris: Drop I915_EXEC_FENCE types
2686- iris: Drop usage of i915 EXEC_OBJECT_WRITE
2687- iris: Move iris_bufmgr_init_global_vm() to i915/iris_bufmgr.c and prepare for Xe KMD
2688- anv: Implement Xe version of anv_physical_device_get_parameters()
2689- anv: Properly alloc buffers that will be promoted to framebuffer in Xe KMD
2690- anv: Handle external objects allocation in Xe
2691- iris: Only mark buffer as exported if drmPrimeHandleToFD() succeed
2692- iris: Implement the Xe version of iris_bufmgr_init_global_vm()
2693- iris: Implement the function to destroy VM in Xe
2694- iris: Implement gem_create() in Xe kmd backend
2695- iris: Implement gem_mmap() in Xe kmd backend
2696- iris: Store iris_context's priority
2697- iris: Move to i915/iris_batch.c code to create and replace i915 context
2698- iris: Move to iris_i915_batch.c code to destroy i915 context
2699- intel: Move memory aligment information to intel_device_info
2700- anv: Use intel_device_info memory alignment
2701- intel: Set mem_alignment in Xe kmd
2702- anv: Apply memory alignment requirements in Xe kmd
2703- intel: Add TODO about removal of 2Mb alignment in i915
2704- anv: Replace I915_ENGINE_CLASS_VIDEO by INTEL_ENGINE_CLASS_VIDEO
2705- anv: Create Xe engines
2706- anv: Implement Xe version of check_status()
2707- anv: Handle Xe queue/engine priority
2708- anv: Implement Xe version of execute_simple_batch()
2709- iris: Prepare iris_bufmgr functions for vm bind error paths
2710- iris: Add vm bind and unbind to kmd backend
2711- iris: Implement gem_vm_bind() and gem_vm_unbind() in Xe kmd backend
2712- iris: Ajust gem buffer allocation size in Xe kmd
2713- intel: Sync xe_drm.h
2714- anv: Partialy import drm-uapi/gpu_scheduler.h and use it
2715- anv: Fetch max_context_priority from drm_xe_query_config
2716- intel: Allocate mesh shader URB space before task shader
2717- anv: Move to a function code to clflush batch buffers
2718- anv: Implement Xe version of anv_queue_exec_locked() and queue_exec_trace()
2719- anv: Disable anv_bo_sync_type for Xe kmd
2720- anv: Add assert in functions not supported by Xe kmd
2721- iris: Add BO_ALLOC_SHARED
2722- iris: Handle allocation of exported buffers in Xe kmd
2723- iris: Handle allocation of scanout buffers in Xe
2724- iris: Implement Xe version of bo_madvise() and bo_set_caching()
2725- anv: Fix vm bind of imported buffers
2726- iris: Add function to close gem bos
2727- iris: Handle Xe syncronization with syncobjs
2728- loader: Add Xe KMD support
2729- iris: Fix close of exported bos
2730- iris: Allow shared scanout buffer to be placed in smem as well
2731
2732Juan A. Suarez Romero (8):
2733
2734- v3d/v3dv: define performance counters in common
2735- v3d: cache pipe query results
2736- v3d: include offset as part of streamout target
2737- v3d: implement NV_conditional_render extension
2738- v3d: fix condition for EZ disabling when stencil on
2739- v3d: set depth compare function correctly
2740- v3d: use primitive type to get stream output offset
2741- v3d: apply 1D texture miplevel alignment in arrays
2742
2743Julia Tatz (3):
2744
2745- zink: zink_heap isn't 1-to-1 with memoryTypeIndex
2746- zink: trival renames heap_idx -> memoryTypeIndex
2747- zink: correct sparse bo mem_type_idx placement
2748
2749Juston Li (29):
2750
2751- venus: refactor out vn_feedback_event_cmd_record2
2752- venus: refactor VK_KHR_synchronization2 ext sync fd requirements
2753- venus: require importable external semaphores for WSI
2754- venus: require exportable bit for ext fence sync fd
2755- venus: require exportable/importable bit for ext semaphores sync fd
2756- venus: remove filtering external semaphores for QueueSubmit
2757- venus: drop VkQueueBindSparse
2758- venus: append fence feedback batch
2759- venus: refactor QueueSubmit/QueueSubmit2
2760- venus: vn_queue: align vulkan object variable naming
2761- docs/envvars: add missing mesa disk cache envvars
2762- util/fossilize_db: don't destroy foz on RO load fail
2763- utils/fossilize_db: refactor out loading RO foz dbs
2764- util/fossilize_db: add runtime RO foz db loading via FOZ_DBS_DYNAMIC_LIST
2765- util/fossilize_db: fix macOS inotify build error
2766- util/fossilize_db: add ifdef for inotify header
2767- util/tests/cache_test: Skip Cache.List if not supported
2768- anv: check initial cmd_buffer is chainable
2769- venus: refactor semaphore helper functions
2770- venus: refactor batch submission fixup
2771- venus: add NO_TIMELINE_SEM_FEEDBACK perf option
2772- venus: add timeline semaphore feedback cmds
2773- venus: enable timeline semaphore feedback
2774- venus: add SHADER_DEVICE_ADDRESS_BIT to buffer cache
2775- venus: switch to lazy VkBuffer cache
2776- venus: add VN_DEBUG_CACHE flag
2777- venus: Add VkBuffer cache statistics for debug
2778- venus: shader cache fossilize replay fix
2779- util/disk_cache: use posix_fallocate() for index files
2780
2781Kai Wasserbäch (5):
2782
2783- fix: gallivm: limit usage of LLVMContextSetOpaquePointers() to LLVM 15
2784- fix(FTBFS): gallivm: fix LLVM #include of Triple.h, moved to TargetParser
2785- fix(FTBFS): clover: fix LLVM #include of Triple.h, moved to TargetParser
2786- fix: clover/llvm: replace llvm::None with std::nullopt for LLVM 17+
2787- fix: gallivm: fix LLVM #include of Host.h, moved to TargetParser
2788
2789Kai-Heng Feng (2):
2790
2791- iris: Retry DRM_IOCTL_I915_GEM_EXECBUFFER2 on ENOMEM
2792- Revert "iris: Avoid abort() if kernel can't allocate memory"
2793
2794Karmjit Mahil (28):
2795
2796- pvr: Process set and reset event sub commands.
2797- pvr: Process wait event sub command.
2798- pvr: Add SPM scratch buffer infrastructure.
2799- pvr: Acquire scratch buffer on framebuffer creation.
2800- pvr: Update comment about ZS and MSAA buffers for pvrsrvkm submission.
2801- pvr: Set SPMSCRATCHBUFFER flag.
2802- pvr: Add SPM load usc empty programs
2803- pvr: Upload spm load programs to device.
2804- pvr: Add support for VK_ATTACHMENT_LOAD_OP_LOAD.
2805- pvr: Move descriptor write into pvr_write_descriptor_set()
2806- pvr: Add support to copy descriptors on vkUpdateDescriptorSets()
2807- pvr: Handle VK_QUERY_RESULT_WAIT_BIT.
2808- pvr: Store enum pvr_stage_allocation instead of VkShaderStageFlags
2809- pvr: Put old descriptor set approach behind a hardcoding check
2810- pvr: Change last_DMA to last_dma
2811- pvr: Write descriptor set addrs table dev addr into shareds
2812- pvr: Add PVR_SELECT() helper macro
2813- pvr: Add push consts support to descriptor program.
2814- pvr: Add support for dynamic buffers descriptors
2815- pvr: Add support for blend constants.
2816- pvr: Move PBE START_POS into csb enum helpers header
2817- pvr: Setup SPM EOT state
2818- pvr: Remove unused msaa_mode field
2819- pvr: Remove component_alignment
2820- pvr: Setup SPM background object
2821- pvr: Don't advertise currently unsupported features
2822- pvr: Advertise STORAGE_IMAGE_BIT for B10G11R11_UFLOAT_PACK32
2823- pvr: Don't advertise S8_UINT support
2824
2825Karol Herbst (44):
2826
2827- rusticl: fix build error with valgrind being enabled
2828- rusticl/util: extract offset_of macro
2829- rusticl/icd: Make it work in case Rustc shuffles struct around
2830- rusticl/kernel: fix clGetKernelInfo CL_KERNEL_ATTRIBUTES for non source programs
2831- rusticl/program: enable spirv
2832- llvmpipe/ci: increase deqp-runner timeout
2833- rusticl/device: fix some device limits
2834- rusticl/device: limit CL_DEVICE_MAX_CONSTANT_ARGS
2835- rusticl: no compute only
2836- rusticl: allocate printf buffer as staging
2837- nir: Skip samplers and textures in lower_explicit_io
2838- nir/deref: don't replace casts with deref_struct if we'd lose the stride
2839- ci/zink: move threading tests to flakes
2840- rusticl/kernel: Images arg sizes also have to match the host pointer size
2841- gallivm: fix lp_vec_add_offset_ptr for 32 bit builds
2842- nvc0: enable fp helper invocation memory loads on Turing+
2843- nir: track existence of variable shared memory
2844- rusticl/kernel: set has_variable_shared_mem on the nir
2845- gallium: add get_compute_state_info
2846- lp: implement get_compute_state_info
2847- iris: implement get_compute_state_info
2848- nv50: implement get_compute_state_info
2849- nvc0: implement get_compute_state_info
2850- panfrost: move max_thread_count and take reg_count into account
2851- panfrost: implement get_compute_state_info
2852- rusticl/kernel: make use of cso info
2853- radeonsi: implement get_compute_state_info
2854- radeonsi: use default float mode for CL
2855- rusticl: enable radeonsi
2856- rusticl: split platform into core and api parts
2857- rusticl/platform: rename _cl_platform_id to Platform
2858- rusticl/platform: move getter into the type
2859- rusticl/platform: move device initialization to the platform
2860- rusticl/program: allow dumping compilation logs through RUSTICL_DEBUG
2861- rusticl/program: make IL programs look closer to CLC ones
2862- clc: add clc_validate_spirv
2863- rusticl/program: validate the SPIR-V when created from IL
2864- rusticl/program: extract common code of compile and build
2865- rusticl/program: rework source code tracking
2866- rusticl/event: drop work item before updating status
2867- radeonsi: lower mul_high
2868- ac/llvm: support shifts on 16 bit vec2
2869- rusticl: don't set size_t-is-usize for >=bindgen-0.65
2870- nvc0: do not randomly emit fences.
2871
2872Kenneth Graunke (37):
2873
2874- intel/blorp: Lower base_workgroup_id to zero
2875- intel/compiler: Move atomic op translation into emit_*_atomic()
2876- intel/compiler: Use LSC opcode enum rather than legacy BRW_AOPs
2877- intel/compiler: Add an lsc_op_num_data_values() helper
2878- intel/compiler: Eliminate SHADER_OPCODE_UNTYPED_ATOMIC_FLOAT
2879- intel/compiler: Drop redundant 32-bit expansion for shared float atomics
2880- intel/compiler: Delete fs_visitor::nir_emit_{ssbo,shared}_atomic_float()
2881- intel/compiler: Combine nir_emit_{ssbo,shared}_atomic into one helper
2882- intel/compiler: Delete all the A64 atomic variants for type sizes
2883- intel/compiler: Drop dest checking in atomic code
2884- intel/compiler: Use more symbolic source names in components_read()
2885- anv: Add missing untyped data port flush on PIPELINE_SELECT
2886- iris: Add missing untyped data port flush on PIPELINE_SELECT
2887- loader: Add infrastructure for tracking active CRTC resources
2888- egl: Rewrite eglGetMscRateANGLE to avoid probes and handle multi-monitor
2889- iris: Perform load_constant address math in 32-bit rather than 64-bit
2890- anv: Perform load_constant address math in 32-bit rather than 64-bit
2891- anv: Make a batch decoder for each queue family
2892- nir: Print divergence information for registers as well as SSA defs
2893- nir: Fix merge_set_dump() to compile again
2894- nir: Fix typos in the from-SSA pass comments
2895- intel: Use common helpers for TCS passthrough shaders
2896- intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes
2897- intel/fs: Add builder helpers for F32TO16/F16TO32 that work on Gfx7.x
2898- intel/fs: Delete a TODO about using brw_F32TO16.
2899- intel/fs: Use new F16TO32 helpers for unpack_half_split_* opcodes
2900- Revert "intel/fs: Fix inferred_sync_pipe for F16TO32 opcodes"
2901- intel/fs: Use F32TO16/F16TO32 helpers in fquantize16 handling
2902- intel/fs: Move packHalf2x16 handling to lower_pack()
2903- intel/eu: Simplify brw_F32TO16 and brw_F16TO32
2904- intel/vec4: Retype texture/sampler indexes to UD
2905- intel/fs: Make bld.F16TO32 actually emit F16TO32 not F32TO16
2906- i965/vec4: Implement uclz in the vec4 backend
2907- st/mesa, iris: Add optional CPU-based ASTC void extent denorm flushing
2908- intel/compiler: Use nir_dest_bit_size() for ballot bit size check
2909- iris: Extend resource creation helpers to allow for explicit strides
2910- iris: Hack around gbm_gralloc stride restrictions
2911
2912Konrad Dybcio (3):
2913
2914- freedreno/registers: Add RBBM_GPR0_CNTL for non-GMU operation
2915- freedreno: Add A2xx perf counter reg values
2916- freedreno: Add A2xx REG_A2XX_RBBM_PM_OVERRIDE2 bitfields
2917
2918Konstantin Seurer (78):
2919
2920- vulkan: Track the nullDescriptor feature
2921- radv: Add a helper for finding memory indices
2922- radv: Create a null TLAS as meta state
2923- radv: Use the null accel struct instead of emitting 0
2924- radv/rt: Get rid of accel struct null checks
2925- radv: Advertise rt pipelines for Control (DX12)
2926- radv/bvh/meson: Add the option to set defines
2927- radv/bvh: Add a define for extended SAH
2928- radv: Add a shader variant for PLOC with extended SAH
2929- radv: Wrap internal build type inside a build_config struct
2930- radv: Enable extended SAH for shallow BVHs
2931- radv: Merge the leaf and internal converter
2932- radv: Improve the BVH size estimation
2933- radv: Fix creating accel structs with unbound buffers
2934- radv: Work around shader_call_data variables in raygen shaders
2935- radv/rq: Use 16 stack entries if there is only one ray query
2936- radv/llvm: Use the shader names as module name
2937- ac/llvm: Fix validation error with global io
2938- radv: Scalarize global IO with LLVM enabled
2939- radv: Make radv_compute_dispatch non-static
2940- radv: Implement ordered compute dispatches
2941- radv: Use an ordered dispatch for BVH encoding
2942- radv: Remove radv_indirect_unaligned_dispatch
2943- vulkan,nir: Refactor ycbcr conversion state into a struct
2944- radv: Use common ycbcr conversion lowering
2945- radv/rra: Find copy memory index when initializing the trace state
2946- radv/rra: Hide deferred accel struct data destruction behind an env var
2947- radv: Hash VK_PIPELINE_CREATE_RAY_TRACING_NO_NULL_* flags
2948- radv: Clean up dynamic RT stack allocation
2949- ac/llvm: Implement bvh64_intersect_ray_amd
2950- radv: Make accel struct meta state initialization thread safe
2951- radv: Force ACO for BVH build shaders
2952- radv: Pre-compile BVH build shaders if there is a cache
2953- radv: Advertise ray query support with LLVM
2954- radv/rt: Skip instances after loading the entire node
2955- vulkan: Add vk_acceleration_structure
2956- radv: Use vk_acceleration_structure
2957- anv: Use vk_acceleration_structure
2958- radv/bvh/encode: Use the node type for identifying internal nodes
2959- radv/bvh: Replace is_final_tree with bvh_offset
2960- radv/bvh/encode: Move bvh_offset NULL check to the top of the loop
2961- radv/bvh/encode: Introduce is_root_node
2962- radv/bvh/encoder: Move dst_node initialization into the loop
2963- radv: Add a build config for compact builds
2964- radv/bvh: Implement compact encoding
2965- radv: Use compact encoding
2966- radv: Move the geometry infos before the BVH
2967- radv/bvh: Move the size header field up
2968- radv/bvh: Add a shader for filling the header
2969- radv: Use indirect header filling for compact builds
2970- nir: Add cull_mask_and_flags_amd intrinsic
2971- radv/rt: Merge cull_mask and flags
2972- radv/rt: Pre shift cull_mask
2973- radv: Move header and geometry info init into separate functions
2974- radv: Only init geometry infos if RRA is enabled
2975- radv/rt: Use ushr for extracting the cull mask
2976- radv/rt: Fix updating stack_size if the shader uses scratch
2977- radv/rt: Use vk_pipeline_hash_shader_stage for RT stages
2978- vulkan: Add vk_shader_module_init
2979- radv/rt: Properly handle pNext of pipeline library stages
2980- radv/sqtt: Skip dumping pipeline libraries
2981- radv: Fix loading stack_size from the cache
2982- radv: Fix inserting stack_size into the cache
2983- radv/rt: Handle load_constant instructions when inlining shaders
2984- nir/lower_shader_calls: Remat derefs before lowering resumes
2985- radv/rt: Refactor rq_load lowering
2986- radv/rq: Rematerialize inv_dir before proceed
2987- radv: Set user SGPR locations when declaring args
2988- radv: Stop counting user SGPRS separately
2989- radv/ci: Update ray tracing pipeline fail/skip lists
2990- radv: Add radv_shader_type to fix gs_copy and trap handler handling
2991- radv: Remove some dead radv_shader_args setup
2992- aco: Remove is_gs_copy_shader
2993- radv: Remove has_previous_stage
2994- radv: Pack and encode geometry id and flags on the CPU
2995- radv/bvh: Remove calculate_node_bounds
2996- radv: Remove radv_bvh_aabb_node::aabb
2997- nir/lower_fp16_casts: Fix SSA dominance
2998
2999Kurt Kartaltepe (1):
3000
3001- drirc: Set limit_trig_input_range option for Nier games
3002
3003Lang Yu (1):
3004
3005- amd/common: fix a typo
3006
3007Lepton Wu (1):
3008
3009- egl/dri2: Use primary device in EGL device platform for kms_swrast
3010
3011Lina Versace (8):
3012
3013- util/glsl2spirv: Fix build with Python 3.6
3014- docs: Lower Python requirement to 3.6
3015- mailmap: Lina is Chad's new name
3016- mailmap: Add Lina's new google.com address
3017- venus: Update protocol for VK_EXT_memory_budget
3018- venus: Delete vn_renderer_info::has_cache_management
3019- venus: Refactor vn_physical_device_init_memory_properties
3020- venus: Enable VK_EXT_memory_budget
3021
3022Lionel Landwerlin (151):
3023
3024- pps: print out message when we get the first counters
3025- anv: record secondaries' traces into primaries
3026- intel/ds: track secondary cmdbuffers in perfetto
3027- intel/ds: move event_id access to perfetto lambda
3028- util/u_trace: add support for variable length trace points
3029- vulkan/debug_utils: copy debug util labels
3030- anv: add support for command buffer tagging in traces
3031- intel/ds: add INTEL_GPU_TRACEPOINT envvar to toggle tracepoints
3032- intel/ds: remove unused trace point
3033- intel/utrace: document tracepoints
3034- Revert "ci: build hasvk if we're building anv"
3035- intel: use a shared UUID with other drivers
3036- nir/divergence: add missing RT intrinsinc handling
3037- anv: fix generated indirect draw shader stats checks
3038- nir/lower_io: fix bounds checking for 64bit_bounded_global
3039- anv: fix preemption enable emission in gpu_memcpy
3040- intel/fs: avoid cmod optimization on instruction with different write_mask
3041- intel/decoder: print out compute push constants
3042- intel/common: add a INTEL_DECODE variable to parameter decoder at runtime
3043- vulkan/wsi/wayland: improve same gpu detection
3044- intel/fs: drop FS_OPCODE_UNIFORM_PULL_CONSTANT_LOAD_GFX7
3045- anv: fix null descriptors
3046- docs: list anv in EXT_extended_dynamic_state3 support
3047- intel: add missing PS restriction on BDW+
3048- anv: expose EXT_load_store_op_none
3049- intel/fs: make alpha_to_coverage a tristate
3050- anv: rename RT pipeline function helper
3051- anv: rename a few internal functions to highlight gfx use
3052- intel/ds: track end of pipe bits
3053- anv: use malloc for host only descriptor sets
3054- anv: put the video extension behind a variable
3055- intel/fs: fix mesh indirect movs
3056- intel/dev: add a default urb value for intel_stub_gpu on dg2
3057- anv/hasvk: handle a SAMPLED_READ/STORAGE_READ access flags
3058- anv: remove copied information from runtime graphics state
3059- anv: move 3DSTATE_VERTEX_ELEMENT emission to dynamic path
3060- anv: implement VK_EXT_vertex_input_dynamic_state
3061- intel/fs: bound subgroup invocation read to dispatch size
3062- hasvk: fix KHR_shader_float_controls reporting
3063- intel/perf: also add the oa timestamp shift on MTL
3064- anv: move debug submit to helper and call it on execbuf failure
3065- anv: track vram only BOs to print things out on ENOMEM execbuf
3066- anv: fix vma heap memory leak
3067- anv: fix invalid masking of 48bit address
3068- anv: remove assert typed write support when using NULL surface
3069- vulkan/runtime: store parameters of VK_EXT_sliced_view_of_3d
3070- anv: fixup Wa_16011107343 for Gfx12 only
3071- iris: fix Wa_16011107343 for Gfx12
3072- anv: remove more Gfx7 code
3073- genxml: Fix STATE_BASE_ADDRESS::BindlessSurfaceStateSize field size
3074- genxml: fix border color offset field on Gfx12+
3075- anv/hasvk: speed up null image/view descriptor writes
3076- anv: fix scratch buffer reloc in 3DSTATE_HS
3077- anv: fixup condition for Wa_14016118574
3078- anv: pull Wa_14016118574 out of some loop not changing state
3079- util/glsl2spirv: add support for include directive
3080- anv: fix incorrect parameter
3081- anv: correctly reset generation address on command buffer reset
3082- anv: fix generated forward jump with more than 67M draws
3083- anv: remove copied code from generation shader
3084- anv: remove BTI related flush in generation shaders
3085- anv: correctly program 3DSTATE_SF in generation shaders
3086- anv: limit push constant dirtyness with generation shaders
3087- anv: remove pre hasvk split assert
3088- anv: remove commented code
3089- anv: fix 3DSTATE_PS emission in generation shaders
3090- anv: fix indirect draws VF cache tracking of index buffer
3091- anv: make sure mi_memcpy lands before push constant loads
3092- anv: remove MI_NOOPs at the end of the generation batch
3093- anv: use a single generation shader for indirect draws
3094- anv: rename generated draws for Gfx11
3095- anv: use 64bit int support in generation shaders
3096- anv: pack more data into generated draws input
3097- anv: move common shader code into header
3098- anv: use a list of generated shaders
3099- anv: remove unused item_count parameter
3100- anv: add gfx9 generated draw support
3101- blorp: add dependency on idep_intel_dev
3102- vulkan/runtime: only consider slice info with 3D image views
3103- anv: VK_EXT_image_sliced_view_of_3d
3104- nir: fix nir_ishl_imm
3105- anv: enable VK_EXT_pipeline_library_group_handles
3106- anv/iris: report counter symbols with debug option
3107- intel/fs: report max register pressure in shader stats
3108- anv: report max register pressure in pipeline properties
3109- anv: force MEDIA_INTERFACE_DESCRIPTOR_LOAD reemit after 3D->GPGPU switch
3110- radv: use 1ull for alignment computations
3111- util: allow align64() to do alignments >= 4Gb
3112- docs: fix invalid link
3113- iris: trace frames with u_trace
3114- anv: export EXT_pipeline_library_group_handles only with RT
3115- docs: update Anv features support
3116- anv: more formats for acceleration structure vertices
3117- intel/fs: don't SEND messages as partial writes
3118- intel/fs: fix nir_opt_peephole_ffma max vec assumption
3119- intel/fs: fixup sources number from opt_algebraic
3120- intel/fs: add MOV source count validation
3121- intel/fs: prevent large vector ops generated by peephole_ffma
3122- intel/fs: fix subgroup invocation read bounds checking
3123- vulkan/wsi: add a headless swapchain implementation/option
3124- intel/compiler: report max dispatch width statistic
3125- anv: report shader max dispatch width in pipeline props
3126- intel/devinfo: add an option to pick platform to print
3127- intel/devinfo: printout URB entries
3128- intel/dev: use generated WA helpers for Wa_22012575642
3129- intel/devinfo: dedicated entries for XeHP
3130- intel/devinfo: initialize pci_device_id with from_pci_id()
3131- intel/dev: fold Gfx12 URB entries in Gfx12 HW info
3132- util/u_trace: move needs_cs_param option to tracepoints
3133- vulkan/runtime: also copy strings on queue debug utils
3134- intel/ds: rename frame timeline row to queue
3135- anv: fix incorrect utrace bo release
3136- anv: fixup locking for utrace submission increments
3137- anv: rename anv_utrace_flush_copy in anv_utrace_submit
3138- anv: add utrace support for queue debug utils
3139- anv: implement recommended flush/wait of AUX-TT invalidation
3140- iris: implement recommended flush/wait of AUX-TT invalidation
3141- anv: hash immutable sampler conversion data not pointers
3142- anv: compute the largest GRL kernel scratch size
3143- anv: move queue check helpers to anv_private
3144- anv: take care of maxStorageBufferRange being uint32_t
3145- isl: update max buffer size for SKL+
3146- intel/dev: set a default valid kmd_type
3147- intel/perf: fix OA format selection on MTL
3148- intel/fs: run VGRF compaction just before max live register accounting
3149- intel/fs: don't consider fixup_nomask_control_flow SENDs predicate
3150- intel/fs: UNDEF fixup_nomask_control_flow temp register
3151- intel/fs: copy instruction sources in logical send lowering
3152- intel/fs: factor out lsc surface descriptor settings
3153- nir: reuse nir_component_mask() where it makes sense
3154- nir: add 2 new intel intrinsics for uniform ssbo/shared loads
3155- intel/fs: optimize uniform SSBO & shared loads
3156- intel/fs: also allow vec8+ vectorization of load_global_const_block_intel
3157- anv: pass steam output as argument for anv_dump_pipe_bits
3158- anv: replace query flush before gpu copy by semaphore wait
3159- anv: fixup streamout write barriers
3160- intel/fs: use nomask for setting cr0 for float controls
3161- anv: exclude performance queries from blorp clears
3162- intel/ds: add a new timeline row for frames
3163- anv: add utrace tracking of frame boundaries
3164- vulkan/runtime: discard unused graphics stages in libraries
3165- intel/vec4: force exec_all on float control instruction
3166- vulkan/overlay: deal with unknown pNext structures
3167- isl: don't set inconsistent fields for depth when using stencil only
3168- isl: fix a number of errors on storage format support on Gfx9/12.5
3169- anv: rework Wa_14017076903 to only apply with occlusion queries
3170- nir/divergence: add missing load_global_constant_* intrinsics
3171- anv: fix anv_nir_lower_ubo_loads pass
3172- intel/fs: fix per vertex input clamping
3173- intel/compiler: make uses_pos_offset a tri-state
3174- intel/fs: fix scheduling of HALT instructions
3175
3176Liviu Prodea (1):
3177
3178- meson: Ignore unused variables when assertions are disabled
3179
3180Lone_Wolf (3):
3181
3182- compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+
3183- clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+
3184- clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+
3185
3186Luc Ma (2):
3187
3188- xlib: fix glXDestroyContext in Gallium frontends
3189- meson: keep Mako version checking in accord with build msg
3190
3191Lucas Fryzek (11):
3192
3193- crocus: Add support for \`get_screen_fd`
3194- tegra: Add support for \`get_screen_fd`
3195- nouveau: Add support for \`get_screen_fd`
3196- zink: Add support for \`get_screen_fd`
3197- iris: Add support for \`get_screen_fd`
3198- i915: Add support for \`get_screen_fd`
3199- svga: Add support for \`get_screen_fd`
3200- virgl: Add support for \`get_screen_fd`
3201- r300/r600/radeon_si: Add support for \`get_screen_fd`
3202- d3d12/llvmpipe/softpipe: Add support for \`get_screen_fd`
3203- gallium: Modify default path for DMABUF to use DRM
3204
3205Lucas Stach (7):
3206
3207- etnaviv: don't drop TS capability on GPUs with MMUv2
3208- etnaviv: drm: fix BO array leaks
3209- etnaviv: free pm queries dynarray on screen destroy
3210- etnaviv: drm: fix check if BO is on a deferred destroy list
3211- etnaviv: fix double scanout import of multiplanar resources
3212- etnaviv: flush VS texture cache when texture data is changed
3213- etnaviv: fix texture barrier implementation
3214
3215Luigi Santivetti (4):
3216
3217- pvr: fix uses_tile_buffers in clear color attachment
3218- pvr: add support for tile buffer output clear
3219- pvr: add padding bytes when allocating buffer memory
3220- pvr: fix segfault in dEQP-VK.ycbcr.query.*
3221
3222Luna Nova (3):
3223
3224- device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1)
3225- device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior
3226- device_select_layer: pick a default device before applying DRI_PRIME
3227
3228Lynne (1):
3229
3230- aco_validate: allow for wave32 in p_dual_src_export_gfx11
3231
3232M Henning (1):
3233
3234- nouveau/codegen: Check nir_dest_num_components
3235
3236Maarten Lankhorst (1):
3237
3238- iris: Place scanout buffers only into lmem for discrete GPUs
3239
3240Marcin Ślusarz (23):
3241
3242- intel/compiler: fix generation of vec8/vec16 alu instruction
3243- intel/compiler/mesh: handle const data in task & mesh programs
3244- intel/compiler: fine-grained control of dispatch widths
3245- nir: add nir_mod_analysis & its tests
3246- intel/compiler/mesh: optimize indirect writes
3247- intel/compiler/mesh: support longer write messages
3248- intel/compiler/mesh: remove dead code path supporting >4 dword writes
3249- intel/compiler/mesh: use U888X packed index format
3250- anv: bump ANV_MAX_QUEUE_FAMILIES
3251- intel/compiler: replace gl_Layer & gl_ViewportIndex by 0 in fs if ms doesn't write it
3252- anv: fix how unset gl_Viewport & gl_Layer are handled in mesh case
3253- intel/compiler/mesh: use slice id of task urb handles in mesh shaders
3254- anv: enable task redistribution
3255- intel/compiler/mesh: apply URB payload mask once per program
3256- intel/compiler/mesh: follow the type of offset variable
3257- intel/compiler: remove unused field from fs_thread_payload
3258- anv: halve the push constants space in mesh pipelines
3259- crocus/meson: add back dependency on libintel_dev
3260- anv,hasvk: remove stale comments
3261- anv: call nir_shader_gather_info early
3262- anv: work around for per-prim attributes corruption
3263- intel/compiler: compactify locations of mesh outputs
3264- anv: ignore structure types handled in vk_device_memory_create
3265
3266Marek Olšák (212):
3267
3268- glthread: fix an upload buffer leak
3269- util: fix util_is_vbo_upload_ratio_too_large
3270- mesa: allow GL_UNSIGNED_INT64_ARB as vertex format for ARB_bindless_texture
3271- glapi: autogenerate function parameters with no space between * and variable
3272- glthread: handle GL_*_ARRAY in glEnable/Disable
3273- glthread: set GL_OUT_OF_MEMORY if we fail to upload indices
3274- glthread: set GL_OUT_OF_MEMORY if we fail to upload vertices
3275- glthread: execute glMultiDrawArrays(draw_count < 0) asynchronously
3276- glthread: change multi_draw_elements_async() to never fail due to large size
3277- glthread: do vertex uploads if an index buffer is present for glDrawElements
3278- mesa: move gl_vertex_format_user definition into glthread.h
3279- glthread: pack and name the type of glthread_vao::Attrib
3280- glthread: make marshal functions for glBegin/End attribs non-static
3281- glthread: remove the vbo_upload_ratio_too_large fallback for glMultiDrawElements
3282- glthread: do vertex uploads if an index buffer is present for MultiDrawElements
3283- glthread: disallow glthread if buffer uploads are unsupported
3284- ac/llvm: run the LLVM sinking pass because LLVM will stop running it
3285- ac/llvm: run the IPSCCP pass
3286- ac/llvm: remove llvm:: now that we use "using namespace llvm"
3287- amd: update amdgpu_drm.h
3288- ac/gpu_info: add PCIe info
3289- radeonsi/ci: update gfx10.3 results
3290- radeonsi/ci: add gfx1100 results
3291- radeonsi: fix RB+ blending with sRGB formats
3292- radeonsi/gfx11: unset SAMPLE_MASK_TRACKER_WATERMARK to fix hangs
3293- amd: split GFX1103 into GFX1103_R1 and GFX1103_R2
3294- amd: fix tile_swizzle on gfx11 - should be shifted by 10 bits, not 8
3295- amd: update SX_BLEND_OPT_EPSILON.MRT0_EPSILON enum definitions
3296- amd: update shadowed register tables for gfx11
3297- amd: improve RB+ blending precision
3298- radeonsi: implement RB+ depth-only rendering for better perf
3299- radeonsi/gfx11: remove the INST_PREF_SIZE workaround
3300- radeonsi/gfx11: add a comment why we use PRIM_GRP_SIZE <= 252
3301- radeonsi/gfx11: adjust ACCUM_* fields for tessellation
3302- radeonsi/gfx11: fix blend->cb_target_mask dependency for shader keys
3303- radeonsi/gfx11: move the PIXEL_PIPE_STAT_CONTROL event into the GFX preambles
3304- radeonsi/gfx11: use new packet EVENT_WRITE_ZPASS
3305- radeonsi: deduplicate VS/TES/GS update code
3306- radeonsi/gfx11: always set MSAA_NUM_SAMPLES=0 for DCC_DECOMPRESS
3307- radeonsi: merge si_ps_key_update_framebuffer_blend & .._update_blend_rasterizer
3308- radeonsi: determine alpha_to_coverage robustly in si_update_framebuffer_blend_rasterizer
3309- radeonsi: never set INTERPOLATE_COMP_Z
3310- amd: unify and tune the attribute ring size for gfx11
3311- amd: change pbb_max_alloc_count for gfx11
3312- amd: update the cache size for gfx1103_r1
3313- amd: update late_alloc_wave64 for gfx11
3314- amd: sort and re-indent packet definitions
3315- amd: fix typo in shadowed uconfig registers on gfx11
3316- amd: document OOB behavior on gfx11
3317- amd/registers: remove confusing definitions from gfx10-rsrc.json
3318- radeonsi: set NEVER as the depth compare func if depth compare is disabled
3319- amd/llvm: fix LLVM 15 & 16 crashes in SelectionDAG.cpp
3320- radeonsi: call ac_init_llvm_once before any util_queue initialization
3321- radeonsi: set sampler COMPAT_MODE in the corresponding branch
3322- amd/ci: update sanctuary trace sha1
3323- radeonsi/gfx11: don't add mrt0 export for alpha-to-coverage if mrtz is present
3324- radeonsi/gfx11: don't add alpha to mrt0 format for A2C if exporting via mrtz
3325- amd: define new SET_*_REG_PAIRS packets
3326- radeonsi: clean up si_set_mutable_tex_desc_fields
3327- amd/surface: clean up is_dcc_supported_by_L2
3328- amd,util: fix how lod bias is converted to fixed-point
3329- amd: don't hardcode real VGPR allocation granularity on gfx10.3 and gfx11
3330- glthread: track the current element array buffer in the Core profile too
3331- mesa: ignore indices[i] if count[i] == 0 for MultiDrawElements
3332- glthread: initialize indices[i] for no-op MultiDrawElements
3333- glthread: upload non-BO indices in the core profile to fix GStreamer
3334- glthread: add a heuristic to stop locking global mutexes with multiple contexts
3335- glthread: ignore non-VBO vertex arrays with NULL data pointers
3336- Revert "ci/zink: Disable Amnesia trace until the linked issue gets fixed."
3337- glthread: rewrite glMultiDrawArrays to never fail to upload vertices
3338- glthread: change glMultiDrawElements to execute draw_count < 0 asynchronously
3339- glthread: don't execute glDraw code if we're inside glBegin/End
3340- glthread: don't pass index bounds to the driver for async calls
3341- glthread: move some draw call parameters closer to their use
3342- glthread: don't bind/unbind uploaded indexbuf, pass it to glDraw directly
3343- glthread: don't bind/unbind uploaded indexbuf, pass it to glMultiDraw directly
3344- glthread: track vertex formats for all attributes
3345- glthread: add a vertex upload path that unrolls indices for glDrawElements
3346- glthread: reorder draw code a little
3347- glthread: add ctx->GLThread.draw_always_async to simplify draw checking
3348- glthread: remove goto statements and add unlikely() into draw functions
3349- glthread: inline draw functions that have only one use
3350- glthread: don't execute Draw and BufferSubData calls if the context is lost
3351- glthread: handle non-VBO uploads for glMultiModeDraw{Arrays,Elements}IBM
3352- glthread: add API to allow passing DrawID from glthread to mesa
3353- glthread: convert (Multi)DrawIndirect into direct if user buffers are present
3354- glthread: remove unnecessary debug code
3355- glthread: don't free glthread for GL_DEBUG_OUTPUT_SYNCHRONOUS, only disable it
3356- glthread: don't restore non-VBO vertex arrays after all draws
3357- Revert "radeonsi/ci: Update stoney test expectations"
3358- radeonsi: fix COMPAT_MODE on gfx8-9
3359- amd: fix LOD_BIAS on gfx6-9 and adjust the lod bias CAP
3360- amd: add missing gfx11 register definitions
3361- amd: bump AMD_MAX_SE and change the CU mask type to 16 bits
3362- radeonsi/gfx11: fix the CU_EN clear mask for RSRC4_GS
3363- radeonsi/gfx11: don't set non-existent VGT_STRMOUT_BUFFER_CONFIG
3364- radeonsi/gfx11: set CB_COLORi_INFO.MAX_COMP_FRAG on GFX1103_R2
3365- radeonsi: move a few DB_SHADER_CONTROL states into si_shader_ps
3366- radeonsi: change si_shader::ctx_reg to a nameless union for better readability
3367- radeonsi: remove no-op setting of THDS_PER_SUBGRP
3368- radeonsi: use SPI_SHADER_USER_DATA_HS_0 definition instead of LS_0
3369- radeonsi: set PA_SU_VTX_CNTL consecutively with PA_CL_GB_VERT_CLIP_ADJ
3370- radeonsi/gfx11: ignore alpha_is_on_msb because the hw ignores it
3371- radeonsi: replace si_screen::has_out_of_order_rast with the radeon_info field
3372- radeonsi: disable Smart Access Memory because CPU access has large overhead
3373- amd,radeonsi: remove unused LLVM functions
3374- amd/registers: unify VRS combiner definition names between gfx103 and gfx11
3375- amd: replace SI_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN
3376- radeonsi: remove returns from si_emit_global_shader_pointers
3377- radeonsi: reformat emit_cb_render_state, create_blend_state, create_rs_state
3378- radeonsi: remove a gfx11 check in si_shader_gs (legacy GS)
3379- radeonsi: remove unused VS_STATE_LS_OUT_PATCH_SIZE
3380- radeonsi: always add 1 to lshs_vertex_stride now that LS_OUT_PATCH_SIZE is gone
3381- radeonsi: correct and clean up obsolete vs_state_bits comments
3382- radeonsi: rename esgs_itemsize -> esgs_vertex_stride
3383- amd: query the per-SIMD VGPR counts from the kernel, don't hardcode them
3384- radeonsi: don't clamp z_samples to fix Unreal Tournament 99
3385- amd/registers: only define SPI and COMPUTE registers in the 0xB000 range
3386- radeonsi: reorganize emit_db_render_state and simplify VRS code
3387- radeonsi: reorganize si_initialize_color_surface for better readability
3388- radeonsi: reorganize si_init_depth_surface for better readability
3389- radeonsi: don't set PACKET_TO_ONE_PA for line stippling
3390- radeonsi/gfx11: change the default of COMPUTE_DISPATCH_INTERLEAVE to 256
3391- amd: implement conformant TRUNC_COORD behavior for gfx11
3392- amd/gpu_info: add a workaround for SI_FORCE_FAMILY=gfx1100
3393- nir,amd: add and use nir_intrinsic_load_esgs_vertex_stride_amd
3394- nir: lower to fragment_mask_fetch/load_amd with EQAA correctly
3395- glthread: fix a perf regression due to draw_always_async flag, fix DrawIndirect
3396- mesa: fix glPopClientAttrib with fixed-func VP and zero-stride varyings
3397- mesa: remove a redundant call to _mesa_update_edgeflag_state_vao
3398- mesa: initialize VertexProgram._VaryingInputs before the first use
3399- amd: update amdgpu_drm.h
3400- amd,radeonsi: change enabled_rb_mask to 64 bits
3401- amd: query cache sizes from the kernel
3402- ac/nir: don't use load_esgs_vertex_stride_amd on gfx6-8
3403- amd: massively simplify how info->spi_cu_en is applied
3404- amd/rtld: allow 64K LDS for all shader stages except for gfx6
3405- radeonsi/ci: update flakes and gfx8-polaris11 results
3406- radeonsi: remove Smart Access Memory because CPU access has large overhead
3407- radeonsi: reorganize si_emit_framebuffer_state for better readability
3408- radeonsi: don't merge SET_* packets that have a different index in si_pm4_state
3409- radeonsi: reindent code in si_state_binning.c
3410- radeonsi: add si_pm4_set_reg_va to simplify setting reg_va_low_idx for RGP
3411- radeonsi: check the pm4.reg_va_low_idx assertion unconditionally
3412- radeonsi: simplify encoding VGPRS and SGPRS
3413- radeonsi: assume shader is never NULL in si_emit_shader_*
3414- nir: return progress from nir_lower_io_to_scalar
3415- nir: skip nir_op_unpack_32_4x8 in nir_lower_alu_width
3416- ac/nir: add ac_nir_lower_subdword_loads to lower 8/16-bit loads to 32 bits
3417- aco: implement nir_op_unpack_32_4x8
3418- ac/llvm: implement nir_op_unpack_32_4x8
3419- amd: lower subdword UBO loads in NIR
3420- amd: lower multi-component subdword SSBO loads in NIR
3421- lavapipe/ci: add a new flake
3422- amd: add nir_intrinsic_xfb_counter_sub_amd and fix overflowed streamout offsets
3423- amd/llvm,radeonsi/gfx11: switch to using GDS_STRMOUT registers
3424- radeonsi/gfx11: only allocate GDS OA for streamout, GDS memory is not needed
3425- radeonsi: emulate VGT_ESGS_RING_ITEMSIZE in the shader on gfx9-11
3426- radeonsi: merge si_emit_initial_compute_regs with si_init_cs_preamble_state
3427- radeonsi: separate nir_texop_descriptor_amd lowering
3428- radeonsi: lower nir_texop_sampler_descriptor_amd
3429- radeonsi: set pm4.atom.emit in si_get_shader_pm4_state
3430- radeonsi: reindent si_shader_ls, si_shader_es, si_shader_gs, si_shader_vs
3431- radeonsi: reorganize si_shader_hs
3432- radeonsi: reorganize si_shader_ngg
3433- radeonsi: reorganize si_shader_ps
3434- radeonsi: other cosmetic changes in si_state_shaders.cpp
3435- radeonsi: allow using 64K LDS for NGG to allow larger workgroups
3436- radeonsi: increase NGG workgroup size to 256 for VS/TES with streamout and GS
3437- glapi: move files specific to shared-glapi into the shared-glapi subdirectory
3438- glapi: inline the meson list files_mapi_util
3439- mesa: move ctx->Table -> ctx->Dispatch.Table except Client & MarshalExec
3440- mesa: rename CurrentClientDispatch to GLApi
3441- mesa: put dispatch table initialization into one place
3442- glthread: qualify the \*cmd unmarshal parameter with restrict
3443- vbo: fix current attribs not updating gallium vertex elements
3444- radeonsi: remove unused TCS/TES SGPR fields
3445- radeonsi: dump shader stats only if dumping asm shaders
3446- radeonsi: replace nonir,noir,noasm,preoptir options with new reworked options
3447- radeonsi: remove duplicated gfx11 check in si_msaa_resolve_blit_via_CB
3448- radeonsi: rework MSAA resolve averaging to exploit instruction-level parallelism
3449- radeonsi: add AMD_DEBUG=nowcstream to enable caching for stream_uploader
3450- radeonsi: don't print the base non-view texture format for AMD_TEST=computeblit
3451- radeonsi: fix AMD_TEST=computeblit being rejected on gfx < 11
3452- radeonsi: don't convert to fp16 in the compute blit if not testing
3453- radeonsi: don't use fp16_rtz for FP formats in the compute blit
3454- radeonsi: correct an assertion if we get a display list with no vertex buffers
3455- ac/nir: don't emit duplicated parameter exports
3456- ac/nir: use plural correctly in the ac_nir_export_parameters name
3457- radeonsi: remove unused vs_output_param_mask
3458- egl: reorder code in _eglQueryDevicesEXT, add \*swrast variable
3459- egl: don't expose swrast device if swrast is not built
3460- amd/llvm: fix handling of unsupported vec3 loads on gfx6
3461- amd/llvm: remove no-op code for vec3 loads in ac_build_tbuffer_load
3462- amd: update addrlib
3463- amd: rename GFX1036 -> RAPHAEL_MENDOCINO
3464- amd: set the correct LLVM processor name for gfx1036
3465- radeonsi/gfx11: reduce MSAA samples to 8 for no-attachment framebuffer
3466- radeonsi: simplify binning settings to work around GPU hangs
3467- amd: add gfx940 register definitions
3468- amd: add initial code for gfx940
3469- radeonsi: use COMPUTE_DISPATCH_SCRATCH_BASE on gfx940
3470- radeonsi: always use ffma32 on gfx940
3471- ac/surface: force linear image layout for chips not supporting image opcodes
3472- radeonsi: add an emulated image descriptor for gfx940
3473- ac/nir: implement image opcode emulation for CDNA, enable it in radeonsi
3474- radeonsi: don't set registers that don't exist on gfx940
3475- amd/registers: simplify integer division by 0x1000 in the parser
3476- amd/registers: fix the parser to include CP_COHER registers for gfx940
3477- amd/registers: update gfx940.json
3478- amd/registers: use gfx9 packet definitions for gfx940
3479- nir: fix 2 bugs in nir_create_passthrough_tcs
3480
3481Mario Kleiner (1):
3482
3483- v3dv: Enable (leased) direct display extensions.
3484
3485Mark Collins (4):
3486
3487- meson: update flex/bison requirement to cover all usages
3488- meson: forcefully disable libdrm when host doesn't have it
3489- tu: KGSL backend rewrite
3490- tu: fix tu_GetInstanceProcAddr not handling null instance
3491
3492Mark Janes (13):
3493
3494- intel: Implement Wa_16011448509
3495- util: add macro to support gcc/clang poison
3496- intel/dev: generate helpers to identify platform workarounds
3497- intel/dev: Print required workarounds with intel_dev_info
3498- intel/fs: use generated workaround helpers for Wa_14010017096
3499- intel/fs: use generated helpers for Wa_1209978020 / Wa_18012201914
3500- intel/fs: use generated workaround helpers for Wa_14017989577
3501- intel: use generated workaround helpers for Wa_1409600907
3502- intel: use generated helpers for Wa_1409433168/Wa_16011107343
3503- intel/fs: use generated helpers for Wa_14013363432 / Wa_14012688258
3504- intel/dev: fix macro string concatenation for INTEL_WA_{id}_GFX_VER
3505- intel/dev: fix macro naming convention in gen_wa_helpers.py
3506- intel/dev: use GFX_VERx10 to detect genX compilation
3507
3508Martin Roukala (né Peres) (22):
3509
3510- ci/deqp-runner: compress results.csv before uploading it to GitLab
3511- ci/piglit: compress results.csv before uploading it to GitLab
3512- zink/ci/radv: remove a test from the fails list
3513- zink/ci: add a fail to the VG flake list
3514- zink/ci: relocate radv testing from radv's gitlab-ci.yml
3515- zink/ci: add spec\@!opengl 1.1\@line-smooth-stipple to the fails list
3516- ci/b2c: uprev to b2c v0.9.9
3517- ci/debian/x86_test-vk: drop an outdated dependency
3518- ci/core-manual-rules: enclose the whole condition in quotes
3519- zink/ci: allow running manual jobs again on RADV
3520- ci/init-stage2: allow sourcing the job env vars from the CWD
3521- ci/init-stage2: always set XDG_RUNTIME_DIR
3522- ci/b2c: move away from the hand-rolled initscript
3523- ci: bring back the valve farm online
3524- ci/valve-farm-rules: allow running jobs from outside the mesa namespace
3525- radv/ci: reduce the parallelism for vkcts-vangogh
3526- zink/ci: increase the parallelism of zink-radv-vangogh-valve
3527- zink/ci: update the radv expectations
3528- radv/ci: update VanGogh's expectations
3529- ci/b2c: increase the console timeout to 4 minutes
3530- radv/ci: update the navi10 expectations
3531- zink/ci: add a test to the fails list
3532
3533Matt Coster (6):
3534
3535- pvr: Extract setup of winsys job submit flags into separate functions
3536- pvr: Add support for geometry-only render jobs
3537- pvr: Add pvr_csb_bake()
3538- pvr: Rename global_queue_job_count to global_cmd_buffer_submit_count
3539- pvr: Split render job submission for multi-layer framebuffers
3540- pvr: Add firmware stream support for transfer submit
3541
3542Matthieu Bouron (1):
3543
3544- lavapipe: honor dst base array layer when resolving color attachments
3545
3546Mauro Rossi (1):
3547
3548- hasvk: include "vk_android.h" header in anv_android.c
3549
3550Maíra Canal (1):
3551
3552- v3dv: remove unused clamp_to_transparent_black_border property
3553
3554Michel Dänzer (29):
3555
3556- mesa/st: Fix GL_EXT_texture_type_2_10_10_10_REV name in comment
3557- mesa/st: Handle all 10 bpc types in st_choose_format
3558- glsl/standalone: Fix up _mesa_reference_shader_program_data signature
3559- glsl/standalone: Do not pass memory allocated with ralloc_size to free
3560- anv/grl: Use union for reinterpreting integer as float
3561- clover: Reserve vector memory in make_text_section
3562- ci: Update Fedora image to 36
3563- ci: Re-enable intel-clc in fedora-release job
3564- ci: Enable i915 Gallium driver in fedora-release job
3565- ci: Enable the hasvk Vulkan driver in the fedora-release job
3566- frontend/dri: Initialize callbacks in dri_swrast_kms_init_screen
3567- nouveau: Make getSize return unsigned int
3568- r600: Use container_of instead of direct pointer cast
3569- crocus: Use ralloc_free for memory allocated with rzalloc
3570- iris: Use ralloc_free for memory allocated with rzalloc
3571- ci: Remove some -Werror workarounds for debian-android job
3572- ci: Split up -Werror workarounds for debian-mingw32-x86_64 job
3573- intel/vk/grl: Do not use no_override_init_args for C++
3574- ci: Pass -Werror to compiler linking stage for LTO
3575- ci: Allow passing c{,pp}_link_args to meson
3576- ci: Make ccache optional
3577- ci: Drop ccache from Fedora image
3578- ci: Install procps-ng in Fedora image
3579- ci: Enable LTO for fedora-release job
3580- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions
3581- svga: Make vmw_svga_winsys_buffer_map definition match declaration
3582- svga: Make declaration of emit_input_declaration match definition
3583- clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16
3584- clover/llvm: Use std::nullopt already with LLVM 16
3585
3586Michel Zou (4):
3587
3588- ci/mingw: drop useless -Wno-error flags
3589- vulkan/wsi: fix -Wnarrowing warning
3590- vk/entry_points:: fix mingw build
3591- mesa/draw: fix -Wformat warning
3592
3593Mike Blumenkrantz (536):
3594
3595- zink: simplify get_slot_components() for xfb emission
3596- zink: add renderdoc handling
3597- zink: prune old swapchains on present
3598- zink: break out implicit feedback loop detection into separate function
3599- zink: set textures_used in analyze_io
3600- zink: outdent code in add_implicit_color_feedback_loop()
3601- zink: make implicit feedback loop application stricter
3602- zink: skip implicit feedback loop layout changes if feedback loop not present
3603- zink: store drm format as internal_format for imported resources
3604- zink: handle modifier nplanes queries correctly for planar formats
3605- zink: NV_compute_shader_derivatives
3606- zink: preserve present resources during async presentation
3607- zink: add a util function for creating semaphores
3608- zink: add a binary semaphore cache
3609- zink: move semaphore caching to zink_reset_batch_state()
3610- zink: consolidate semaphore creation where possible
3611- zink: simplify some dynarray concat descriptor code
3612- zink: delete need_blend_constants
3613- zink: don't use ds3 blend states without color attachments
3614- radv: repack radv_graphics_pipeline struct
3615- radv: reorder dynamic state checks during bind
3616- radv: simplify depth aspect check in radv_handle_image_transition()
3617- radv: add some graphics pipeline hints to optimize pipeline bind
3618- radv: remove redundant type sizing
3619- radv: add an early out in radv_cmd_buffer_flush_dynamic_state()
3620- zink: use actual swapchain object for surface comparison
3621- radv: stop using radv_pipeline_has_stage() in BindPipeline
3622- zink: flag old-style shadow tex mask for fragment shaders
3623- zink: break out tex dest rewriting into separate function
3624- zink: add an extra_data param to zink_shader_compile
3625- zink: track depth swizzle on samplerviews
3626- zink: add a fs shader key member to indicate depth texturing mode
3627- zink: rework depth sampler splatting in shaders
3628- zink: block pipeline fast-pathing for any programs using depth texture modes
3629- zink: plug in the program/module parts of shadow texture mode emulation
3630- zink: create another samplerview for shadow textures
3631- zink: remove old depth swizzle workaround
3632- zink: pass depth swizzle data block to shader compile
3633- mesa: remove dead parameter doc for _mesa_new_texture_object()
3634- mesa: populate gl_program::ShadowSamplers mask from shader data
3635- mesa: (more) correctly handle incomplete depth textures
3636- zink: fix implicit feedback loop detection
3637- radv: Move constant flushing check out to callers.
3638- zink: fix VK_DYNAMIC_STATE_LINE_WIDTH usage
3639- zink: move barrier jit to zink_context.c
3640- zink: don't skip repeated handling feedback loops
3641- zink: return false for implicit feedback loop check with image binds
3642- zink: update sampler layout when detecting feedback loop for first time
3643- zink: force GENERAL layout for all fb attachments with image binds
3644- zink: validation ci updates
3645- zink: reorder commands more aggressively
3646- Revert "zink: allow direct memory mapping for any COHERENT+CACHED buffer"
3647- zink: fix heap/memory type selection
3648- zink: add VK_BUFFER_USAGE_SHADER_DEVICE_ADDRESS_BIT for buffers if ext is enabled
3649- zink: set vkusage/vkflags for buffer resource objects
3650- zink: assert that buffer descriptor usage is populated before bind
3651- zink: always set RESOURCE usage for descriptor buffers
3652- zink: disable bindless texture ext with descriptor buffer
3653- zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT when using DB
3654- zink: don't add dmabuf export type if dmabuf isn't supported
3655- zink: stop leaking push descriptor templates
3656- zink: always unref old images when adding new binds
3657- zink: hook up VK_EXT_multisampled_render_to_single_sampled
3658- zink: shrink zink_render_pass_state::msaa_expand_mask
3659- zink: use VK_EXT_multisampled_render_to_single_sampled for EXT_multisample_render_to_texture
3660- lavapipe: move noop fs creation to device
3661- lavapipe: add refcounting for shader nir
3662- lavapipe: refcount nir shaders instead of cloning
3663- lavapipe: break out (and slightly refactor) gallium shader cso creation
3664- lavapipe: create gfx gallium csos at pipeline bind
3665- lavapipe: delete unused pipelines immediately
3666- lavapipe: delete lvp_pipeline::mem_ctx
3667- lavapipe: try harder to reuse pipeline layouts during merge
3668- zink: only set VkPipelineColorBlendStateCreateInfo::attachmentCount without full ds3
3669- zink: fix zink_mem_type_idx_from_bits()
3670- zink: rework descriptor buffer templating to use offsets
3671- Revert "zink: fix zink_mem_type_idx_from_bits()"
3672- zink: enable PIPE_CAP_ALLOW_GLTHREAD_BUFFER_SUBDATA_OPT
3673- zink: make bindless buffer_infos a union
3674- zink: fix bindless struct member comments
3675- zink: skip updating descriptor buffer sets that aren't active
3676- zink: set VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT on compute pipelines
3677- zink: break out descriptor binding into separate function
3678- zink: add a flag to indicate whether a descriptor buffer is bound
3679- zink: implement descriptor buffer handling of bindless texture
3680- zink: enable bindless texture with ZINK_DESCRIPTORS=db
3681- zink: free descriptor buffer maps on batch state destroy
3682- zink: fix more cases of heap/memtype suballocator mismatch
3683- zink: cache and reuse dummy inputattachment for fbfetch
3684- zink: handle missing line rasterization modes with ds3
3685- zink: add back VK_DESCRIPTOR_BINDING_PARTIALLY_BOUND_BIT for bindless
3686- gallium: add PIPE_CAP_NULL_TEXTURES
3687- radeonsi: set PIPE_CAP_NULL_TEXTURES
3688- zink: conditionally enable PIPE_CAP_NULL_TEXTURES
3689- zink: fix max acquired image count
3690- lavapipe: disable VK_FORMAT_FEATURE_2_COLOR_ATTACHMENT_BLEND_BIT for int formats
3691- zink: set PIPE_CAP_VALIDATE_ALL_DIRTY_STATES
3692- zink: move bindless_layout to screen and init on creation
3693- zink: take screen param in init_db_template_entry()
3694- zink: const-ify a bunch of shader key inlines
3695- zink: move gpl usability checks to static inline for reuse
3696- zink: remove duplicated gpl output blend initializations
3697- zink: store last pipeline directly for zink_gfx_program::last_pipeline
3698- zink: pass screen to descriptor_util_pool_key_get()
3699- zink: delete zink_screen::framebuffer_cache
3700- zink: make last_vertex_stage the first bit in zink_vs_key_base
3701- zink: ralloc zink_shader structs
3702- zink: add a define for the "default" optimal key
3703- zink: add a define for testing that an optimal key is the default
3704- zink: add VK_PIPELINE_CREATE_DESCRIPTOR_BUFFER_BIT_EXT for gpl libs
3705- zink: don't set blend_id with full_ds3
3706- zink: set gfx feedback loop bit in pipeline state for driver workaround
3707- zink: set zs feedback loop bit from driver workaround on ctx create
3708- zink: fix gpl lib hashing
3709- zink: use screen indexing for bindless descriptor set in db bind
3710- zink: use screen indexing for bindless descriptor set in template bind
3711- util/vbuf: fix multidraw unrolling
3712- zink: flag bindless_init before calling zink_batch_bind_db() in init
3713- zink: avoid the descriptor set multiplier for bindless buffers
3714- zink: split out VkShaderModule creation
3715- zink: add flags param to zink_pipeline_layout_create()
3716- zink: split out gfx pipeline library creation
3717- zink: add gpl flags for libraries based on shaders passed
3718- zink: allow multiple gpl libraries in zink_create_gfx_pipeline_combined()
3719- zink: move gpl input/output funcs to zink_pipeline.c
3720- zink: enable combining intermediate gpl libs from combine function
3721- zink: use GPL to handle (simple) separate shader objects
3722- zink: set PIPE_CAP_SURFACE_REINTERPRET_BLOCKS
3723- zink: store gfx_hash on zink_gfx_program
3724- zink: break out zink_gfx_program::libs into refcounted object
3725- zink: implement cross-program pipeline library sharing
3726- zink: add newlines to some debug printfs
3727- zink: rename some variables in zink_set_shader_images()
3728- zink: unset gfx shader read when unbinding shader images
3729- zink: remove stale comment
3730- zink: unref image buffer descriptors on unbind
3731- zink: rework set_shader_images() hook
3732- zink: pull out image descriptor updating in set_shader_images
3733- zink: add a local is_compute var for set_shader_images
3734- zink: allocate all batch command buffers in one call
3735- zink: sync LTO compiles for GPL pipelines on shader free
3736- zink: fix descriptor pool free iterating
3737- zink: don't fetch/update pipeline cache for separate shader programs
3738- zink: assert that the found program matches the expected one in shader_free
3739- zink: flag gfx programs as removed-from-cache by default
3740- zink: rework separate shader descriptor iterating
3741- zink: use a single descriptor buffer for all non-bindless types
3742- zink: add an io assignment pass for separate shaders
3743- zink: rename a struct member for clarity
3744- zink: move zink_batch_state::db_bound reset to zink_batch_descriptor_reset()
3745- zink: move db_bound to batch descriptor data
3746- zink: ensure db is bound before separate shader update
3747- zink: store base descriptor size on the screen
3748- zink: implement a scaling descriptor buffer size
3749- zink: calloc separable program zink_gfx_library_key struct
3750- zink: also replace hash_entry::key when replacing separable program
3751- zink: always use NEAREST for zs blits
3752- zink: fix indentation of rebind_image()
3753- zink: only try for a fb rebind if fb binds exist in rebind_image()
3754- zink: account for null surface when trying to retain clears on fb bind
3755- zink: break out pipe_surface init for new surface creation
3756- zink: const-ify a surface param
3757- zink: don't handle mutable init on surface creation with tc enabled
3758- zink: verify compressed format layer count when creating surfaces
3759- zink: set ZINK_DESCRIPTORS=db for radv jobs
3760- zink: enable renderpass optimizations by default for selected drivers
3761- Revert "zink: always use NEAREST for zs blits"
3762- zink: block LINEAR filtered blits for zs formats
3763- lavapipe: enable linear filtering for depth formats
3764- gallium: plumb resolve attachments through from frontends -> pipe_framebuffer_state
3765- dri3: avoid deadlocking when polling deleted windows for events
3766- vulkan/wsi: avoid deadlocking dri3 when polling deleted windows for events
3767- vl/dri3: avoid deadlocking when polling deleted windows for events
3768- zink: delete some now-broken ntv dref sampling code
3769- zink: more accurately handle i/o for separate shaders
3770- zink: handle semi-matching i/o for separate shaders
3771- zink: enable renderpass optimizing on lavapipe
3772- kopper: fix loop iterating for msaa texture creation
3773- zink: move db input attachment size check to screen init
3774- zink: make ZINK_DESCRIPTOR_MODE=db the default
3775- driconf: add zink glthread disable for a game
3776- aux/tc: add a 'has_resolve' member to tc_renderpass_info
3777- zink: actually hook up ZINK_DEBUG=norp
3778- zink: add ZINK_DEBUG=map
3779- zink: add debug marker tracing for qbo updates
3780- util/box: add intersection test functions for 1d/3d
3781- zink: add some tracking for copy box regions
3782- zink: add a util function for optimizing TRANSFER_DST image barriers
3783- zink: utilize copy box tracking to avoid barrier emission for buf2img copies
3784- zink: fix slab allocator sizing
3785- zink: delete dead uniform variables
3786- zink: fix shader read access removal for barrier generation
3787- zink: rework descriptor unbind params to use is_compute directly
3788- zink: fix bindless texture barrier generation
3789- zink: delete unused emit_image param in ntv
3790- zink: simplify/rework image typing in ntv
3791- zink: avoid adding ubo/ssbo bindings multiple times for different bitsizes
3792- zink: add locking for zink_screen::copy_context and defer creation
3793- zink: allow direct memory mapping for any COHERENT+CACHED buffer
3794- lavapipe: EXT_image_sliced_view_of_3d
3795- vulkan/wsi: switch to using an options struct for last param
3796- vulkan/wsi/x11: make 4 image minimum for xwayland driver-specific
3797- docs: add pipeline library support for tu
3798- aux/tc: track whether queries have been terminated in a renderpass
3799- aux/tc: only call tc_parse_draw() when parsing renderpass info
3800- zink: move zink_batch_no_rp call for query reset
3801- zink: remove suspended queries from list before resuming
3802- zink: reset queries on the promoted cmdbuf when possible
3803- zink: rewrite zink_query_start struct
3804- zink: fix zink_query_start initialization
3805- zink: fix possible query destroy leak
3806- zink: make zink_vk_query unref consistent
3807- zink: delete zink_query::last_start_idx
3808- zink: handle multiple query starts in qbo update
3809- zink: don't auto-sync qbos on query end
3810- zink: add zink_query::suspended to indicate suspended state of queries
3811- zink: un-suspend queries if they end while suspended
3812- zink: don't double suspend queries
3813- zink: refuse to start cs invocation queries in renderpass
3814- zink: only try doing qbo updates on query suspend if !in_rp
3815- zink: handle null query results
3816- zink: handle null query results for conditional render
3817- zink: only update qbo for TIME_ELAPSED on start if !in_rp
3818- zink: try updating qbos on query resume if !in_rp
3819- zink: reorder some query code
3820- zink: rework find_or_allocate_qp()
3821- zink: create/use query pools dynamically
3822- zink: rework query pool overflow
3823- zink: track whether queries were started in a renderpass
3824- zink: break out query suspend functionality for reuse
3825- zink: use more consistent check for deleting zink_query::stats_list links
3826- zink: pull 'was_line_loop' into ctx for query updating
3827- zink: always start/stop/resume queries inside renderpasses
3828- zink: use tc renderpass optimizing to more optimally start queries
3829- zink: skip buffer barriers for ACCESS_NONE -> ACCESS_READ / ACCESS_WRITE
3830- zink: disable queries for clear_texture()
3831- zink: resume queries after conditional render and clears are processed
3832- zink: only resume queries inside renderpasses from set_active_query_state
3833- zink: track whether a primgen query is suspended and needing color write hacks
3834- zink: rework xfb queries for drivers with poor primgen support
3835- zink: merge qbo update copies when possible
3836- zink: set predicate_dirty on query creation
3837- zink: eliminate internal qbo copy barrier
3838- util/box: fix off-by-one calc error in intersection funcs
3839- zink: avoid ballooning of copy box tracking
3840- zink: add perfetto tracing for barriers
3841- zink: avoid unnecessary read-only layout changes for zs attachments
3842- zink: skip attachment barrier for redundant layout-setting if !valid
3843- zink: add functions for faster batch-usage completion checks
3844- zink: fix zink_resource_access_is_write()
3845- zink: use split image barriers if the image can be easily proved idle
3846- zink: skip buffer barriers if the buffer can be easily proved idle
3847- zink: determine whether debug markers are used on screen create
3848- zink: be more descriptive with perfetto buffer barriers
3849- zink: reorder some blit debug markers
3850- zink: pass cmdbuf to debug marker begin
3851- zink: remove debug markers for u_blitter ops
3852- zink: when skipping a TRANSFER_DST image barrier, set access tracking
3853- zink: add frame trace markers on the queue
3854- zink: unbind fb on context destroy
3855- zink: only add deferred barrier on fb unbind when layout needs to change
3856- zink: fix descriptor update flagging on null ssbo set
3857- zink: propagate valid_buffer_range when replacing buffer storage
3858- zink: check for layout updates when unbinding samplerviews
3859- zink: eliminate pre barrier for adding resource binds
3860- zink: don't unset existing access when adding resource binds
3861- zink: explicitly flush src clears when u_blittering
3862- zink: always set color writes on the unordered cmdbuf
3863- zink: bind descriptor buffers to unordered cmdbuf
3864- zink: set dynamic pcp for unordered cmdbuf
3865- zink: always set batch usage for descriptors after barrier
3866- llvmpipe: fix LP_PERF=no_depth to ignore depth format
3867- zink: track the last write access for resources
3868- zink: add a mechanism for managing TRANSFER_DST buffer barriers
3869- zink: add a mechanism to trigger copy box resets from batch state reset
3870- zink: add a driver workaround to disable copy box optimizations
3871- zink: hook up buffer TRANSFER_DST barrier optimizing
3872- aux/tc: fix rp info resizing clobbering current info
3873- vulkan/wsi: fix crash in failed swapchain creation for wayland
3874- lavapipe: split out spirv compile of shaders
3875- lavapipe: split out shader struct members into their own struct
3876- lavapipe: pass shader struct and layout to scan_pipeline_info()
3877- lavapipe: more small shader struct usage tweaks
3878- lavapipe: move xfb init to shader struct
3879- lavapipe: rename inline uniform function params
3880- lavapipe: move uniform inline functions to shader struct
3881- lavapipe: break out main shader lowering into separate function
3882- Revert "Revert "ci: disable mesa-swrast runner jobs""
3883- zink: ignore renderdoc if ZINK_RENDERDOC isn't in use
3884- radv: delete radv_graphics_pipeline_compile() asserts
3885- radv: avoid a huge memset in radv_graphics_pipeline_compile()
3886- aux/tc: use renderpass tracking to optimize texture_subdata calls
3887- lavapipe: fix dynamic depth clamping
3888- lavapipe: set render_condition_enabled=false for vkCmdClearDepthStencilImage
3889- lavapipe: add command debugging
3890- lavapipe: beef up LVP_POISON_MEMORY
3891- ci: fix LVP_POISON_MEMORY usage
3892- zink: rework zink_resource::valid_buffer_range
3893- zink: return the unordered state from zink_resource_buffer_transfer_dst_barrier()
3894- zink: unify image TRANSFER_DST barrier checks
3895- zink: rename zink_check_transfer_dst_barrier()
3896- zink: super reorder buffer copies
3897- tu: don't set startup debug on debug builds
3898- zink: fix copy box iteration
3899- glthread: align small buffer uploads to 4 bytes
3900- zink: fix copy box iteration when adding
3901- zink: fix copy box merging adjacency
3902- aux/trace: delete GALLIUM_TRACE_NIR log message
3903- zink: fix layer check for compressed format surface creation
3904- zink: track current queue for resources
3905- zink: remove redundant dmabuf_acquire setting
3906- zink: use res->queue to auto-handle queue transitions back to gfx queue
3907- zink: track tc fences better
3908- zink: add an assert to catch renderpass optimizing bugs
3909- zink: don't use/update tc rp info while blitting
3910- zink: reset tc fb info upon splitting a renderpass
3911- zink: add and use a function for "safely" ending renderpasses
3912- zink: disable queries when flushing clears from set_fb
3913- zink: disable tc flush notify with rp optimizing
3914- zink: trigger oom flushes more aggressively from copy ops
3915- aux/tc: flag late zs clears as partial clears
3916- aux/tc: use a local 'deferred' variable in tc_flush()
3917- aux/tc: fix renderpass splitting on flush
3918- aux/tc: track the number of active queries
3919- aux/tc: don't sync for get_sample_position
3920- aux/tc: add a function to reset rp info
3921- aux/tc: fix initial rp info allocation
3922- aux/tc: make some of the rp tracking api private
3923- aux/tc: rework inter-batch renderpass info handling
3924- zink: only flag rp info for updating on flush, don't actually update
3925- zink: add tracing for copy ops
3926- zink: expand ZINK_DEBUG=sync to cover copy ops
3927- zink: add some asserts for zs layout in dynamic render
3928- zink: double check layouts for possible feedback loop images
3929- zink: end rp earlier in set_framebuffer_state
3930- zink: add a function for applying u_blitter barriers
3931- zink: add a dynamic render version of clear_texture hook
3932- zink: reset fb clears using the clears_enabled mask
3933- zink: manually apply barriers whenever zink_context::blitting is set
3934- zink: split out pipeline rp info update function
3935- zink: add zink_context::unordered_blitting to preserve unordered flags
3936- zink: implement unordered u_blitter calls
3937- zink: round geometry for u_blitter debug markers
3938- zink: catch zs u_blitter ops for draw markers
3939- zink: add debug markers for draws
3940- zink: track zsbuf info even when rp optimizing is disabled
3941- zink: fix dsa state parsing for tc info
3942- zink: track whether the fb zsbuf is readonly
3943- zink: add a fixup case for readonly zsbuf clears
3944- zink: improve no-oping of write -> readonly zsbuf layouts
3945- zink: don't flag rp layout check on next draw when rp optimizing
3946- zink: allow zink_is_zsbuf_used() without tc / rp optimizing
3947- zink: rename add_implicit_color_feedback_loop()
3948- zink: explicitly eliminate feedback loops for unused zsbufs
3949- zink: further eliminate zs implicit feedback loops for read-only access
3950- zink: split out luminance/alpha clear conversion code for reuse
3951- zink: convert luminance/alpha clear colors in dynamic texture clear
3952- zink: handle swapchain creation failure less lazily
3953- zink: simplify resource_check_defer_buffer_barrier()
3954- zink: delete unused barrier api
3955- zink: never split a renderpass for a loadop change
3956- zink: flag some rp ends as unsafe
3957- zink: add batch refs for framebuffer surfaces on bind and ref update
3958- zink: fix unordered access for image descriptors
3959- zink: force unordered_write=false when binding image descriptors
3960- zink: add an assert to ensure zsbuf invalidation doesn't break rendering
3961- zink: only run post-fb-unbind layout stuff if the resource isn't being destroyed
3962- zink: always set sampler layouts when unbinding fb images while rp optimizing
3963- zink: rework handling of unordered->ordered write buffer barriers
3964- zink: don't update fbfetch in db mode if inputAttachmentDescriptorSize==0
3965- zink: add ZINK_DEBUG=flushsync
3966- zink: track whether zsbuf is unused
3967- zink: flag rp layout change if zsbuf usedness changes on dsa/fs state bind
3968- driconf: make glthread=true default for source games
3969- aux/trace: dump blend states with enums
3970- aux/trace: fix GALLIUM_TRACE_NIR handling
3971- zink: fix some type mismatches for c++ compilation
3972- zink: break out a src region barrier check for reuse
3973- zink: move all barrier-related functions to c++
3974- zink: use c++ template to deduplicate all the buffer barrier code
3975- zink: minor tweaks for image barriers
3976- zink: use c++ template to deduplicate image barrier functions
3977- zink: stop leaking separate shader nir
3978- lavapipe: always copy streamout info when creating shaders
3979- lavapipe: don't memcpy tess_ccw when copying pipeline library shaders
3980- lavapipe: refactor shader compile functions to not take pipeline params
3981- lavapipe: track bound shader stages on rendering_state
3982- lavapipe: add a device member to rendering_state
3983- lavapipe: stop using rendering_state::pipeline
3984- lavapipe: refactor compute shader binding
3985- lavapipe: merge some loops in handle_graphics_pipeline()
3986- lavapipe: PIPE_SHADER\_ -> MESA_SHADER\_
3987- lavapipe: don't access pipeline shader structs as much during bind
3988- lavapipe: pull out dynamic tess origin check in gfx pipeline bind
3989- lavapipe: break out all the important parts of gfx pipeline setting for reuse
3990- lavapipe: delete unused struct member
3991- lavapipe: refactor pipeline destroy a bit
3992- lavapipe: add a ref for the tess_ccw nir on creation
3993- lavapipe: unify lvp_pipeline_nir creation
3994- lavapipe: dynamically bind noop fs at draw time when needed
3995- lavapipe: don't double unbind gfx stages on pipeline bind
3996- lavapipe: split out gfx stage unbinding
3997- lavapipe: only update shader access for bind/unbind stages
3998- lavapipe: only unset tess_states pointers on tes bind
3999- lavapipe: avoid uniformly unsetting gs_output_lines
4000- lavapipe: move default rasterizer state values to rendering_state init
4001- mapi: add InternalInvalidateFramebufferAncillaryMESA
4002- glthread: add _mesa_glthread_invalidate_zsbuf()
4003- kopper: apply ancillary invalidation through glthread on swapbuffers
4004- llvmpipe: fix linear fs analysis with nonzero fs outputs
4005- llvmpipe: fix handling of unused color attachments
4006- zink: add spirv builder function for terminate
4007- zink: set src access when rebinding buffers, unset unordered_*
4008- zink: fix quads emulation gs with array variables
4009- zink: block resolves where src extents > dst extents
4010- zink: omit VkPipelineVertexInputStateCreateInfo with dynamic vinput
4011- zink: flag vertex buffers for rebind after vstate draws
4012- zink: use search_or_add for masking vstate
4013- zink: bind vertex state directly from draw hook
4014- zink: add another vstate draw template for popcnt presence
4015- zink: explicitly pass null velems when creating pipelines with dynamic vinput
4016- zink: don't swizzle velems state for vstate draws
4017- zink: use fast popcnt for vstate draws
4018- zink: stop caching vertex states
4019- lavapipe: break out pipeline layout creation for reuse
4020- lavapipe: implement EXT_shader_object
4021- lavapipe: advertise EXT_shader_object
4022- zink: delete shader reordering in assign_io()
4023- zink: add and populate a shader_info struct to zink_shader
4024- zink: pass nir_shader to update_so_info()
4025- zink: generate flat_flags during shader creation
4026- zink: use zink_shader::info instead of zink_shader::nir::info
4027- zink: simplify fbfetch output detection from fs
4028- zink: pass nir directly to zink_shader_tcs_create()
4029- zink: swap nir pointers when compiling compute shaders
4030- zink: directly return nir from zink_shader_tcs_create
4031- zink: streamline nir cloning for assign_io
4032- zink: store nir as serialized on zink_shader structs
4033- zink: simplify assign_io() further
4034- zink: break out nir blob deserializing
4035- zink: move nir cloning out to callers of zink_shader_compile
4036- zink: store num_inlinable_uniforms separately for cs programs
4037- zink: always store nir serialized
4038- zink: be explicit about separate shader dsl indexing during creation
4039- zink: rework choose_pdev (again)
4040- glthread: use id 0 for internal buffer objects
4041- radv: fix leak of nir from retained shaders
4042- zink: don't try copying multiple results for conditional render copy
4043- zink: more explicitly track/check rp optimizing per-context
4044- zink: don't access non_fs part of zink_shader from fs
4045- zink: reuse d3d12 variable copying to make passthrough gs more robust
4046- zink: reuse copy_vars for generated tcs
4047- zink: don't trigger shader variants on pcp change if driver supports dynamic pcp
4048- Revert "zink: don't trigger shader variants on pcp change if driver supports dynamic pcp"
4049- zink: try to prune resources from barrier jit on fb unbind
4050- lavapipe: copy fragment shader when merging GPL pipelines
4051- lavapipe: refactor/consolidate GPL shader copying
4052- lavapipe: don't double-inline ubo0
4053- lavapipe: implement inline variant caching
4054- zink: block oom flushes during unordered blits
4055- zink: unroll array loop when copying vars for passthrough shaders
4056- zink: free GPL input/output libs on context destroy to avoid leaking
4057- zink: fix GPL lib leaking
4058- zink: remove redundant 'blitting' check in zink_prep_fb_attachment()
4059- zink: break out feedback loop pipeline state flagging for reuse
4060- zink: pre-convert attachment id to attachment idx
4061- zink: eliminate implicit feedback loops on rp begin
4062- zink: track per-image swapchain layouts
4063- zink: handle swapchain handoffs around makecurrent
4064- zink: remove a fixed validation error for ci
4065- mesa/st/program: don't init xfb info if there are no outputs
4066- zink: remove atomics from zink_query
4067- zink: pass ctx through query destroy paths
4068- zink: always defer query pool deletion
4069- zink: move memoryTypeIndex selection down in general bo allocation
4070- zink: slightly rework memoryTypeIndex selection to pre-determine heap
4071- zink: restore BAR allocation failure demotion
4072- zink: make general bo allocation more robust by iterating
4073- zink: avoid zero-sized memcmp for descriptor layouts
4074- iris: use util_framebuffer_get_num_samples when setting ps dispatch samples
4075- zink: manually re-set framebuffer after msrtss replicate blit
4076- zink: handle 'blitting' flag better in msrtss replication
4077- zink: skip msrtss replicate if the attachment will be full-cleared
4078- zink: avoid recursion during msrtss blits from flushing clears
4079- nir/lower_alpha_test: rzalloc state slots
4080- zink: fix non-db bindless texture buffers
4081- zink: emit demote cap when using demote
4082- zink: only print copy box warning once per resource
4083- util/debug: move null checks out of debug message macro
4084- zink: don't bitcast bool deref loads/stores
4085- drisw: don't leak the winsys
4086- zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support
4087- draw: fix viewmask iterating
4088- zink: don't pin flush queue threads if no threads exist
4089- zink: add z32s8 as mandatory GL3.0 profile attachment format
4090- nir/gs: fix array type copying for passthrough gs
4091- zink: fix array copying in pv lowering
4092- gallivm: break out native vector width calc for reuse
4093- llvmpipe: do late init for llvm builder
4094- zink: break out VkImageViewUsageCreateInfo applying for reuse
4095- zink: reapply VkImageViewUsageCreateInfo when rebinding a surface
4096- draw: fix robust ubo size calc
4097- llvmpipe: fix native vector width init
4098- zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile
4099- zink: only unset a generated tcs if the bound tcs is the generated one
4100- zink: set depth dynamic state values unconditionally
4101- zink: null some descriptor buffer pointers during destruction
4102- zink: sync queries at the end of cmdbufs
4103- cso: unbind fb state when unbinding the context
4104- i915: use util_copy_framebuffer_state to set fb state
4105- i915: use util_unreference_framebuffer_state to unref fb state
4106- iris: use util_unreference_framebuffer_state to unref fb state
4107- softpipe: use util_unreference_framebuffer_state to unref fb state
4108- v3d: use util_unreference_framebuffer_state to unref fb state
4109- vc4: use util_unreference_framebuffer_state to unref fb state
4110- llvmpipe: use util_unreference_framebuffer_state to unref fb state
4111- svga: use util_unreference_framebuffer_state to unref fb state
4112- zink: don't init mutable resource bit for swapchain images
4113- zink: don't init mutable for swapchain src during blit
4114- zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported
4115- zink: stringify unsupported prim restart log error
4116- zink: delete persistent map tracking
4117- zink: add PERSISTENT for db buffer maps
4118- zink: delete unnecessary pipeline stage flags from inference
4119- zink: use an intermediate variable for binding ssbo slots
4120- zink: unbind the ssbo slot being iterated, not the index of the buffer
4121- zink: flush INDIRECT_BUFFER mem barrier for compute
4122- zink: disable batched unordered barries with ZINK_DEBUG=noreorder
4123- zink: block batching of unordered barriers if previous usage was write
4124- zink: fix uncached memory readback
4125- glsl/lower_samplers_as_deref: apply bindings for unused samplers
4126- zink: bind bindless db set when updating separate shader db sets
4127- zink: compare desc set to detect bindless vars in separate shaders
4128- zink: adjust bindless texel buffer handle before indexing
4129- zink: block more flushes during unordered blits
4130- zink: also cache swapchain semaphores
4131
4132Mohamed Ahmed (3):
4133
4134- vulkan/runtime: move common buffer related entrypoints to vk_buffer.c
4135- vulkan/runtime: implement vkGetBufferMemoryRequirements2()
4136- anv: remove GetBufferMemoryRequirements2()
4137
4138Nanley Chery (16):
4139
4140- docs: Document the implicit barriers around blits
4141- glsl: Add compute shaders to encode DXT5/BC3
4142- glsl: Modify the #includes in the DXT5 shaders
4143- mesa: Create _mesa_CreateShaderProgramv_impl
4144- mesa/st: Add get_compute_program
4145- mesa/st: Add and use create_bc1_endpoint_ssbo
4146- mesa/st: Add st_compute_transcode_astc_to_dxt5
4147- mesa/st: Add st_texture_image_resource_level
4148- mesa/st: Enable compute-based transcoding to DXT5
4149- mesa/st: Measure compressed fallback unmap paths
4150- iris: Update comment in iris_cache_flush_for_render
4151- iris: Flush caches for aux-mode changes more often
4152- iris: Drop iris_cache_flush_for_render
4153- iris: Allocate ZEROED BOs for shared resources
4154- iris/bufmgr: Add and use zero_bo
4155- iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED
4156
4157Nataraj Deshpande (1):
4158
4159- anv: Bump VkDeviceMemory objects limit to 4GB
4160
4161Neha Bhende (1):
4162
4163- docs: Add GL 4.3 support info in mesa docs
4164
4165Nicolas Dufresne (1):
4166
4167- util/format: Fix wrong colors when importing YUYV and UYVY
4168
4169Nicolas F (1):
4170
4171- driconf: remove the adaptive sync special case for mpv
4172
4173Oleksii Bozhenko (5):
4174
4175- glsl: fix gl_CullDistance lowering from float[8] to vec4[2]
4176- ci: Uprev Piglit
4177- Move combining clip and cull optimization before linking
4178- wsi: add rgb_component_bits_are_equal
4179- wsi: remove get_sorted_vk_formats duplication
4180
4181Patrick Lerda (25):
4182
4183- lima: fix memory leak related to u_transfer_helper_create()
4184- mesa/program: fix memory leak triggered by parser errors
4185- mesa/st: fix possible crash related to arb invalid memory access
4186- r600: fix shader blob memory leak
4187- vbo/save: fix possible crash related to fixup_vertex()
4188- mesa/shaderapi: fix path memory leak
4189- mesa/framebuffer: fix gl_framebuffer.resolve refcnt imbalance
4190- mesa/program: fix memory leak triggered by invalid extended swizzle selector
4191- mesa/program: fix memory leak triggered by multiple targets used on one texture image unit
4192- mesa/program: fix memory leak triggered by arb alias
4193- radeonsi: fix memory leak related to ureg_get_tokens()
4194- glx: fix memory leak related to __glXCloseDisplay()
4195- r600: fix refcnt imbalance related to shader
4196- intel: fix memory leak related to brw_nir_create_passthrough_tcs()
4197- r600: fix typo that could lead to a possible crash
4198- egl: fix memory leak related to _eglRefreshDeviceList()
4199- r600: fix refcnt imbalance related to r600_set_vertex_buffers()
4200- r600: fix refcnt imbalance related to evergreen_set_shader_images()
4201- lima: fix refcnt imbalance related to framebuffer
4202- r600/sfn: fix memory leak related to sh_info->arrays
4203- aux/draw: fix memory leak related to ureg_get_tokens()
4204- crocus: fix refcnt imbalance related to framebuffer
4205- crocus: fix refcnt imbalance related to crocus_create_surface()
4206- r600: fix refcnt imbalance related to atomic_buffer_state
4207- radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment
4208
4209Paul Gofman (1):
4210
4211- driconf: add a workaround for Kaiju-A-Gogo
4212
4213Paulo Zanoni (8):
4214
4215- anv: don't leave undefined values in exec->syncobj_values
4216- anv: check the return value of anv_execbuf_add_bo_bitset()
4217- anv: run buf_finish() if add_bo() fails during execute_simple_batch()
4218- anv: rename anv_execbuf->array_length to bo_array_length
4219- anv: use vk_realloc for the anv_execbuf arrays
4220- hasvk: don't leave undefined values in exec->syncobj_values
4221- hasvk: check the return value of anv_execbuf_add_bo_bitset()
4222- anv: there's no need to set exec_obj offsets twice
4223
4224Pavel Ondračka (16):
4225
4226- nir/lower_bool: ntt: Generate a good opcode for bcsel
4227- r300: update rv515 ci failures list
4228- r300: skip sin/cos input range transformation for nine and ntt
4229- r300: remove backend input range transformation for sin and cos
4230- ntt: pass ubo_vec4_max nir_opt_offsets flag through ntt options
4231- r300: set ubo_vec4_max ntt option properly
4232- r300: remove backend negative addressing emulation
4233- nir: nir opt_shrink_vectors whitespace fix
4234- nir: mark progress when removing trailing unused alu channels
4235- nir: mark progress when removing trailing unused load_const channels
4236- r300: set register file to none if swizzles are constant only
4237- nir: shrink phi nodes in nir_opt_shrink_vectors
4238- r300: drop VDPAU support
4239- r300: simplify KILL transformation
4240- nine: use separate register for aL emulation
4241- r300: fix unconditional KIL on R300/R400
4242
4243Pedro J. Estébanez (4):
4244
4245- spirv_to_dxil: Unify spirv_to_nir_options
4246- spirv2dxil: Split read-only image as SRV logic into declared and inferred
4247- spirv: Assume input attachments are read-only
4248- Revert "microsoft/compiler: Use SRVs for read-only images"
4249
4250Philip Langdale (1):
4251
4252- radeonsi: correctly declare YUV420_10 RT Format support for AV1
4253
4254Philipp Zabel (2):
4255
4256- vulkan/wsi/wayland: fix acquire_next_image to report timeouts properly
4257- zink: fix build with -Dvulkan-beta=true
4258
4259Pierre-Eric Pelloux-Prayer (37):
4260
4261- radeonsi: simplify dpbb settings
4262- ac/info: move pci bus info in a struct
4263- ac: add ac_query_pci_bus_info helper
4264- ac: don't call ac_query_pci_bus_info from ac_query_gpu_info
4265- radeonsi/sqtt: don't read results for disabled SEs
4266- radeonsi/sqtt: disable SE1+ on GFX11
4267- radeonsi/sqtt: update registers for gfx11
4268- radeonsi/sqtt: implement offset workaround for gfx11
4269- vbo: remove bogus assert
4270- vbo: lower VBO_SAVE_BUFFER_SIZE to avoid large VRAM usage
4271- glthread: fix glArrayElement handling
4272- drm-uapi/dma-buf.h: use __u32/__u64 types
4273- winsys/amdgpu: use DMA_BUF_SET_NAME_B if available
4274- radeonsi/gfx11: clamp PRIM_GRP_SIZE
4275- radeonsi/gfx11: fix ge_cntl programming
4276- amd/surface: fix base_mip_width of subsampled formats
4277- winsys/amdgpu: use amdgpu_device_get_fd
4278- radeonsi/video: use specific PIPE_BIND\_ value for video buffers
4279- radeonsi: fix incorrect vgpr indices in the ps_prolog
4280- radeonsi/test: use gbm-skips.txt
4281- radeonsi/test: update test results
4282- radeonsi: don't use PKT3_SET_SH_REG_INDEX on gfx9 and older
4283- radeonsi: fix fast depth_clear_value/stencil_clear_value
4284- yegl/wayland: fix glthread deadlocks
4285- Revert "driconf: add a workaround for plasmashell freezing"
4286- ac/llvm: fix build with LLVM 17
4287- mesa: fix CopyImageSubDataOES with GL_TEXTURE_EXTERNAL_OES
4288- amd/surface: rename metadata functions
4289- ac/surface: introduce umd metadata v2
4290- radeonsi: add AMD_DEBUG=extra_md
4291- radeonsi: don't use si_decompress_dcc if the blitter is running
4292- radv: add RADV_DEBUG=extra_md
4293- radeonsi: don't use alignment_log2 of imported buffers
4294- mesa: fix invalid index_bo refcounting
4295- util/vbuf: clarify indirect draws handling
4296- util/vbuf: fix index_bo leak
4297- radeonsi: update test results
4298
4299Pino Toscano (1):
4300
4301- symbols-check: support OSes based on GNU toolchain
4302
4303Qiang Yu (78):
4304
4305- radeonsi: implement nir_load_ring_gsvs_amd
4306- radeonsi: implement nir_load_ring_gs2vs_offset_amd
4307- radeonsi: lower nir streamout intrinsics in abi
4308- radeonsi: use nir_print_xfb_info to replace si_dump_streamout
4309- radeonsi: use ac_nir_lower_legacy_vs to replace si_llvm_vs_build_end
4310- radeonsi: add nir implementation of gs copy shader generation
4311- radeonsi: build legacy gs output info when shader compile
4312- radeonsi: replace llvm gs copy shader generation with nir
4313- radeonsi: remove llvm gs copy shader generate
4314- radeonsi: replace llvm legacy gs code with nir lowering
4315- radeonsi: move gfx10_ngg_export_vertex to si_shader_llvm.c
4316- gallium/aux: remove nir_helpers
4317- nir/xfb_info: nir_gather_xfb_info_from_intrinsics update nir xfb_info
4318- radeonsi: update nir xfb info after medium io lowering
4319- nir: add nir_export_amd intrinsic
4320- ac/llvm: implement nir_export_amd
4321- aco: implement nir_export_amd
4322- ac/nir: gs and nogs use ac_nir_export_primitive
4323- ac/nir: add ac_nir_export_position
4324- ac/nir: add ac_nir_export_parameter
4325- ac/nir: add force_vrs to ac_nir_export_position
4326- amd,radeonsi: implement nir_load_force_vrs_rates_amd in driver abi
4327- radeonsi: clamp vertex color in legacy gs instead of gs copy shader
4328- radeonsi: update outputs written nir info
4329- radeonsi: remove the extra handling for VS/TES primitive id
4330- radeonsi: set nr_pos_exports outside of llvm translation
4331- ac/nir,radv,radeonsi: legacy vs use ac_nir_export_(position|parameter)
4332- ac/nir,radv,radeonsi: gs copy shader use ac_nir_export_(position|parameter)
4333- ac/nir/ngg: fix clip dist culling mask uninitialized
4334- ac/nir/ngg: change clipdist_neg_mask_var type to uint32
4335- ac/nir/ngg,radv,radeonsi: nogs use ac_nir_export_(position|parameter)
4336- ac/nir/ngg: prepare gather_vs_outputs to be used by gs
4337- ac/nir/ngg: gs use ac_nir_export_(position|parameter)
4338- ac/nir/ngg,radv: ms use ac_nir_export_(primitive|position|parameter)
4339- nir,ac/llvm,aco: remove nir_export_primitive_amd
4340- nir,ac/llvm,aco,radv,radeonsi: remove nir_export_vertex_amd
4341- aco: remove early_rast wait insert
4342- radv: move radv_consider_force_vrs above radv_fill_shader_info
4343- radv: use amd common force_vrs option
4344- ac/llvm,radeonsi: lower nir_load_barycentric_at_sample in abi
4345- radeonsi: add num_component param to load_internal_binding
4346- ac/llvm,radeonsi: lower fbfetch in abi
4347- radeonsi: only init llvm output when needed.
4348- ac/llvm: only init outputs when fragment shader for radv
4349- aco: only ls and ps use store output now
4350- aco, radv: Add load_grid_size_from_user_sgpr to aco options.
4351- aco, radv: Move is_trap_handler_shader to aco info.
4352- ac/nir: move store_var_components to common place
4353- ac/nir: tcs write tess factor support pass by reg
4354- ac/nir: init tess factor location with IO remap
4355- ac/nir: handle tess factor output missing case
4356- ac/llvm,radeonsi: lower nir_load_ring_tess_factors_amd
4357- radeonsi: lower nir_load_ring_tess_factors_offset_amd
4358- radeonsi: monolithic TCS emit tessfactor in nir directly
4359- ac/llvm: respect channel_type when ac_build_buffer_load
4360- ac/llvm: add missing type convert for nir_load_buffer_amd
4361- nir: pack_(s|u)norm_2x16 support float16 as input
4362- ac/llvm: implement float16 nir_op_pack_(s|u)norm_2x16
4363- aco: implement float16 nir_op_pack_(s|u)norm_2x16
4364- nir,radeonsi: add and implement nir_load_alpha_reference_amd
4365- nir: add nir_fisnan helper function
4366- ac/nir: add ac_nir_lower_ps
4367- radeonsi: monolithic PS emit epilog in nir directly
4368- radeonsi: expose si_nir_load_internal_binding
4369- ac/nir: add ac_nir_load_arg_at_offset
4370- radeonsi: add si_nir_lower_vs_inputs
4371- ac/llvm: vs_rel_patch_id can also be fixed up
4372- ac/llvm: move ac_fixup_ls_hs_input_vgprs to amd common
4373- radeonsi: monolithic VS emit prolog in nir directly
4374- ac/llvm,radeonsi: remove abi->load_inputs implementation
4375- ac/llvm: remove ac_build_opencoded_load_format
4376- radeonsi: fix max scrach lds size calculation when ngg
4377- ac/nir/ngg: fix gs culling vertex liveness check for odd vertices
4378- ac/nir/ngg: fix store shared alignment
4379- ac/llvm: remove some unused code replaced by nir
4380- ac,aco: move gfx10 ngg prim count zero workaround to nir
4381- aco: fix nir_f2u64 translation
4382- ac/nir/cull: fix line position w culling
4383
4384Raun (2):
4385
4386- dzn: Enable VK_KHR_bind_memory2
4387- dzn: Enable VK_KHR_get_memory_requirements2
4388
4389Rhys Perry (48):
4390
4391- radv: implement GS load_ring_gsvs_amd/load_ring_gs2vs_offset_amd
4392- radv,aco: use ac_nir_lower_legacy_gs
4393- aco: restore semantic_can_reorder for GS output stores
4394- ac/nir: use store_buffer_amd's base index
4395- ac/llvm: add support for fp32 addition atomics
4396- aco: add support for fp32 addition atomics
4397- radv: load ssbo_atomic_fadd descriptor
4398- radv/gfx11: expose shaderBufferFloat32AtomicAdd
4399- aco/tests: fix assembler.gfx11.vop12c_v128 with LLVM 15
4400- aco/tests: update assembler tests for latest LLVM 16
4401- radv: skip creation of null TLAS for null winsys
4402- aco: set has_color_exports with GPL
4403- aco: end reduce tmp after control flow, when used within control flow
4404- aco/tests: add setup_reduce_temp.divergent_if_phi
4405- aco/spill: always end spill vgpr after control flow
4406- aco: limit VALUPartialForwardingHazard search
4407- radv: set state.vbo_misaligned_mask_invalid in radv_bind_vs_input_state
4408- ac: move ring_offsets to ac_shader_args
4409- ac/llvm: let ring_offsets be accessed like a normal arg
4410- radv/llvm: use the ring_offsets shader arg
4411- aco: fix out-of-bounds access when moving s_mem(real)time across SMEM
4412- aco: don't modify exec in p_interp_gfx11
4413- aco: don't apply modifiers through DPP to unsupported instructions
4414- aco: fix pathological case in LdsDirectVALUHazard
4415- aco: always update orig_names in get_reg_phi()
4416- radv: remove is_internal pipeline creation parameter
4417- aco/tests: add tests for v_fma_f32 with 2 fp16 literals
4418- aco: make IDSet sparse
4419- nir/range_analysis: fix vectorized phis and intrinsics
4420- nir: use xyzw order for precise fdot
4421- nir: make fdph lowering match fdot
4422- nir: add nir_lower_alu_width_test.fdot_order
4423- aco/gfx11: fix RT prolog scratch initialization
4424- aco: set needs_flat_scr=true for RT
4425- util/dynarray: allow an initial stack allocation to be used
4426- nir/range_analysis: add missing masking of shift amounts
4427- nir/range_analysis: add helpers for limiting stack usage
4428- nir/range_analysis: use perform_analysis() in nir_unsigned_upper_bound()
4429- nir/range_analysis: use perform_analysis() in nir_analyze_range()
4430- radv: fix setting radv_shader_info::user_data_0 with rt
4431- aco: don't optimize s_or_b64(v_cmp_u_f32(a, b), cmp(a, a))
4432- aco: fix nir_var_shader_out barriers for task shaders
4433- radv/gfx11: improve RT scratch allocation
4434- nir: make nir_fisnan helper exact
4435- aco: remove SMEM_instruction::prevent_overflow
4436- ac/nir/ps: fix null export write mask miss set to 0xf
4437- aco: don't move exec reads around exec writes
4438- aco: don't move exec writes around exec writes
4439
4440Rob Clark (180):
4441
4442- freedreno/ci: Switch a630 jobs over to manual
4443- freedreno/ci: Cleanup a618 yaml
4444- freedreno/ci: Add a618 egl/skqp/piglit jobs
4445- Revert "freedreno/ci: Switch also performance a630 job to manual"
4446- Revert "freedreno/ci: Switch a630 jobs over to manual"
4447- freedreno/ci: Add an a618 flake
4448- freedreno/drm: Remove assert
4449- freedreno: Fix tracking of enabled SSBOs
4450- freedreno/a6xx: Workaround for no pos/psize
4451- freedreno: Don't re-install a flushed batch
4452- freedreno/a6xx: Rework barrier handling
4453- freedreno/ir3: Stop copying options
4454- freedreno/ir3: Let driver specify fb-read descriptor
4455- freedreno: Track image/SSBO usage for all stages
4456- freedreno/ir3: Add descriptor set lowering
4457- freedreno/a6xx: Pre-bake IBO descriptor sets
4458- freedreno/a6xx: Add bindless state
4459- freedreno/a6xx: Switch over to bindless IBO
4460- freedreno/a6xx: Remove bindfull IBO state
4461- freedreno/a6xx: Removing munging of tex state for IBO
4462- freedreno/a6xx: Remove tex fb_read state
4463- freedreno/a6xx: Move compute to tex state group
4464- freedreno/a6xx: Move tex state building
4465- freedreno/a6xx: Expose SSBO/image for all shader stages
4466- freedreno: Restore GL_VENDOR string
4467- gallium/util: Add util_writes_depth() helper
4468- freedreno/a6xx: Add LRZ perf warn for ztest direction changes
4469- freedreno/a6xx: Invalidate LRZ on blend+depthwrite
4470- turnip: Rename lrz force_disable_mask
4471- turnip: Invalidate LRZ on blend+depthwrite
4472- util/xmlconfig: Use os_get_option()
4473- freedreno: Add driconf to disable conservative LRZ
4474- freedreno/a6xx: Add a few kernel regs/etc
4475- freedreno/drm: Add some ref/unref debugging
4476- freedreno/drm: Detect zombie BOs
4477- freedreno/drm: Remove bo_del_or_recycle()
4478- freedreno/drm: Split out bo->finalize()
4479- freedreno/drm: Synchronize handle close and lookup
4480- freedreno/drm/virtio: Flush before CREATE_BLOB
4481- freedreno/drm: Restart import on zombie race
4482- freedreno/gmem: Fix for partial z/s fast-clear
4483- freedreno/decode: Increase size of offsets table
4484- freedreno/a6xx: LRZ for MSAA
4485- freedreno/ir3: Scalarize load_ssbo
4486- freedreno/a6xx: Add missing CS_BINDLESS mapping
4487- freedreno/a6xx: Add CS instrlen workaround
4488- freedreno: nondraw-batch
4489- freedreno: Skip flush_resource with explicit sync
4490- freedreno/a6xx: Don't double-write SP_CS_OBJ_START
4491- freedreno: Don't open-code setting dirty CS state
4492- freedreno/a6xx: Make shader state independent of grid info
4493- freedreno/a6xx: Also FLUSH_CACHE on image barrier
4494- freedreno/a6xx: Remove excess CS flushing
4495- freedreno+ir3: Move storage_16bit to compiler options
4496- freedreno/a6xx: Move CS state to PROG state group
4497- freedreno/drm: Move sa_cpu_prep() to core
4498- freedreno/drm/virtio: Limit guest handles passed to virtgpu
4499- ir3: Quiet unused variable warning
4500- freedreno: Quiet unused variable warnings
4501- freedreno/a2xx: Move pack_rgba()
4502- freedreno: Indent fixes
4503- freedreno/a6xx: Move num_driver_params to program state
4504- freedreno: Move num_vertices calc to backend
4505- freedreno: Remove impossible NULL check
4506- freedreno: Add FD_DIRTY_QUERY
4507- freedreno: Avoid screen lock when no rsc tracking needed
4508- freedreno: Account for multi-draw in num_draws
4509- freedreno: Push num_draws down to backend
4510- freedreno/a6xx: Drop unused return
4511- freedreno/a6xx: Split out flush_streamout() helper
4512- freedreno/a6xx: Multi-draw support
4513- freedreno/a6xx: Do tex-state invalidates in same ctx
4514- freedreno/drm: Make rb refcnt non-atomic
4515- freedreno/a6xx: Remove tex-state refcnting
4516- freedreno: Move blend out of dirty-rsc tracking
4517- freedreno: Move FD_MESA_DEBUG cases out of draw_vbo
4518- freedreno/a6xx: Pre-compute PROG related LRZ state
4519- freedreno: Avoid taking screen lock
4520- freedreno/batch: Stop tracking cross-context deps
4521- freedreno: Drop batch lock
4522- freedreno: Add seqno helper
4523- freedreno/drm: Optimize stateobj re-emit
4524- freedreno/a6xx: Move rsc seqno out of tex cache key
4525- freedreno/a6xx: Fix set_sampler_views(start != 0)
4526- freedreno/a6xx: Drop unneed fd6_texture_state() arg
4527- freedreno/a6xx: Fix sampler view rsc_seqno for X32_S8X24
4528- freedreno/a6xx: Add a way to assert valid format
4529- freedreno/a6xx: Remove needs_invalidate flag
4530- freedreno/a6xx: Small cleanup
4531- freedreno/a6xx: Static-ify sampler_view_update()
4532- freedreno/a6xx: Fix view_seqno in tex cache key
4533- freedreno/cffdec: Fix unitialized count for pkt2
4534- freedreno/cffdec: Add helper to find next pkt
4535- freedreno/cffdec: Add helper to parse CP_INDIRECT_BUFFER
4536- freedreno/cffdec: Fix hang location detection
4537- freedreno/crashdec: Refactor crashdec tests
4538- freedreno/crashdec: Add another prefetch test
4539- freedreno/crashdec: Handle multi-IB prefetching
4540- freedreno/crashdec: Disable GALLIUM_DUMP_CPU
4541- vk/runtime: Allow enumerate and try_create_for_drm to coexist
4542- turnip: Move things to prep for multi-kernel support
4543- turnip: drm code-motion
4544- turnip: Split out vfuncs for kernel interface
4545- turnip: Allow knl backend specific entrypoints
4546- turnip: Move QueueWaitIdle entrypoint to kgsl
4547- turnip: Handle kgsl vs drm specifics at runtime
4548- turnip: Refactor device loading
4549- tu+meson: Re-work KMD selection
4550- tu/kgsl: Propagate tu_physical_device_init() errors
4551- Revert "freedreno: Account for multi-draw in num_draws"
4552- freedreno/a6xx: Namespace reg/pkt packer vars
4553- freedreno/a6xx: Convert blitter to OUT_REG()
4554- freedreno/a6xx: Fix mirror x/y blits
4555- util: Add a simple no-op libdrm shim
4556- turnip: Use libdrm shim
4557- loader: Use libdrm shim
4558- vk/runtime: Use libdrm shim
4559- freedreno/common: Replace or_mask() with BitsetEnum<T>
4560- freedreno: Promote non-drawing batches to sysmem
4561- freedreno: Nerf strict-aliasing warning for all of gcc
4562- freedreno/registers: Schema validation for gen_header.py
4563- freedreno/registers: Add regs for a690
4564- freedreno: Quiet c++ warning about designated initializers
4565- freedreno/ir3: Un-inline enums
4566- freedreno/ir3: Don't use negative opc for meta instructions
4567- freedreno/ir3: c++-proof the headers
4568- freedreno/ir3+tu: Calculate subgroup size in ir3
4569- freedreno/ir3: Add missing driver params
4570- freedreno: Un-inline buffer-mask enum
4571- freedreno: c++-proofing
4572- freedreno/a6xx: Rework texture_clear fallback
4573- freedreno/a6xx: Add missing "inline"
4574- freedreno/a6xx: Fix designator initializer order
4575- freedreno/a6xx: Convert to c++
4576- freedreno/registers: Fix designator order
4577- freedreno/registers: Add prefix="variant"
4578- freedreno/registers: Merge a6xx and a7xx regs
4579- freedreno/registers: Start adding a7xx pipe/control regs
4580- freedreno/decode: Start adding a7xx support
4581- freedreno/registers: Start adding stuff for a7xx
4582- freedreno/registers: Track varset
4583- freedreno/registers: Split out regpair builder helper
4584- freedreno/registers: Add c++ magic for register variants
4585- freedreno/registers: Fix nameless fields
4586- freedreno/registers: Define rest of CP_REG_WRITE
4587- freedreno/a6xx: Simplify iova emit
4588- mesa: Rework discard_framebuffer()
4589- driconf: Add ignore_discard_framebuffer option
4590- driconf: Work around incorrect GI discard/invalidate
4591- freedreno: Specify GMEM tile alignment per GPU
4592- freedreno+tu: Big GMEM support
4593- freedreno+tu: Add a690 support
4594- freedreno/a6xx: Restore mode
4595- freedreno/rnn: Fix reg names for regs with variants
4596- freedreno/afuc: Add raw mode for disasm
4597- freedreno/registers: Add control reg for zap fw base
4598- Revert "CI: Disable freedreno"
4599- dri2/android: Bypass throttling
4600- freedreno/drm: Fast path for idle check
4601- freedreno/drm: Stop cleanup at first active BO
4602- mesa: Add a few more function traces
4603- freedreno/drm: Make threaded-submit optional
4604- freedreno/drm: Disable threaded-submit for msm
4605- freedreno: Optimize repeated finishes
4606- freedreno: Stop being too clever by half
4607- freedreno: Hoist dirty vars
4608- freedreno: Extract out a helper
4609- freedreno: Inline single-use helpers
4610- freedreno: Re-work dirty-resource tracking
4611- freedreno: Avoid looping shader stages if nothing dirty
4612- freedreno: Move driconf settings into sub-struct
4613- freedreno: Support the disable_throttling=true driconf option
4614- util/disk_cache: Split out queue initialization
4615- util/disk_cache: Add NONE type
4616- util/disk_cache: Use queue state to skip put
4617- util/disk_cache: Move blob_put_cb to the async queue
4618- freedreno/a6xx: Allow z24s8 format casts
4619- freedreno/a6xx: Fix valid_format_cast logic for newer a6xx
4620- freedreno: Fix resource tracking vs rebind/invalidate
4621- dri/android: Fix MSAA resolve
4622
4623Rohan Garg (9):
4624
4625- iris: Don't flush the render cache for a compute batch
4626- anv: drop unused headers
4627- anv: reuse the VK_IMAGE_ASPECT_PLANES_BITS_ANV macro
4628- isl: fix some documentation
4629- anv/blorp: use existing function to convert the op to a string
4630- anv: break out of the loop when the first color attachment is found
4631- anv,hasvk: cleanup unused enum
4632- intel/genxml: Add the preferred slm size enum for gen125
4633- anv,blorp,iris: Set PreferredSLMAllocationSize on gfx125+
4634
4635Roland Scheidegger (2):
4636
4637- llvmpipe: only use accurate_a0 hack if there are no textures bound
4638- lavapipe, nir: Fix wrong array index scaling in nir_collect_src_uniforms
4639
4640Rose Hudson (4):
4641
4642- radeonsi: report 0 block size for Polaris HEVC encoding
4643- asahi: wire up shader disk cache support
4644- agx: isolate compiler debug flags
4645- asahi: disable disk cache in debug runs
4646
4647Ruijing Dong (5):
4648
4649- frontends/va: revert commit 0b02db30
4650- raseonsi/vcn: fix a h264 decoding issue
4651- frontends/va: disable skip_frame_enable in vaapi interface.
4652- radeonsi/vcn: correct cropping for hevc case
4653- radeonsi/vcn: fix decoding bs buffer alignement issue.
4654
4655Ryan Neph (17):
4656
4657- ci: fix directory existence racing in parallel test execution
4658- util/u_process: add MESA_PROCESS_NAME override to util_get_process_name()
4659- util/u_process: remove util_get_process_name_may_override()
4660- util/xmlconfig: add MESA_DRICONF_EXECUTABLE_OVERRIDE
4661- venus: update venus-protocol headers to partially fix WA1
4662- venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant
4663- ci: uprev virglrenderer
4664- venus: update venus-protocol headers to fix WA1
4665- Revert "venus: temporarily redirect VkDrmFormatModifierPropertiesListEXT to "2" variant"
4666- venus: add vn_relax_init/_fini()
4667- venus: set/check ring status bits independently
4668- venus: init exp features before ring init again
4669- venus: update to latest protocol for ringMonitoring
4670- venus: check and configure new ringMonitoring feature
4671- venus: re-use VN_DEBUG_NO_ABORT to disable ring monitoring abort()
4672- virgl: hook new get_fd proc for drm winsys
4673- i915: hook new get_fd proc for drm winsys
4674
4675Sagar Ghuge (10):
4676
4677- iris: Stop marking context unconditionally as guilty
4678- intel/fs: Always stall between the fences on Gen11+
4679- nir: Handle other variants of image_samples properly while lowering
4680- intel/compiler: Add swsb_stall debug option
4681- anv: Implement Wa_14015297576
4682- iris: Implement Wa_14015297576
4683- intel/compiler: Add Wa_14014063774 for slm_fence
4684- intel/decoder: Bump the binding table guess value to 32
4685- anv: Drop unused param from add_surface_reloc
4686- anv: Drop dead code that sets the L3BypassDisable field
4687
4688Sai Teja Pottumuttu (2):
4689
4690- iris: Fix to release BO immediately if not busy
4691- anv: Fix stride mismatch in mesa and minigbm
4692
4693Sajeesh Sidharthan (3):
4694
4695- radeonsi/vcn: disable fence for JPEG decoding
4696- radeonsi/vcn: set bitstream buffer size to encoded bitstream size
4697- radeonsi/vcn: optimize bitstream buffer resize logic
4698
4699Sam Edwards (1):
4700
4701- nouveau: Fix null dereference in nouveau_pushbuf_destroy
4702
4703Samuel Iglesias Gonsálvez (1):
4704
4705- docs/developers: Add Igalia as Mesa consultancy
4706
4707Samuel Pitoiset (279):
4708
4709- radv: fix missing implementation of creating images from swapchains
4710- radv: fix hashing pipeline keys if RADV_PERFTEST=ngg_streamout is used
4711- radv: fix re-emitting RB+ when the non-compacted color format changes
4712- ac/nir: clear unused components before storing XFB outputs to LDS
4713- ac: add TC_OP_ATOMIC_SUB_32
4714- radv: fix setting MAX_MIP for BC views
4715- radv: fix buffer to image copies with BC views on the graphics queue
4716- radv: fix creating BC image views when the base layer is > 0
4717- radv: rename ac_surf_nbc_view::max_mip to num_levels
4718- radv: move some color blend helpers to radv_private.h
4719- radv: add a new helper for normalizing blend factors
4720- radv: add support for dynamic blend equation
4721- radv: enable compiling PS epilogs on-demand for dynamic color blend equations
4722- radv: fix detecting that blend is enabled when all CB states are dynamic
4723- radv: advertise extendedDynamicState3ColorBlendEquation
4724- radv: remove an old FIXME about a possible bug with TC-compat HTILE
4725- radv/winsys: fix incorrect PCIID for GFX11 in the null winsys
4726- radv: print depth image size with RADV_DEBUG=img
4727- radv: fix RADV_DEBUG=hang with multiple cmdbuffer per submission
4728- radv/winsys: prefix all error messages with RADV
4729- radv: fix creating libraries with PS epilog and all CB states as dynamic
4730- radv: fix ignoring graphics shader stages that don't need to be imported
4731- radv: add a layer for fixing rendering issues with RAGE2
4732- radv: simplify VK_PIPELINE_CREATE_FAIL_ON_PIPELINE_COMPILE_REQUIRED
4733- radv: pass the number of stages to radv_hash_shaders()
4734- radv: split radv_create_shaders() between graphics and compute shaders
4735- radv: rename radv_create_shaders() to radv_graphics_pipeline_compile()
4736- radv: ignore all CB dynamic states when there is no color attachments
4737- radv: regroup dynamic states initialization
4738- radv: only initialize non-zero values for the default dynamic state
4739- radv: stop setting INTERPOLATE_COMP_Z
4740- radv: fix RB+ for SRGB formats
4741- radv: adjust ACCUM tessellation fields on GFX11+
4742- radv: fix GPL fast-linking with libs that have retained NIR shaders
4743- radv: skip shaders cache for fast-linked pipelines with GPL
4744- radv: remove useless check about CS in radv_lower_io()
4745- radv: simplify pipeline_has_ngg during graphics shaders compilation
4746- radv: add helpers for capturing shaders and statistics
4747- radv: pass radv_graphics_pipeline to radv_graphics_pipeline_compile()
4748- radv: move retained shaders info to radv_graphics_pipeline
4749- radv: pass radv_compute_pipeline to radv_compute_pipeline_compile()
4750- radv: pass pCreateInfo to radv_graphics_pipeline_compile()
4751- radv: optimize radv_pipeline_layout_add_set() slightly
4752- radv: remove redundant zero initialization of pipeline layout
4753- radv: remove radv_pipeline_stage::spirv::sha1
4754- radv: allow to create a noop FS in a library with GPL
4755- radv: remove one unused variable in radv_graphics_lib_pipeline_init()
4756- radv: pass the lib flags for generating the pipeline key
4757- radv: return a boolean value in radv_pipeline_needs_dynamic_ps_epilog()
4758- radv: stop using the graphics pipeline key after compilation
4759- radv: determine the last VGT API stage earlier
4760- radv: skip compilation when possible with GPL fast-linking
4761- radv: simplify an assertion after considering RADV_FORCE_VRS
4762- radv: do not insert fast-linked libraries to the shaders cache
4763- radv: fix skipping graphics pipeline compilation when the FS is NULL
4764- radv: cleanup graphics pipeline library flags uses
4765- radv: simplify determining when the fragment shader needs an epilog
4766- radv: regroup PS epilog info when generating the graphics pipeline key
4767- radv: fix disabling MRT compaction for on-demand PS epilogs
4768- radv: make sure to disable MRT compaction when compiling a PS epilog with GPL
4769- radv: simplify creating a FS epilog from a library
4770- radv: stop skipping the cache for compute/raytracing pipelines with GPL
4771- radv: stop skipping the cache for monolithic graphics pipelines with GPL
4772- docs: add missing RADV_PERFTEST=video_decode
4773- docs: stop reporting RADV_PERFTEST=gpl as experimental/suboptimal
4774- radv/ci: set RADV_PERFTEST=GPL for all VKCTS jobs
4775- radv/ci: bump the number of runners to 3 for vkcts-navi21-valve
4776- radv: restore uploading shaders individually instead of consecutively
4777- radv: implement graphics shaders relocation for a RGP workaround
4778- radv: fix importing retained NIR shaders when a lib uses the RETAIN bit
4779- radv: use last_vgt_api_stage for determining the last stage with XFB
4780- radv: only initialize shader arguments for the active stages
4781- radv simplify compiling graphics shaders with a mask of active NIR stages
4782- radv: disable DCC for mipmaps on GFX11
4783- radv: ignore registering pipeline libaries with SQTT
4784- radv/ci: add missing expected failures with RADV_PERFTEST=gpl on GFX1100
4785- radv: reduce maximum line width to 8.0
4786- radv: add support for rectangularLines
4787- Revert "radv: acquire pstate on-demand when capturing with RGP"
4788- radv/amdgpu: only set a new pstate if the current one is different
4789- radv: only skip emitting the pipeline blend state if the FS uses an epilog
4790- radv: stop using a PS epilog when the FS doesn't write any color outputs
4791- ci: uprev vkd3d-proton
4792- zink/ci: skip KHR-GL46.texture_swizzle.functional with RADV
4793- zink/ci: set RADV_PERFTEST=gpl for RADV jobs
4794- radv/ci: disable vkcts-kabini-valve
4795- radv/ci: move CI lists for external GPUs in separate folder
4796- radv: configure SQ_THREAD_TRACE_CTRL.REG_AT_HWM on GFX11
4797- radv: only enable SQTT for SE0 on GFX11
4798- radv: make sure to wait for the trace buffer also on GFX11
4799- radv: implement a workaround for SQTT on GFX11
4800- radv: disable SPM counters with RGP on GFX11
4801- radv: enable SQTT tracing on GFX11
4802- radv: set VS_OUT_MISC_SIDE_BUS_ENA for clip distances on GFX10.3+
4803- radv/ci: cleanup CI lists for dEQP-VK.memory.* tests that timeout
4804- ac/nir: add resinfo lowering for sliced storage 3D views
4805- radv: implement VK_EXT_image_sliced_view_of_3d on GFX10+
4806- radv: advertise VK_EXT_image_sliced_view_of_3d on GFX10+
4807- radv: cleanup radv_emit_{conservative,msaa}_state() functions
4808- radv: stop setting ENABLE_POSTZ_OVERRASTERIZATION to 1
4809- radv: set MSAA_NUM_SAMPLES to 0 for underestimate rasterization
4810- radv: enable primitiveUnderestimation on GFX9+
4811- zink/ci: skip one more test that timeout with RADV
4812- radv: fix flushing non-coherent images inside secondaries on GFX9+
4813- radv: fix flushing non-coherent images in EndCommandBuffer()
4814- radv: fix draw calls with 0-sized index buffers and robustness on NAVI10
4815- radv: only expose EXT_pipeline_library_group_handles if RT is enabled
4816- amd,ac/rgp: fix SQTT memory types
4817- radv: ignore alpha_is_on_msb on GFX11 because the hw ignores it
4818- radv: use new EVENT_WRITE_ZPASS packet3 on GFX11
4819- radv: fix DCC decompress on GFX11
4820- radv: stop allocationg the attr ring BO for compute queues on GFX11
4821- ci: uprev CTS to 1.3.5.0
4822- radv/ci: adjust timeouts for Vega10 and Renoir
4823- radv/ci: stop skipping some graphics pipeline library tests
4824- radv/ci: update CI lists for CTS 1.3.5.0 on GFX110/POLARIS10/PITCAIRN
4825- wsi: move an assertion in wsi_xxx_surface_get_capabilities2()
4826- radv: do not add descriptor BOs on update when the global BO list is used
4827- radv: fix incorrect stride for primitives generated query with GDS
4828- radv: fix border color swizzle for stencil-only format on GFX9+
4829- radv: fix defining RADV_USE_WSI_PLATFORM
4830- radv: move disabling DCC for VRS rate images in radv_get_surface_flags()
4831- ac/surface: add RADEON_SURF_VRS_RATE for selecting swizzle mode on GFX11
4832- radv: add support for VRS attachment on GFX11
4833- radv: do not emit PA_SC_VRS_OVERRIDE_CNTL from the pipeline on GFX11
4834- radv: advertise attachmentFragmentShadingRate on GFX11
4835- radv: enable VK_KHR_fragment_shading_rate on GFX11
4836- radv: disable DCC with signedness reinterpretation on GFX11
4837- radv: move instance related code to radv_instance.c
4838- radv: move physical device related code to radv_physical_device.c
4839- radv: move queue related code to radv_queue.c
4840- radv: move sampler related code to radv_sampler.c
4841- radv: move event related code to radv_event.c
4842- radv: move buffer related code to radv_buffer.c
4843- radv: move device memory related code to radv_device_memory.c
4844- radv: zero-initialize radv_shader_args right before declaring them
4845- radv: zero-initialize radv_shader_info earlier for graphics pipeline
4846- radv: fix the error code when the driver fails to create a PS epilog
4847- radv: determine if a graphics pipeline needs a noop FS earlier
4848- radv: keep track of the retained NIR shaders sha1 for LTO pipelines
4849- radv: allow to cache optimized (LTO) pipelines with GPL
4850- radv: rename RADV_PIPELINE_LIBRARY to RADV_PIPELINE_RAY_TRACING_LIB
4851- radv: add helpers for destroying various pipeline types
4852- radv: fix NGG streamout with VS and GPL on GFX11
4853- spirv: add SpvCapabilityFragmentFullyCoveredEXT
4854- spirv,nir: add support for SpvBuiltInFullyCoveredEXT
4855- radv: lower nir_intrinsic_load_fully_covered
4856- radv: enable SAMPLE_COVERAGE_ENA if the fully covered built-in is used
4857- radv: implement fullyCoveredFragmentShaderInputVariable
4858- radv: enable fullyCoveredFragmentShaderInputVariable on GFX9+
4859- radv: remove set but never used num_preserved_sgprs
4860- radv: stop storing the binary as part of radv_shader_part
4861- radv: store spi_shader_col_format to radv_shader_part_binary
4862- radv: store the total radv_shader_part_binary size
4863- radv: upload prologs/epilogs as part of radv_shader_part_create()
4864- radv: allow to return the PS epilog binary to the pipeline
4865- radv: make radv_shader_part_create() non-static
4866- radv: add support for caching PS epilogs
4867- radv: stop using radv_get_shader_shader() for task shaders
4868- radv: replace radv_lookup_user_sgpr() by radv_get_user_sgpr()
4869- radv: pass shader/base_reg to radv_emit_descriptor_pointers()
4870- radv: pass shader/base_reg to radv_emit_inline_push_consts()
4871- radv: pass shader/base_reg to radv_emit_userdata_address()
4872- radv: use a separate compute path in radv_flush_constants()
4873- radv: pass radv_shader to radv_dump_shader_stats()
4874- radv: rework dumping shaders when a GPU hang is reported
4875- radv: pass radv_ray_tracing_pipeline to radv_rt_pipeline_compile()
4876- radv: pass radv_shader to radv_shader_need_indirect_descriptor_sets()
4877- radv: implement VK_KHR_map_memory2
4878- radv: use common GetBufferMemoryRequirements2()
4879- radv: move cs_regalloc_hang_bug to radv_shader_info
4880- radv: pass a radv_shader to radv_emit_dispatch_packets()
4881- radv: remove radv_pipeline::device completely
4882- radv: rework binding shaders to cmdbuf by introducing new helpers
4883- radv: move radv_meta_* to a new folder
4884- radv: copy the multisample state to radv_cmd_state
4885- radv: move uses_user_sample_locations to radv_multisample_state
4886- radv: separate the sample shading state between FS and graphics pipeline
4887- radv: add DI_PT_RECTLIST to si_conv_prim_to_gs_out()
4888- radv: stop checking dynamic states when emitting the guardband state
4889- radv: rename gfx9_gs_info to radv_legacy_gs_info
4890- radv: move {esgs,gsvs}_ring_size to radv_legacy_gs_info
4891- radv/rt: bind the pipeline stack when it's not dynamic
4892- radv/ci: update CI lists for Polaris10 and Pitcairn
4893- radv: stop using get_vs_output_info() when emitting VS/NGG shaders
4894- radv: emit the GS copy shader outside of radv_pipeline_emit_hw_gs()
4895- radv: add radv_get_last_vgt_shader() helper
4896- radv: stop using the pipeline for emitting PS inputs
4897- radv: use the shader info stage to simplify emitting NGG shaders
4898- radv: use the ES type to apply a workaround for NGG on GFX10
4899- radv: pass the ES shader to radv_pipeline_emit_hw_ngg()
4900- radv: stop using the pipeline for emitting shaders
4901- radv: pass shader/base_reg to radv_emit_view_index_per_stage
4902- radv: pass a shaders array to radv_get_shader()
4903- radv: add radv_bind_shader() helper
4904- radv: add an assertion about shader stage to radv_bind_pre_rast_shader()
4905- radv: keep track of active stages as part of the cmdbuf state
4906- radv: determine the last VGT shader at pipeline bind time
4907- radv: stop using last_vgt_api_stage_{locs} during cmdbuf recording
4908- radv: move dirtying flags for mesh shading to radv_bind_pre_rast_shader()
4909- radv: copy bound shaders to the cmdbuf state
4910- radv: determine and store the next graphics stage to radv_shader_info
4911- radv: move user_data_0 to the shader info pass
4912- radv: replace pipeline->is_ngg occurrences during cmdbuf recording
4913- radv: replace pipeline->force_vrs_per_vertex during cmdbuf recording
4914- radv: use serialized NIR for graphics libs with the RETAIN flag
4915- radv: remove radv_graphics_pipeline::use_per_attribute_vb_descs
4916- radv: remove radv_graphics_pipeline::last_vertex_attrib_bit
4917- radv: remove radv_graphics_pipeline::next_vertex_stage
4918- radv: remove radv_graphics_pipeline::can_use_simple_input
4919- aco: remove unused aco_shader_info::vb_desc_usage_mask
4920- radv: adjust vb_desc_usage_mask for dynamic VS inputs in the info pass
4921- radv: remove radv_graphics_pipeline::vb_desc_usage_mask
4922- radv: remove radv_graphics_pipeline::vb_desc_alloc_size
4923- radv: rework emitting inner coverage when a fragment shader is bound
4924- radv: copy custom blend mode to the cmdbuf state
4925- radv: add a helper that returns the current rasterized primitive
4926- radv: copy rast_prim to the cmdbuf state
4927- radv: copy uses_{drawid,baseinstance} to the cmdbuf state
4928- radv: copy ia_multi_vgt_param to the cmdbuf state
4929- radv: add a helper to convert a VkPipelineBindPoint
4930- radv: copy need_indirect_descriptor_sets to radv_cmd_state
4931- radv: add push constant state to the cmdbuf state
4932- radv: fix sample shading when a new fragment shader is bound
4933- vulkan: add dynamic support for rectangles enable/mode
4934- radv: add dynamic support for rectangles enable/mode
4935- vulkan: Update XML and headers to 1.3.246
4936- radv: copy db_render_control to the cmdbuf state
4937- radv: set PS_ITER_SAMPLE(1) for sample shading during cmdbuf recording
4938- radv: configure PA_SC_MODE_CNTL_1 during cmdbuf recording
4939- radv: add the raygen shader BO to the cmdbuf list
4940- radv: fix binding raytracing/compute pipelines
4941- zink/ci: remove primitive-id-no-gs-quads from the NAVI10 fail list
4942- radv/ci: add one more flake
4943- radv: only copy non-NULL shaders when loaded from the cache
4944- radv: rely on non-NULL binaries when inserting shaders to the cache
4945- radv: allow to create/insert PS epilogs from/to the cache for libs
4946- radv: remove dead code in radv_pipeline_get_nir()
4947- radv: add VkGraphicsPipelineLibraryFlag to the graphics pipeline key
4948- radv: ensure to retain NIR shaders for GPL libs found in the cache
4949- radv: enable shaders cache for libraries with GPL
4950- radv: fix VS prologs with GPL and static binding stride
4951- radv: emit the PS epilog after the graphics pipeline
4952- radv: add a helper for retaining NIR shaders
4953- radv: move the serialized NIR to radv_graphics_lib_pipeline
4954- radv: simplify a check when retaining NIR shaders
4955- radv: do not retain noop FS for libs when a cache hit happened
4956- radv: import retained NIR shaders later in the compilation process
4957- radv/rt: stop storing unused hashes/identifiers
4958- radv: create a helper for copying VkPipelineShaderStageCreateInfo
4959- radv: copy stages instead of serializing NIR for GPL with the RETAIN flag
4960- radv: enable VK_EXT_graphics_pipeline_library by default
4961- radv/ci: update expected failures for PITCAIRN
4962- radv/ci: remove no longer existing tests for PITCAIRN
4963- radv/ci: update expected failures with BONAIRE
4964- docs: add more release notes for RADV
4965- radv: fix re-emitting vertex user SGPRs when binding a graphics pipeline
4966- radv/ci: remove one RT test from the expected failures on RDNA3
4967- radv: split radv_pipeline.c into radv_pipeline_{compute,graphics}.c
4968- radv: fix pipeline creation feedback with imported graphics libs
4969- radv: cleanup after splitting radv_pipeline.c
4970- radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines
4971- vulkan: ignore rasterizationSamples when the state is dynamic
4972- radv: try to keep HTILE compressed for READ_ONLY_OPTIMAL layout
4973- radv: re-emit the guardband state when related PSO are bound
4974- radv: disable fast-clears with CMASK for 128-bit formats
4975- radv: do not allow 1D block-compressed images with (extended) storage on GFX6
4976- radv: fix usage flag for 3D compressed 128 bpp images on GFX9
4977- radv: update binning settings to work around GPU hangs
4978- radv/amdgpu: fix adding continue preambles and postambles BOs to the list
4979- radv: wait for occlusion queries in the resolve query shader
4980- radv: delay enabling/disabling occlusion queries at draw time
4981- radv: track DB_COUNT_CONTROL changes to avoid context rolls
4982- radv: add the perf counters BO to the preambles BO list
4983- radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+
4984- ac/nir: fix 8-bit/10-bit PS exports clamping
4985- radv: fix dynamic depth clamp enable support
4986- radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS}
4987- radv: disable RB+ blend optimizations on GFX11 when a2c is enabled
4988
4989Sarah Walker (1):
4990
4991- pvr: Update FWIF transfer queue register structures
4992
4993Sathishkumar S (8):
4994
4995- radeonsi/vcn: add register definitions for JPEG 4.0.3
4996- radeonsi/vcn: use register versions for jpeg
4997- radeonsi/vcn: add support for picture crop on JPEG 4.0.3
4998- radeonsi/vcn: support ARGB/RGBA conversion on JPEG 4.0.3
4999- radeonsi/vcn: set jpeg reg version for gfx940
5000- radeonsi/vcn: reset to default value when ROI/FC is not used
5001- frontends/va: support crop region in jpeg decode
5002- radeonsi/vcn: enable RGBA/ARGB formats on gfx940 jpeg
5003
5004Sebastian Wick (1):
5005
5006- loader: do not check the mesa DRI_Mesa version if it was not found
5007
5008Sergi Blanch Torne (8):
5009
5010- ci: disable Collabora's LAVA lab for maintance
5011- Revert "ci: Collabora's LAVA lab for maintance"
5012- ci: Uprev kernel to 6.1.7
5013- ci: disable Collabora's LAVA lab for maintance
5014- ci: disable Collabora's LAVA lab for maintance
5015- ci: include setup test environment script in the output artifacts
5016- Revert "ci: disable Collabora's LAVA lab for maintance"
5017- ci: disable Collabora's LAVA lab for maintance
5018
5019Sidney Just (4):
5020
5021- zink: Fix non debug builds failing to compile on
5022- loader: Add missing brace to fix compile
5023- zink: add check for samplerMirrorClampToEdge Vulkan 1.2 feature
5024- zink: Add missing features to the profile file
5025
5026Sil Vilerino (15):
5027
5028- d3d12: Honor suggested driver profile/level for H264/HEVC encode
5029- d3d12: Video processing - Fix out of bounds array access
5030- d3d12: Video Encode - Fix ID3D12CommandAllocator leak
5031- d3d12: Fix VP9 Decode - Checking 0xFF instead of 0x7F for invalid frame_ref[i].Index7Bits
5032- frontend/va: Add format support checks for VA_RT_FORMAT_* in VaCreateConfig/VaGetConfigAttributes
5033- frontend/va: Remove duplicate code in format support checking/reporting.
5034- frontend/va: Keep track of some VP9 previous frame data for current frame use_prev_in_find_mvs_refs
5035- d3d12: VP9 Decode - Fix use_prev_in_find_mvs_refs calculation
5036- d3d12: Fix video decode for interlaced streams with reference only textures required
5037- d3d12: H264/HEVC Encode - Set both VBV InitialCapacity/Size in CBR Rate Control to same value when requested
5038- d3d12: Encode H264/HEVC - Do not write PPS unless different from active
5039- d3d12: Encode - Only upload headers when written headers size is > 0
5040- nir: Fix use of alloca() without #include c99_alloca.h
5041- Revert "d3d12: Honor suggested driver profile/level for H264/HEVC encode"
5042- d3d12: Video processor to only promote resources to permanent residency when there is work to be flushed
5043
5044Simon Fels (2):
5045
5046- venus: allow vtest socket being specified by env variable
5047- virgl/vtest: allow socket being specified by env variable
5048
5049Simon Perretta (38):
5050
5051- pvr: Add new Rogue compiler framework
5052- pvr: Add support for optional instruction params
5053- pvr: Support dual-destination ALU instructions
5054- pvr: Commonise some instruction member defs
5055- pvr: Drop the ENUM_PACKED macro
5056- pvr: Keep NIR SSA defs instead of registers
5057- pvr: Adjust instruction repeat offset
5058- pvr: Validate instruction repeat and src/dst sizes
5059- pvr: Add block printing support during validation
5060- pvr: Clarify unreachable text
5061- pvr: Add ADD64 support
5062- pvr: Add memory load support
5063- pvr: Add bitwise instruction support
5064- pvr: Additional register subarray support
5065- pvr: Support loading immediate values
5066- pvr: Load descriptors from memory
5067- pvr: Split pvr_private.h
5068- pvr: Use descriptor/set/table offsets from driver
5069- pvr: Add NIR pass to lower vars to SSA
5070- pvr: Amend subarray ownership code
5071- pvr: Add support for fitr.pixel
5072- pvr: Add support for sample instructions
5073- pvr: Add support for validating modifier combos
5074- pvr: Add support for emitpix
5075- pvr: Add support for WOP
5076- pvr: Register allocation improvements
5077- pvr: Fix descriptor set address calculation
5078- pvr: Add support for generating per-job EOT program
5079- pvr: Add support for generating NOP program
5080- pvr: Add support for IDF
5081- pvr: Add support for ST
5082- pvr: Add branch support
5083- pvr: Add support for TST
5084- pvr: Add basic support for manual instruction grouping
5085- pvr: Add support for MOVC
5086- pvr: Add late op lowering pass and conditional execution
5087- pvr: Amend definitions for ST and IDF
5088- pvr: Add encodings for index registers
5089
5090Simon Ser (1):
5091
5092- egl: fix fd_display_gpu on surfaceless and device platforms
5093
5094Sonny Jiang (5):
5095
5096- radeonsi: Add NV12 support for AV1
5097- gallium/pipe: change PIPE_DEFAULT_DECODER_FEEDBACK_TIMEOUT_NS to 1 second
5098- amd/common: Add gfx940 codec query support
5099- radeonsi/vcn: Add video capabilities support for gfx940
5100- radeonsi/vcn: Add decode support for gfx940
5101
5102SoroushIMG (31):
5103
5104- zink: add pass checking for lod overflow in txf
5105- zink: add zink_cs_key
5106- zink: add VK_EXT_image_robustness
5107- zink: add robust_access field to shader key
5108- zink: lower LOD-invalid txf when imageRobustAccess2 is missing
5109- zink: update gl43 profile to allow imageRobustAccess
5110- zink: fix sparse residency query and minLOD feature checks
5111- zink: fix cap check for arb sparse texture2
5112- zink: only save frag const buffers when used by blit
5113- zink: fix leak when rebinding same image surface
5114- zink: clear null image surfaces to 0
5115- zink: fix pointcoord y inversion
5116- zink: relax bresenhamLines requirement for non-strictLine drivers
5117- zink: fix compute shader leaks
5118- zink: allocate program shader caches from the program's mem ctx
5119- zink: stop creating pipeline library cache for non-optimal_key drivers
5120- zink: free resource objects' views array during destruction
5121- zink: fix stale point sprite mode state
5122- zink: fix shadow mask change logic when binding sampler views
5123- zink: track shadow swizzle for all shader stages
5124- zink: minor formatting change
5125- zink: add needs_zs_shader_swizzle shader key
5126- zink: extend shadow swizzle pass to all zs textures
5127- zink: add depth/stencil needs shader swizzle workaround field
5128- zink: workaround undefined swizzle 1 for z/s textures
5129- zink: rename shadow key to zs swizzle
5130- zink: Add driver name and API version to renderer name
5131- zink: do not emit line stipple dynamic state when emulating
5132- zink: take location_frac into account in lower_line_smooth_gs
5133- zink: fix incorrect line mode check for bresenham
5134- zink: refcount the correct query pool
5135
5136Sui Jingfeng (1):
5137
5138- meson: add basic support for loongarch
5139
5140SureshGuttula (1):
5141
5142- radeonsi: Add support for DPB resize
5143
5144Sviatoslav Peleshko (9):
5145
5146- anv: Handle VkAccelerationStructureBuildRangeInfoKHR::transformOffset
5147- driconf/anv: Apply limit_trig_input_range WA to Rise of the Tomb Raider
5148- iris: Avoid creating uncompressed view with unaligned tile offsets on BDW
5149- anv: Handle all fields in VkAccelerationStructureBuildRangeInfoKHR
5150- anv: Move WA MEDIA_VFE_STATE after stalling PIPE_CONTROL
5151- glsl: Fix codegen for constant ir_binop_{l,r}shift with mixed types
5152- isl: Check all channels in isl_formats_have_same_bits_per_channel
5153- anv: Handle UNDEFINED format in image format list
5154- anv: Improve image/view usage bits verification
5155
5156Tapani Pälli (42):
5157
5158- intel/compiler: add cpp_std=c++17 when building tests
5159- intel/hasvk: remove some stale comments, wa was removed
5160- anv: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
5161- hasvk: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
5162- iris: add restrictions for 3DSTATE_RASTER::AntiAliasingEnable
5163- mesa: move component bits queries as GL ES only
5164- intel/genxml: set unused 3DSTATE_PS_EXTRA field as mbz
5165- intel: enable existing workaround for ICL platform
5166- intel/blorp: disable REP16 for gfx12+ with R10G10B10_FLOAT_A2
5167- iris: disable preemption for 3DPRIMITIVE during streamout
5168- iris: handle error in iris_resource_from_handle
5169- spirv: add workaround for Metro Exodus in spirv_to_nir
5170- radv: revert Metro Exodus workaround which was moved to common code
5171- mesa/st: refactor st_destroy_texcompress_compute condition
5172- mesa/st: add astc decoder lookup tables
5173- mesa/st: initialize resources for ASTC decoding
5174- mesa: add astc decoder shader template (glsl es version)
5175- mesa/st: support compute shader decoding of ASTC
5176- anv: Wa_14016407139, add required pc when SBA programmed
5177- iris: implement emission of 3DSTATE_HS for Wa_1306463417
5178- anv: emit 3DSTATE_HS in cmd_buffer_flush_gfx_state
5179- anv: limit generated draws to pipelines without HS stage
5180- anv: implement emission of 3DSTATE_HS for Wa_1306463417
5181- iris: emit 3DSTATE_HS for each primitive on gfx12
5182- anv: emit 3DSTATE_HS for each primitive on gfx12
5183- intel/compiler: add comment about workaround on simd width
5184- anv: fix sends_count_expectation assert on simd32
5185- intel/isl: disable TILE64 for YCRCB formats
5186- anv: implement occlusion query related Wa_14017076903
5187- iris: implement occlusion query related Wa_14017076903
5188- intel/fs: restore message layout changes for cube array
5189- anv: use primitive ID override when shader does not supply it
5190- anv: take primitive ID override to account Wa_14015297576
5191- anv: check for MESA_SHADER_TESS_CTRL with get_tcs_prog_data
5192- intel/common: limit the amount of SLM with Wa_14017341140
5193- intel/fs: use intel_needs_workaround for Wa_22013689345
5194- intel/compiler: use intel_needs_workaround for Wa_14012437816
5195- isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125
5196- iris: implement state cache invalidate for Wa_16013063087
5197- anv: cleanup bitmask construction for PIPELINE_SELECT
5198- anv: implement state cache invalidate for Wa_16013063087
5199- isl: fix layout for comparing surf and view properties
5200
5201Tatsuyuki Ishi (22):
5202
5203- radv: Fix depth-only-with-discard when epilogs are used.
5204- radv: Fix emitting tess indirect descriptors twice.
5205- radv: Loop over shader stages in flush_indirect_descriptor_sets.
5206- radv: Fix noop FS not getting constructed for GPL pipelines.
5207- radv: Fix missing rbplus_allowed check for dynamic PS epilogs.
5208- radv: Assert the hardware support rbplus when emitting rbplus state.
5209- radv: Keep shader code ptr in a separately allocated buffer.
5210- radv/sqtt: Use code buffer from radv_shader directly instead of copying.
5211- radv: Replace radv_trap_handler_shader with radv_shader.
5212- radeonsi: SDMA v4 size field is size - 1
5213- radv: SDMA v4 size field is size - 1
5214- radv: Remove SDMA padding from copy helpers.
5215- radv: Use common helpers to translate format in SDMA copy.
5216- radv/rt: Don't upload the prolog twice.
5217- radv: Use radeon_cmdbuf for sdma_copy_image.
5218- radv: Introduce sdma_copy_buffer for GFX7+.
5219- radv: Upload shaders to invisible VRAM on small BAR systems.
5220- radv: Wait for shader uploads asynchronously.
5221- radv: Fix missing wait of GS copy shader upload for dmashaders.
5222- amd: Add radv_foreach_stage to ForEachMacros.
5223- radv: Pre-compute descriptor set layout hash.
5224- ci/android: Make armv8's arch aarch64 instead of arm.
5225
5226Teng, Jin Chung (2):
5227
5228- frontend/va: Add large_scale_tile from VADecPictureParameterBufferAV1
5229- d3d12: AV1 Dec - Set anchor_frame_idx only when large_scale_tile equals 1
5230
5231Thomas H.P. Andersen (4):
5232
5233- docs/panvk: VK_KHR_descriptor_update_template
5234- meson: use summary()
5235- meson: use sections in summary()
5236- v3dv: use common code for descriptor update template
5237
5238Thong Thai (6):
5239
5240- gallium/auxiliary/vl: clean-up progressive shader
5241- radeonsi/vcn: use encoder/decoder caps reported by kernel
5242- gallium/auxiliary/vl: add crop to compute shader
5243- mesa/main: rework locale setup/teardown
5244- util: check and initialize locale before using it
5245- tgsi: use locale independent float and double parsing
5246
5247Timothy Arceri (15):
5248
5249- nir/nir_opt_copy_prop_vars: remove extra loop
5250- nir/nir_opt_copy_prop_vars: avoid comparison explosion
5251- nir/nir_opt_copy_prop_vars: reuse hash tables
5252- nir/nir_opt_copy_prop_vars: reuse dynamic arrays
5253- nir/nir_opt_copy_prop_vars: reorder clone calls
5254- nir/nir_opt_copy_prop_vars: don't call memset when cloning
5255- ci: enable dEQP-VK.ubo.random.all_shared_buffer.48
5256- glsl: copy prop vars before scalarizing alus
5257- glsl: add _token_list_prepend() helper to the parser
5258- glsl: isolate object macro replacments
5259- glsl: remove do_copy_propagation_elements() optimisation pass
5260- glsl: allow 64-bit integer on RHS of shift
5261- util/00-mesa-defaults: add Akka Arrh workaround
5262- mesa: add _mesa_is_api_gles2() helper
5263- glsl: move some GL ES checks to the NIR linker
5264
5265Timur Kristóf (155):
5266
5267- aco/optimizer: Add missing v_lshlrev condition to can_apply_extract.
5268- aco/optimizer: Optimize p_extract + v_mul_u32_u24 to v_mad_u32_u16.
5269- radv: Make NGG query emission a dirty flag.
5270- radv: Get rid of app_shaders_internal.
5271- radv, aco: Add uses_full_subgroups to compute shader info.
5272- aco: Enable constant exec mask based optimization on compute shaders.
5273- radv: Lower dynamic VS inputs in NIR.
5274- aco: Remove dynamic VS input loads.
5275- nir: Add pack_half_2x16_rtz_split opcode.
5276- radv, aco, ac: Implement pack_half_2x16_rtz_split.
5277- nir: Lower pack_half_2x16_split to RTZ if available.
5278- nir: Add algebraic optimization for VKD3D-Proton fp32->fp16 conversion.
5279- ac/gpu_info: Add has_pcie_bandwidth_info.
5280- radv: Don't place CS in VRAM when bandwidth is too low.
5281- nir/opt_algebraic: Add optimization for ieq/ine and right-shift.
5282- radv: Disable NGG culling when conservative overestimation is used.
5283- ac/nir/cull: Alway remove zero-area triangles in face culling.
5284- ac/nir/ngg: Include culled primitives in query.
5285- radv: Don't change LDS_SIZE for NGG culling shaders.
5286- radv: Move checking primitive topology to radv_get_ngg_culling_settings.
5287- radv: Use shader code to skip NGG culling in small workgroups.
5288- radv: Remove NGG culling skip from command buffer.
5289- radv: Refactor radv_emit_ngg_culling_state so it's based on dirty flags.
5290- nir: Clarify comment above load_buffer_amd.
5291- ac: Port ACO's get_fetch_format to ac_get_safe_fetch_size.
5292- ac/llvm: Remove "structurized" argument and instead check vindex.
5293- ac/llvm: Fix buffer_load_amd with larger than 32-bit channel sizes.
5294- ac/llvm: Fix ac_build_buffer_load to work with more than 4 channels.
5295- ac/llvm: Change ac_build_tbuffer_load to take format and channel type.
5296- radv: Move VS input lowering to new file: radv_nir_lower_vs_inputs.
5297- aco: Get rid of redundant load_vmem_mubuf function.
5298- aco: Don't set scalar offset on buffer load instructions when it's zero.
5299- aco: Remove MTBUF zero operand.
5300- radv: Call nir_lower_array_deref_of_vec in radv_lower_io_to_scalar_early.
5301- aco/optimizer: Change v_cmp with subgroup invocation to constant.
5302- radv: Emulate VGT_ESGS_ITEMSIZE in shaders on GFX9+.
5303- util: Add util_format_get_array.
5304- ac: Add pending_vmem field to args.
5305- radv: Set pending_vmem on dynamic VS input args.
5306- aco: Generalize vs_inputs to args_pending_vmem.
5307- aco, radv: Rename aco_*_key to aco_*_info.
5308- aco, radv: Move PS epilog and VS prolog args to their info structs.
5309- aco, radv: Don't use radv_shader_args in aco.
5310- aco: Don't include headers from radv.
5311- ac/nir: clear nir_var_shader_out from TCS barriers
5312- aco: Remove vtx_binding from MUBUF/MTBUF instructions.
5313- nir: Add load_typed_buffer_amd intrinsic.
5314- aco: Implement load_typed_buffer_amd.
5315- ac/llvm: Implement typed buffer load intrinsic.
5316- radv: Lower non-dynamic VS inputs in NIR.
5317- radv: Apply swizzle and alpha adjust in radv_nir_lower_vs_inputs.
5318- aco: Remove VS inputs from visit_load_input.
5319- aco: Rename visit_load_input to visit_load_fs_input.
5320- radv: Remove VS inputs code from LLVM backend.
5321- ac/llvm: Remove unused function ac_build_struct_tbuffer_load.
5322- aco, radv: Remove VS IO information from ACO.
5323- aco: Don't add soffset to swizzled MUBUF base.
5324- aco: Use zero for MUBUF/MTBUF when soffset is undefined.
5325- aco: Disable MUBUF/MTBUF offsets when they are zero.
5326- aco: Always enable idxen for swizzled buffer access on GFX11.
5327- ac/nir/ngg: Remove usused lds_es enum values.
5328- ac/nir/ngg: Rename saved_uniform to reusable_nondeferred_variable.
5329- ac/nir/ngg: Split some functions out of save_reusable_variables.
5330- ac/nir/ngg: Move divergence analysis call to analyze_shader_before_culling.
5331- ac/nir/ngg: Rename state variables to "s".
5332- ac/nir/ngg: Remove some superfluous variables.
5333- ac/nir/ngg: Create separate variable for repacked rel_patch_id.
5334- ac/nir/ngg: Rename repacked variables to clarify their name.
5335- ac: Add more defines for mesh shading packets.
5336- radv: Use new mesh shading packet defines.
5337- radv: Add per-prim attributes to ring_attr stride.
5338- radv: Use per-prim params in has_param_exports.
5339- radv: Add extra offset to per-prim params.
5340- radv: Use PRIM_ATTR for PS inputs on GFX11.
5341- radv: Include per-prim params in NUM_INTERP on GFX11.
5342- radv: Adjust mesh draw packets for GFX11.
5343- ac/nir/ngg: Clarify mesh shader scratch ring.
5344- ac/nir/ngg: Use attribute ring for mesh shader params.
5345- ac/nir/ngg: Split legacy workgroup index function.
5346- ac/nir/ngg: Fix mesh shader layer on GFX11.
5347- ac/nir/ngg: Store special MS outputs in attribute ring for PS to read.
5348- radv: Enable mesh shading on GFX11.
5349- radv: Fix swizzled VS input loads when some components are unused.
5350- radv: Don't expose NV_mesh_shader and don't use it in CI.
5351- radv: Remove NV_mesh_shader API entrypoints.
5352- radv: Remove first_task and ib_addr/ib_stride.
5353- radv: Clean up emitting zero mesh shader draw id.
5354- ac/nir/ngg: Remove NV_mesh_shader support.
5355- ac/nir: Remove ac_nir_apply_first_task_to_task_shader.
5356- nir: Remove IB address and stride intrinsics.
5357- radv: Move radv_nir_* to a new folder.
5358- radv: Move radv_nir_lower_primitive_shading_rate to new file.
5359- radv: Move radv_nir_lower_fs_intrinsics to new file.
5360- radv: Move radv_nir_lower_intrinsics_early to new file.
5361- radv: Move radv_nir_lower_view_index to new file.
5362- radv: Move radv_nir_lower_viewport_to_zero to new file.
5363- radv: Move radv_nir_export_multiview to new file.
5364- radv, ac/nir: Move sin/cos lowering to a common pass.
5365- radv: Move I/O lowering functions into a new file.
5366- radv: Use radv_get_shader to get vertex shader when binding pipeline.
5367- ac/nir/ngg: Slightly improve attribute ring offset calculation.
5368- ac/nir: Store only lowest 8 bits for task draw ring DWORD3.
5369- ac/nir: When task->mesh dispatch Y or Z are 0, also set X to 0.
5370- aco: Consider p_cbranch_nz as divergent branch too.
5371- aco: Don't remove exec writes that also write other registers.
5372- aco: Simplify get_phi_operand using Operand::c32_or_c64.
5373- aco: Don't verify branch exec read when eliminating exec writes.
5374- aco: Pop branch operands when targets are same in SSA elimination.
5375- aco: Call dominator_tree before lower_phis.
5376- aco: Better phi lowering for merge block when else-side is const.
5377- nir: Gather compile time constant task->mesh dispatch size.
5378- radv: Use linear_dispatch info in GFX11 task/mesh draw packet.
5379- radv/amdgpu: Extract CS chain and unchain functions.
5380- radv/amdgpu: Expose CS chain and unchain on the winsys.
5381- radv/amdgpu: Extract radv_amdgpu_add_cs_to_bo_list function.
5382- radv/amdgpu: Remember which CS the current one is chained to.
5383- radv/amdgpu: Walk chained CS objects for BO list.
5384- radv/amdgpu: Unchain CS array in queue code not in winsys.
5385- radv: Chain cmd buffers in queue code when possible, not in winsys.
5386- radv/amdgpu: Remove can_patch and chained submit code path.
5387- ac/llvm: Cover runtime 0 in GFX10 gs_alloc_req workaround.
5388- aco: Fix optimization of v_cmp with subgroup invocation.
5389- aco: Don't use nir_selection_control in aco_ir.
5390- aco: Only include nir.h in instruction selection.
5391- radv: Don't include nir.h in radv_shader.h
5392- radv: Create continue preamble on GFX6 even when no shader rings are used.
5393- ac: Add maximum number of submitted IBs.
5394- radv/amdgpu: Fix mismatching return type of radv_amdgpu_cs_submit.
5395- radv/amdgpu: Only allow IB BOs on graphics and compute queues.
5396- radv/amdgpu: Use correct alignment when creating CS BOs.
5397- radv/amdgpu: Extract radv_amdgpu_cs_add_old_ib_buffer.
5398- radv/amdgpu: Add a few assertions during submit.
5399- radv/amdgpu: Remove hw_can_chain in favour of use_ib.
5400- radv/amdgpu: Rewrite fallback code path so it can split submissions.
5401- radv/amdgpu: Allow multiple continue preambles.
5402- radv/amdgpu: Add continue preambles to fallback submit.
5403- radv/amdgpu: Add postambles to fallback submit.
5404- radv/amdgpu: Add ability to submit non-chained CS to fallback.
5405- radv/amdgpu: Split gang submissions correctly when not chained.
5406- radv: Fill continue preambles and postambles properly.
5407- radv: Split submission in winsys instead of radv_queue.
5408- radv/amdgpu: Use fallback submit for queues that can't use IBs.
5409- radv/amdgpu: Clean up submission functions.
5410- radv/amdgpu: Respect maximum number of submitted IBs per IP type.
5411- radv: Allow task/mesh shaders with RADV_DEBUG=noibs.
5412- radv/amdgpu: Add bool is_secondary argument to cs_create function.
5413- radv/amdgpu: Extract radv_amdgpu_cs_bo_create function.
5414- radv/amdgpu: Place secondary CS without IB2 in non-WC GTT.
5415- ac, aco, radv: Clarify LDS size on GFX6, and NGG shaders.
5416- radv: Don't hardcode LDS granularity in gfx9_get_gs_info.
5417- aco: Remove setup_*_variables and add setup_lds_size instead.
5418- aco, radv: Remove "key" from aco_compiler_options.
5419- aco, radv: Remove redundant enable_mrt_output_nan_fixup from PS epilog info.
5420- ac/nir/ngg: Don't store primitive IDs from culled primitives.
5421- aco: Disallow constant propagation on SOPP and fixed operands.
5422
5423Tomeu Vizoso (7):
5424
5425- android: Make libbacktrace optional again
5426- android: Cleanup unneeded headers from the sync stub
5427- ci: Build for Android with libbacktrace=false
5428- ci: Use NDK 25b to build for the Android ABI level 33
5429- etnaviv: handle missing alu conversion opcodes
5430- etnaviv: print writemask of store operations
5431- etnaviv: don't read too much from uniform arrays
5432
5433Turo Lamminen (4):
5434
5435- radv: Change radeon_cmdbuf counters to uint64_t to make alias analysis optimize radeon_emit better
5436- radv: Clean up variables in si_get_ia_multi_vgt_param
5437- radv: Avoid redundant fetch of radv_device
5438- radv: Optimize emitting prefetches
5439
5440Val Packett (1):
5441
5442- mailmap: Remap name and email for Val Packett
5443
5444Vincent Davis Jr (1):
5445
5446- gbm/backend: fix gbm compile without dri
5447
5448Vinson Lee (2):
5449
5450- radv: Fix memory leak.
5451- pps: Fix build errors.
5452
5453Vitaliy Triang3l Kuzmin (2):
5454
5455- radv: Set DB_Z_INFO.NUM_SAMPLES to MSAA_EXPOSED_SAMPLES without Z/S
5456- r600: Alpha to coverage dithering on Evergreen+
5457
5458Väinö Mäkelä (12):
5459
5460- intel/vec4: Set the rounding mode
5461- intel/vec4: Don't optimize multiply by 1.0 away
5462- hasvk: Don't claim shaderDenormPreserveFloat32 on gfx7
5463- hasvk: Tell spirv_to_nir float controls are always supported
5464- hasvk: Enable PixelShaderKillsPixel when omask is used
5465- hasvk: Mark VK_IMAGE_LAYOUT_ATTACHMENT_OPTIMAL as stencil write optimal
5466- hasvk: Handle subpass self-dependencies for stencil shadow copies
5467- ci/intel: Update hasvk HSW xfails
5468- hasvk: Disable non-zero fast clears for 8xMSAA images
5469- intel/ci: Remove outdated hasvk copy_and_blit xfails
5470- intel/ci: Remove hasvk xfails missing from the CTS
5471- intel/ci: Remove skipped float_controls tests from hasvk xfails
5472
5473X512 (3):
5474
5475- hgl: remove
5476- haiku: fix build
5477- EGL: implement Haiku driver over Gallium
5478
5479Yevhenii Kolesnikov (2):
5480
5481- nir/loop_analyze: Track induction variables incremented by more operations
5482- nir/loop_analyze: Determine iteration counts for more kinds of loops
5483
5484Yiwei Zhang (46):
5485
5486- venus: log upon device creation
5487- venus: lazily query and cache gralloc front rendering usage
5488- venus: disable non AHB external memory bits on Android
5489- venus: fix formating
5490- venus: fix tracing init to include instance creation
5491- venus: render server enforces blob_id_0
5492- venus: move exp features init back to use ring submit
5493- venus: further disallow sparse resource
5494- venus: replace binary search with hardcode for max buffer size
5495- venus: start requiring all experimental features
5496- venus: clean up memoryResourceAllocationSize
5497- venus: clean up globalFencing
5498- venus: refactor sync fd fence and sempahore features
5499- venus: tighten up the sync fd requirements for Android wsi
5500- venus: distinguish external memory from mappable memory support
5501- venus: fix external buffer creation
5502- venus: remove redundant abstractions for wsi struct search
5503- venus: refactor image create info pnext tracking
5504- venus: simplify ahb image creation
5505- venus: simplify support for non-AHB external images
5506- venus: fix external image creation
5507- venus: fix device memory export alloc info
5508- venus: fix VK_EXT_image_view_min_lod feature query
5509- venus: ensure invariance of buffer memory requirement size
5510- venus: sync to latest protocol for ring status enum
5511- venus: abort ring submit when ring is in fatal status
5512- venus: propagate vn_ring to vn_relax
5513- venus: vn_relax to abort on ring fatal status upon warn order
5514- venus: revert back the warn order
5515- venus: sync to latest protocol for asyncRoundtrip
5516- venus: switch to use 64bit roundtrip seqno
5517- venus: make vn_instance_wait_roundtrip asynchronous
5518- venus: let vn_instance_submit_command track ring seqno
5519- venus: make common wsi bo submission async
5520- venus: refactor to add vn_sync_payload_external
5521- venus: make external fence and semaphore export async
5522- Revert "zink/kopper: Add extra swapchain images for Venus"
5523- venus: sync latest protocol for layering extensions
5524- venus: add VK_EXT_load_store_op_none support
5525- venus: add VK_EXT_rasterization_order_attachment_access support
5526- venus/docs: sync to latest venus supported extensions
5527- venus: requires asyncRoundtrip
5528- venus: requires ringMonitoring
5529- venus: move exp feature init back to ring and remove unused function
5530- venus: forward ARM driverVersion for ANGLE workarounds
5531- radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults
5532
5533Yogesh Mohan Marimuthu (18):
5534
5535- egl: add render_gpu tag to dri2_dpy->fd and dri2_dpy->dri_screen variable
5536- loader,glx: add render_gpu tag psc->driScreen and psc->fd
5537- loader,glx,egl,vl,d3d: loader_get_user_preferred_fd() function to return original_fd
5538- egl: remove is_different_gpu variable from struct dri2_egl_display
5539- glx: remove is_different_gpu variable from struct dri_screen
5540- loader,glx,egl: remove is_different_gpu variable from loader
5541- ac,radeonsi: move shadow regs create ib preamble function to amd common
5542- radv: add shadowregs variable to RADV_DEBUG environment variable
5543- radv: add support for register shadowing
5544- radv: set preemp flag and pre_ena bit for shadowregs
5545- radv: INDEX_TYPE and NUM_INSTANCES PKT3 are not shadowed
5546- radv: fence complete struct is 4 qw size
5547- radv: allow NULL initial_preamble_cs in radv_amdgpu_winsys_cs_submit_sysmem()
5548- radeonsi: remove some shadow reg optimization for bf1 game
5549- wsi/display: check alloc failure in wsi_display_alloc_connector()
5550- ac/surface: only adjust pitch if surf_pitch was modified
5551- amd/surface: add RADEON_SURF_NO_TEXTURE flag
5552- radv: set RADEON_SURF_NO_TEXTURE flag in radv_get_surface_flags()
5553
5554Yogesh Mohanmarimuthu (7):
5555
5556- egl: add fd_display_gpu to struct dri2_egl_display
5557- egl,egl/x11: keep display fd open for prime
5558- egl: create DRI screen for display GPU in case of prime
5559- loader,glx,egl/x11: init dri_screen_display_gpu in struct loader_dri3_drawable
5560- egl/wayland: keep display fd open for prime
5561- loader: make image_format_to_fourcc() non-static
5562- egl/wayland: for prime, allocate linear_copy from display GPU VRAM
5563
5564Yonggang Luo (8):
5565
5566- util: Implement util_iround with lrintf unconditionally
5567- util: Fixes error: no previous prototype for 'mesa_cache_db_entry_remove' Fixes: c92c99481fd ("util/mesa-db: Support removal of cache entries")
5568- vulkan: Use static_assert for check HWVULKAN_DISPATCH_MAGIC == ICD_LOADER_MAGIC
5569- meson: Split c_cpp_args from pre_args
5570- meson: Combine duplicated c_args and cpp_args
5571- meson: When sse2 enabled, both c and cpp using sse2 options
5572- meson: Split sse2_arg and sse2_args out of c_cpp_args
5573- meson: Use sse2_arg and sse2_args to replace usage of c and c_sse2_args
5574
5575Yusuf Khan (2):
5576
5577- nvc0/nv50: support and enable EXT_memory_object*
5578- gallium: create query_memory_info implementation for sw drivers
5579
5580Yuxuan Shui (1):
5581
5582- loader: unregister special event in loader_dri3_drawable_fini
5583
5584antonino (54):
5585
5586- zink: fix line smooth lowering
5587- zink: add \`zink_emulate_point_smooth` driconf
5588- zink: add \`lower_point_smooth` to \`zink_fs_key`
5589- zink/nir_to_spirv: add support for \`nir_intrinsic_load_point_coord`
5590- nir: handle output beeing written to deref in \`nir_lower_point_smooth`
5591- zink: handle point_smooth emulation
5592- drirc: set \`zink_emulate_point_smooth` for Quake II
5593- zink: fix stipple pattern in oblique lines
5594- zink: fix \`final_hash` update in \`zink_gfx_program_update`
5595- mesa: correctly allocate space for converted primtives
5596- gallium: decompose quad strips into quads if supported
5597- zink: handle switching between primitives
5598- nir: handle primitives with adjacency
5599- nir: avoid generating conflicting output variables
5600- nir: calculate number of vertices in nir_create_passthrough_gs
5601- nir: handle edge flags in nir_create_passthrough_gs
5602- zink: add \`has_edgeflags` flag to zink_shader and zink_gfx_program
5603- zink: handle edgeflags
5604- nir: allow to force line strip out in nir_create_passthrough_gs
5605- zink: force line strip out when emulating stipple
5606- zink: filled quad emulation gs generation function
5607- zink: add \`zink_rast_prim` enum
5608- zink: handle quads
5609- zink: fix flat shading on filled quads
5610- zink: add flags to \`zink_gfx_program` and \`zink_context`
5611- zink: add \`needs_inlining` to \`zink_shader`
5612- zink: implement flat shading using inlined uniforms
5613- nir/zink: handle provoking vertex mode in \`nir_create_passthrough_gs`
5614- zink: handle provoking vertex mode for filled quads
5615- nir: keep xfb properties in nir_create_passthrough_gs
5616- zink: keep xfb properties in quad emulation gs
5617- zink: advertise support for the quad primitive
5618- zink: prevent crash when freeing
5619- zink: unified \`zink_set_primitive_emulation_keys` and \`zink_create_primitive_emulation_gs`
5620- zink: zink: add \`parent` to \`zink_shader::non_fs`
5621- zink: improve generated gs unbinding
5622- zink: unbind generated gs in \`bind_last_vertex_stage`
5623- zink/ci: remove \`primitive-id-no-gs-quads` from radv-vangogh-fails
5624- nir: only handle flat interpolation when needed in \`nir_create_passthrough_gs`
5625- zink: simplify logic to call \`zink_set_primitive_emulation_keys`
5626- zink: add field to 'zink_gs_key' and enum
5627- zink: add provoking vertex mode lowering
5628- zink: always advertize provoking vertex mode support
5629- zink: update requirements now that pv mode can be emulated
5630- zink: add \`descriptor_bindless_id` to \`zink_shader_info`
5631- zink: fix sampler array collision in \`nir_to_spirv`
5632- zink: don't emulate edgeflags for patches
5633- zink: use correct primitives for passthrough gs with tess
5634- zink: fix pv mode lowring index calculation
5635- zink: use ring buffer to preserve last element
5636- zink: fix exit condition on pv emulation loop
5637- zink: fix line strip offsets in pv mode emulation
5638- zink: fix store subsitution in \`lower_pv_mode_gs_store`
5639- zink: take location_frac into account in pv emulation
5640
5641driver1998 (1):
5642
5643- gallium: Use DETECT_OS_WINDOWS instead of 'WIN32'
5644
5645osy (1):
5646
5647- virgl: enable timer queries only if host supports it
5648
5649t0b3 (1):
5650
5651- nir/nir_opt_move: fix ALWAYS_INLINE compiler error
5652
5653volodymyr.o (1):
5654
5655- mesa ctx->API --> _mesa_is_foo(ctx)
5656
5657xurui (2):
5658
5659- panfrost: Check the return value of drmGetVersion
5660- zink: bs->dd.push_pool[1].pool should be freed
注意: 瀏覽 TracBrowser 來幫助您使用儲存庫瀏覽器

© 2025 Oracle Support Privacy / Do Not Sell My Info Terms of Use Trademark Policy Automated Access Etiquette