VirtualBox

source: vbox/trunk/src/VBox/Additions/3D/mesa/mesa-24.0.2/docs/relnotes/23.3.0.rst@ 105254

最後變更 在這個檔案從105254是 103996,由 vboxsync 提交於 12 月 前

Additions/3D/mesa: export mesa-24.0.2 to OSE. bugref:10606

檔案大小: 297.0 KB
 
1Mesa 23.3.0 Release Notes / 2023-11-29
2======================================
3
4Mesa 23.3.0 is a new development release. People who are concerned
5with stability and reliability should stick with a previous release or
6wait for Mesa 23.3.1.
7
8Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by
9glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
10glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
11Some drivers don't support all the features required in OpenGL 4.6. OpenGL
124.6 is **only** available if requested at context creation.
13Compatibility contexts may report a lower version depending on each driver.
14
15Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by
16the apiVersion property of the VkPhysicalDeviceProperties struct
17depends on the particular driver being used.
18
19SHA256 checksum
20---------------
21
22::
23
24 50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5 mesa-23.3.0.tar.xz
25
26
27New drivers
28-----------
29- NVK: A Vulkan driver for Nvidia hardware
30
31New features
32------------
33- VK_EXT_pipeline_robustness on ANV
34- VK_KHR_maintenance5 on RADV
35- OpenGL ES 3.1 on Asahi
36- GL_ARB_compute_shader on Asahi
37- GL_ARB_shader_atomic_counters on Asahi
38- GL_ARB_shader_image_load_store on Asahi
39- GL_ARB_shader_image_size on Asahi
40- GL_ARB_shader_storage_buffer_object on Asahi
41- GL_ARB_sample_shading on Asahi
42- GL_OES_sample_variables on Asahi
43- GL_OES_shader_multisample_interpolation on Asahi
44- GL_OES_gpu_shader5 on Asahi
45- EGL_ANDROID_blob_cache works when disk caching is disabled
46- VK_KHR_cooperative_matrix on RADV/GFX11+
47
48
49Bug fixes
50---------
51
52- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1)
53- mesa: vertex attrib regression
54- [RADV] War Thunder has some grass flickering.
55- radv: satisfactory broken shader
56- RADV problem with R7 M440 in some games
57- gpu driver crashes when opening ingame map playing dead space 2023
58- [anv] Valheim water misrendering
59- EGL/v3d: EGL applications under a X compositor doesn't work
60- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games
61- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled
62- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));"
63- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers
64- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox.
65- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380
66- [radv] Crypt not rendering properly
67- Leaks of DescriptorSet debug names
68- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace
69- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates
70- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2
71- radv: Regression with UE5 test
72- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue
73- [ANV] Corruptions in Battlefield 4
74- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments")
75- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4
76- Zink + Venus: driver can't handle INVALID<->LINEAR!
77- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake
78- Lords of the Fallen 2023 Red Eye mode crashing game and desktop
79- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer
80- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode
81- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24
82- H264 to H264 transcode output corruption with gst-vaapi
83- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover
84- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970
85- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs
86- Rusticl fails to build
87- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range
88- tu: Wolfenstein: The New Order misrenders on a740
89- DRI_PRIME fails with ACO only radeonsi
90- nir_to_tgsi: Incorrect handling of indirect array access
91- ANV gen9 32 bit vulkan asserts on many cts tests
92- GPU hang observed while launching 3DMark Wildlife Unlimited on MTL
93- ac/gpu_info: Query maximum submitted IBs from the kernel
94- RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games
95- GPU page faults reported while playing Talos Principle 2 (demo)
96- No CCS_E scanout on tgl+ with ANV
97- anv: Modifier tests assert-fail on TGL+
98- ci: zink-tu jobs no longer included in manual pipelines
99- [ANV][A770] GravityMark segfaults and buffer allocation errors
100- etnaviv: gc2000 gles2 regression
101- ci_run_n_monitor: pipeline finding unreliable
102- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
103- anv: jsl timeline semaphores flaky
104- anv: OOB access in vkDestroyDevice?
105- nvk: Implement VK_EXT_primitive_topology_list_restart
106- nvk: Implement VK_EXT_image_sliced_view_of_3d
107- nvk: Implement VK_KHR_workgroup_memory_explicit_layout
108- util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit
109- r300/r400 regression; can't compile \`if/then` in shaders
110- iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers
111- nvk: Implement VK_EXT_depth_bias_control
112- ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels'
113- [R600] Wolfenstein: The New Order text glitch on menu
114- need extension to request image/texture not use data dependent compression
115- rusticl: segfault in clCreateKernel on AMD Instinct MI100
116- !25587 broke xserver
117- GPU Hang in Deep Rock Galactic on DG2
118- intel: Wrong length for 3DSTATE_3D_MODE on gfx125
119- [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO
120- traces job reference images missing again sometimes
121- Vulkan Texture/Polygon Glitches in Games
122- freedreno: dmabuf modify query ignores format
123- virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer
124- Turnip build error on termux
125- failiure in amd llvm helper
126- failiure in amd llvm helper
127- radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed.
128- hasvk: subgroups regression
129- radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected)
130- aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes
131- anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow
132- [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x
133- [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression
134- radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect
135- Doom Eternal freezing on NAVI31 with current git
136- iris CTS blend test fail with MSAA config on DG2
137- anv: 32bit mesa asserts
138- RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics
139- anv: missing barrier handling on video engines
140- radv: Star Wars The Old Republic hang when DCC is enabled
141- radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings
142- radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen
143- ANV: fp64 shader leaked
144- v3d: noop drm-shim raises some warnings
145- freedreno: crashdec/etc chip_id support
146- intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead
147- ci build issues with builtin types
148- freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes
149- RadeonSI: glClear() causes clear texture for some frames on RX580
150- radeonsi: corruption when seeking video decoded with vaapi in mpv
151- Zink/HasVK regression bisected to "gallium: move vertex stride to CSO"
152- [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete.
153- radv: images don't always have extents in RGP
154- shader_test causing a crash in compiler
155- D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required.
156- Crash in st_ReadPixels
157- [regression] intel build issue on i386
158- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
159- zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed.
160- Penumbra: Overture hangs on new game loading screen
161- [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550
162- v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it
163- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
164- h264 encoding picture showed randomly repeated frames.
165- Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1
166- zink: no uniform buffer objects support for v3dv?
167- v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support
168- [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry
169- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
170- VAAPI on VCN: bad stream may crash whole gfx system
171- Crash after GPU reset
172- Bifrost PanVK should not be in CI
173- [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory
174- mesa: new glcts fails
175- tu: GPL support is broken
176- lavapipe: ycbcr regression
177- aco: Assertion when compiling CP2077 shader
178- anv: flakiness on tgl+ with samplemask handling
179- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
180- r300: optionally convert MULs into output modifier for the following MUL or DOT instructions
181- r300: better 1-x presubtract pattern matching
182- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
183- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
184- wsi: deadlocks when DISPLAY is changed
185- hasvk: Incompatible with minigbm/gralloc4 on Android
186- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
187- lavapipe/llvmpipe: shader unregister crash
188- [ANV] [DG2/A380] Corruption in Borderlands 3
189- blorp regression on dg2
190- decouple -Dshader-cache= from EGL_ANDROID_blob_cache
191- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
192- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
193- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
194- ubsan + gtest build fails
195- glCopyTexSubImage2D is very slow on Intel
196- NVE4 (GeForce 710) fails to get vdpau in mesa git
197- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
198- nouveau prevents hardware acceleration with Chromium (Wayland)
199- Corrupt text rendering in Blender
200- DRI2 gallium frontend is using bad format type
201- regression - MR 23089 - Hellblade RT crashing
202- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
203- Issue with clang-format
204- Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy"
205- nightly VA-API build: new timeout
206- r600: retire the SB optimizer
207- ci: do not download perfetto on-fly in build jobs
208- Shared Memory Leak With Qt OpenGL Applications
209- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
210- vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
211- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video.
212- zink: reduce pipeline hash size
213- Rusticl,radeonsi: ac_rtld error(2): too much LDS
214- aco, radv Rage 2 menu corruption - bisected
215- radv, aco: World War Z character texture regression on 7900xtx
216- android: De-stage drm_gralloc support from mesa3d
217- Cyberpunk screen goes black at game launch on integrated Gfx
218- lavapipe/llvmpipe: regressions since descriptor rewrite
219- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
220- ci: HW job logs have spam at the end
221- kernel crash seen on AMD Raven device
222- crocus: regression crashing in doubles/ubo tests
223- turnip: object management CTS crashes
224- a618: multiple assertions with different kernel config on u_vector_add
225- [anv] Death Stranding crashes
226- Can no longer build Clover without llvmspirvlib
227- [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format
228- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
229- [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow
230- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
231- r300: calculate some cycles estimate for shader-db
232- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
233- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
234- error message when encoding via VAAPI AMD
235- gpu hangs on dg2 with mesh shading enabled on vkcts
236- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
237- robustness2 raygen tests intermittently fail in Intel Mesa CI
238- ci/ci_run_n_monitor.py: KeyError: 'clang-format'
239- glthread: huge performance regression
240- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
241- rusticl: fails to build for iris + radeonsi
242
243
244Changes
245-------
246
247Adam Jackson (3):
248
249- egl: Implement EGL_EXT_explicit_device
250- mesa: Implement and advertise GL_MESA_sampler_objects
251- docs: Mention 'meson devenv' in the pre-install test instructions
252
253Aditya Swarup (6):
254
255- isl: enable Tile64 for 3D images
256- intel/isl: Unittest for linear to Ytile conversion
257- intel/isl: Convert linear texture to Tile4 format
258- intel/isl: Convert Tile4 texture to linear format
259- intel/isl: Linear to Tile-4 conversion unittest
260- Revert "iris: Disable tiled memcpy for Tile4"
261
262Alba Mendez (1):
263
264- meson: support installation tags
265
266Alejandro Piñeiro (61):
267
268- v3dv: re-enable sync_fd import/export on the simulator
269- broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build
270- broadcom/cle: update the packet definitions for new generation v71
271- broadcom/common: add some common v71 helpers
272- broadcom/qpu: add comments on waddr not used on V3D 7.x
273- broadcom/qpu: set V3D 7.x names for some waddr aliasing
274- broadcom/compiler: rename small_imm to small_imm_b
275- broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig
276- broadcom/qpu: add v71 signal map
277- broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr
278- broadcom/qpu: add raddr on v3d_qpu_input
279- broadcom/qpu: defining shift/mask for raddr_c/d
280- broadcom/commmon: add has_accumulators field on v3d_device_info
281- broadcom/qpu: add qpu_writes_rf0_implicitly helper
282- broadcom/qpu: add pack/unpack support for v71
283- broadcom/compiler: phys index depends on hw version
284- broadcom/compiler: don't favor/select accum registers for hw not supporting it
285- broadcom/vir: implement is_no_op_mov for v71
286- broadcom/compiler: update vir_to_qpu::set_src for v71
287- broadcom/qpu_schedule: add process_raddr_deps
288- broadcom/qpu: update disasm_raddr for v71
289- broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71
290- broadcom/compiler: add support for varyings on nir to vir generation for v71
291- broadcom/compiler: payload_w is loaded on rf3 for v71
292- broadcom/qpu_schedule: update write deps for v71
293- broadcom/compiler: update register classes to not include accumulators on v71
294- broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71
295- broadcom/compiler: update one TMUWT restriction for v71
296- broadcom/compiler: update ldunif/ldvary comment for v71
297- broadcom/compiler: update payload registers handling when computing live intervals
298- broadcom/qpu: new packing/conversion v71 instructions
299- v3dv/meson: add v71 hw generation
300- v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
301- v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71
302- v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71
303- v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71
304- v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
305- v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71
306- v3dv: default vertex attribute values are gen dependant
307- v3dv/pipeline: default vertex attributes values are not needed for v71
308- v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71
309- v3dv: no specific separate_segments flag for V3D 7.1
310- v3dv: add support for TFU jobs in v71
311- v3d: add v71 hw generation
312- v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
313- v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1
314- v3d: emit CLEAR_RENDER_TARGETS for v71
315- v3d: just don't fill up early-z fields for CFG_BITS for v71
316- v3d: emit CLIPPER_XY_SCALING for v71
317- v3d: no specific separate_segments flag for V3D 7.1
318- v3d: default vertex attributes values are not needed for v71
319- v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
320- v3d: handle new texture state transfer functions in v71
321- v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields
322- v3d: setup render pass color clears for any format bpp in v71
323- v3d: GFX-1461 does not affect V3D 7.x
324- v3d: don't convert floating point border colors in v71
325- v3d: handle Z clipping in v71
326- v3d: add support for TFU blit in v71
327- v3dv: implement depthBounds support for v71
328- doc/features: update after last v3d changes
329
330Alex Denes (1):
331
332- virgl: link VA driver with build-id
333
334Alexander Orzechowski (1):
335
336- radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts
337
338Alyssa Rosenzweig (431):
339
340- zink: Switch to register intrinsics
341- gallium/trace: Collect enums from multiple files
342- gallium,util: Move blend enums to util/
343- gallium,util: Move util_blend_dst_alpha_to_one
344- util/blend: Add helpers for normalizing inverts
345- vulkan: Add helpers for blend enum translation
346- lvp: Use common blend/logicop translation
347- nir/lower_blend: Use util enums
348- panfrost: Convert to PIPE_BLEND enums internally
349- gallium: Remove pipe->compiler BLEND enum translation
350- compiler: Remove blend enums duplicating util
351- nir/legacy: Fix fneg(load_reg) case
352- nir/legacy: Fix handling of fsat(fabs)
353- ntt: Switch to new-style registers and modifiers
354- ir3: Convert to register intrinsics
355- nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics
356- nir: Devendor load_sample_mask
357- nir: Promote tess_coord_r600 to tess_coord_xy
358- nir: Add nir_lower_tess_coord_z pass
359- r600: Use nir_lower_tess_coord_xy
360- ir3: Use nir_lower_tess_coord_z
361- nir: Initialize workgroup_size in builder_init_simple_shader
362- v3dv: Rely on nir_builder setting workgroup size
363- radv: Rely on workgroup_size initialization
364- panfrost: Fix transform feedback on v9
365- r600/sfn: Remove nir_register unit tests
366- panfrost: Lower vertex_id for XFB
367- panfrost: Fix transform feedback on v9 harder
368- asahi: Augment fake drm_asahi_params_global
369- asahi: Use nir_builder_at more
370- asahi: Remove unused #define
371- asahi: Refactor PBE upload routine
372- asahi: Extract shader_initialize helper
373- asahi: Serialize NIR in memory
374- asahi: Identify background/EOT counts
375- asahi,agx: Set coherency bit for clustered targets
376- ail: Page-align layers for writable images
377- asahi: Mark writeable images as such
378- asahi: Reallocate to set the writeable image flag
379- asahi: Add agx_batch_track_image helper
380- asahi: Add texture/image indexing lowering pass
381- asahi: Upload at most the max texture state registers
382- asahi: Upload image descriptors
383- asahi: Make clear the non-sRGBness of EOT images
384- asahi: Don't restrict sampler views
385- asahi: Forbid 2D Linear with images
386- agx: Add try_coalesce_with helper
387- agx: Try to allocate phis compatibly with sources
388- agx: Try to allocate phi sources with phis
389- agx: Try to allocate phi sources with loop phis
390- agx: Vectorize 16-bit parallel copies
391- agx: Reduce un/packs with mem access lowering
392- agx: Fix bogus assert
393- asahi: Augment PBE descriptor for software access
394- asahi: Extend PBE packing for image support
395- asahi: Use nir_lower_robust_access
396- agx: Legalize image LODs to be 16-bit
397- agx: Lower image size to txs
398- agx: Generalize texture/PBE packing
399- agx: Add image write instruction
400- agx: Model texture bindless base
401- agx: Handle bindless properly for txs lowering
402- agx: Pack bindless textures
403- agx: Translate texture bindless handles
404- agx: Translate image_store from NIR
405- agx: Handle frag side effects without render targets
406- agx: Wait for outstanding stores before barriers
407- agx: Implement image barriers
408- agx: Handle early_fragment_tests
409- agx: Add interleave opcode
410- agx: Extract coords_for_buffer_texture helper
411- agx: Extract texture_descriptor_ptr_for_* helpers
412- agx: Lower image atomics
413- agx: Lower buffer images
414- asahi,agx: Fix txf sampler
415- agx: Add image_load opcode
416- agx: Extract texture write mask handling
417- agx: Implement image_load
418- agx: Emit global memory barriers for images
419- agx: Don't emit silly barriers
420- agx: Implement fence_*_to_tex_agx intrinsics
421- agx: Add simple image fencing pass
422- agx: Require tag writes with side effects
423- agx: Plumb in coverage mask
424- asahi: Extract sampler_view_for_surface
425- asahi: Introduce concept of spilled render targets
426- asahi: Add agx_tilebuffer_spills query
427- asahi: Do not support masking with spilled RTs
428- asahi: Ignore spilled render targets in EOT shaders
429- asahi: Ignore spilled render targets with partial renders
430- asahi: Extract some tilebuffer lowering code
431- asahi: Lower tilebuffer access for spilled RTs
432- asahi: Lower multisample image stores
433- asahi: Permit meta shaders to use preambles
434- asahi: Ignore spilled render targets for background load
435- asahi: Offset clear colour uniform by 4
436- asahi: Execute preambles for background programs
437- asahi: Advertise Z16_UNORM
438- ir2: Switch to nir_legacy
439- intel/fs: Don't read reg.base_offset
440- panfrost: Remove unused helpers
441- nir: Remove nir_lower_locals_to_regs
442- nir: Rename lower_locals_to_reg_intrinsics back
443- nir: Remove register arrays
444- asahi: Don't depend on glibc to decode
445- pan/bi: Remove leftover include
446- nir/trivialize: Handle more RaW hazards
447- panfrost: Disable blending for no-op logic ops
448- nir/lower_blend: Fix 32-bit logicops
449- nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP
450- clang-format: Ignore original panfrost commit
451- nir/schedule: Assume no old-style registers
452- gallium/u_simple_shaders: Optimize out ffloors
453- gallium/u_transfer_helper: Remove dead forward decl
454- nir/loop_analyze: Drop unused inverse_comparison
455- nir/passthrough_gs: Drop unused array_size_for_prim
456- panfrost: Add missing static inline annotation
457- pan/decode: Drop unused debug function
458- pan/mdg: Add missing static inline annotation
459- panfrost: Drop unused decode_position for samples
460- panfrost: Only define pan_blitter_get_blend_shaders for midgard
461- panfrost: Add missing inline
462- panfrost: Gate overdraw_alpha on Bifrost+
463- nir: Rename scoped_barrier -> barrier
464- nir: Remove lower_to_source_mods
465- nir: Remove lower_vec_to_movs
466- nir: Remove reg_intrinsics parameter to convert_from_ssa
467- nir: Remove register load/store builders
468- r600/sfn: Stop referencing legacy functionality
469- r600/sfn: Ignore instruction write masks
470- nouveau/codegen: Drop writemask check
471- vc4,broadcom/compiler: Drop write_mask handling
472- zink: Collapse is_ssa check
473- nir: Add {...} before case
474- nir/from_ssa: Drop legacy reg support
475- nir/schedule: Drop nir_schedule_dest_pressure
476- nir: Drop NIR reg create/destroy
477- nir: Remove nir_index_local_regs and callers
478- nir/schedule: Drop more nir_register handling
479- nir: Remove nir_foreach_register
480- nir: remove nir_{src,dest}_for_reg
481- ntt: Drop nir_register reference
482- nir/print: Assume SSA
483- nir/clone: Assume SSA
484- nir/serialize: Drop legacy NIR
485- nir/validate: Assume SSA
486- nir: Remove impl->{registers,reg_alloc}
487- nir: Remove nir_alu_dest::saturate
488- treewide: Drop is_ssa asserts
489- nir: Collapse some SSA checks
490- treewide: Remove more is_ssa asserts
491- nir: Remove reg-only dest manipulation
492- nir: Remove stale todo
493- nir/print: Drop legacy NIR
494- nir: Drop nir_alu_src::{negate,abs}
495- treewide: sed out more is_ssa
496- pan/mdg: Assume SSA
497- treewide: Drop some is_ssa if's
498- nir: Drop trivial reg handling
499- aco: Remove is_ssa check
500- intel: Collapse is_ssa checks
501- llvmpipe: Assume SSA
502- ir3: Collapse is_ssa checks
503- lima: Collapse is_ssa checks
504- radeonsi: Collapse SSA check
505- nir/gather_ssa_types: Collapse SSA checks
506- nir/worklist: Assume SSA
507- nir/range_analysis: Assume SSA
508- treewide: Collapse more SSA checks
509- nir/instr_set: Assume SSA
510- nir: Collapse more SSA checks
511- nir: Remove def_is_register
512- nir: Do not init dests
513- nir: Initialize source as a NULL SSA def
514- nir: Collapse more SSA checks
515- nir: Remove nir_{src,dest}::is_ssa
516- nir: Drop nir_register
517- nir/from_ssa: Remove pointless union
518- ir3: Drop write_mask handling
519- rogue: Stop reading write masks
520- etnaviv: Don't use alu->dest.write_mask
521- etnaviv: What if we just didn't have a compiler?
522- intel/vec4: Don't use legacy write mask
523- ntt: Evaluate write_mask check
524- nir: Remove nir_alu_dest::write_mask
525- nir: Remove nir_foreach_def
526- lima: Clean up after deleting asserts
527- nir: Remove no-op remove_def_cb
528- nir: Drop no-op all_srcs_are_ssa
529- nir: Simplify alu_instr_is_copy
530- nir: Add load_coefficients_agx intrinsic
531- agx: Implement nir_intrinsic_load_coefficients_agx
532- agx: Allow more varying slots
533- agx: Set lower_fisnormal
534- agx: Forcibly vectorize pointcoord coeffs
535- agx: Add interpolateAtOffset lowering pass
536- agx: Lower flat shading in NIR
537- asahi: Stub num_dies
538- asahi: Move a bunch of helpers to common
539- agx: Lower 8-bit ALU
540- agx: Handle 8-bit vecs
541- asahi,agx: Respect no16 even for I/O
542- agx: Don't lower load_local_invocation_index
543- agx/dce: Use the helper
544- agx: Fix atomics with no destination
545- agx: Fix shader info with sample mask writes
546- agx: Do not move bindless handles
547- agx: Put else instructions in the right block
548- agx: Use unconditional else instruction
549- agx: Optimize out pointless else instructions
550- agx: Fix length bit confusion
551- agx: Require an immediate for \`nest`
552- agx: Use compressed fadd/fmul encodings
553- agx: Optimize swaps of 2x16 channels
554- agx: Optimize logical_end removal
555- agx: Fix AGX_MESA_DEBUG=demand
556- agx: Maintain ctx->max_reg while assigning regs
557- agx: Allow 64-bit memory regs
558- agx: Fix accounting for phis
559- agx: Set phi sources in predecessors
560- agx: Stop setting registers after the shader
561- agx: Use agx_replace_src
562- agx: Assert invariant stated in the comment
563- agx: Don't use ssa_to_reg across blocks
564- agx: Don't reuse ssa_to_reg across blocks
565- agx: Remove unused allocation
566- agx: Stop setting forwarding bit
567- agx: Handle blocks with no predecessors
568- agx: Lower f2u8/f2i8
569- agx: Handle conversions to 8-bit
570- agx: Fix uadd_sat packing
571- agx: Fix 64-bit immediate moves
572- agx: Lower f2f16_rtz
573- agx: Handle f2f16_rtne like f2f16
574- agx: Handle <32-bit local memory access
575- agx: Do not allow creating vec8
576- asahi: Legalize compression before blitting
577- nir: Drop "SSA" from NIR language
578- agx: Stop passing nir_dest around
579- agx: Remove agx_nir_ssa_index
580- pan/mdg: Don't reference nir_dest
581- pan/bi: Don't reference nir_dest
582- asahi: Do not reference nir_dest
583- panfrost: Do not reference nir_dest
584- zink: Do not reference nir_dest
585- ir3: Do not reference nir_dest
586- dxil: Do not reference nir_dest
587- nir: Drop nir_dest_init
588- panfrost: Pack stride at CSO create time on v9
589- lvp,nir/lower_input_attachments: Use nir_trim_vector
590- broadcom/compiler: Use nir_trim_vector explicitly
591- nir: Assert that nir_ssa_for_src components matches
592- nir: Add nir_shader_intrinsics_pass
593- nir: Lower fquantize2f16
594- agx: Lower fquantize2f16
595- nir/lower_helper_writes: Consider bindless images
596- nir/passthrough_gs: Correctly set vertices_in
597- nir/passthrough_gs: Fix array size
598- nir/print: Print access qualifiers for intrinsics
599- nir/lower_gs_intrinsics: Remove end primitive for points
600- panfrost/ci: Disable T720
601- nir: Add load_sysval_agx intrinsic
602- agx: Fix extraneous bits with b2b32
603- agx: Use more barriers
604- asahi: Copy CSO stride
605- agx: Assert vertex_id, instance_id are VS-only
606- asahi: Keep drawoverhead from OOMing itself
607- agx: Don't blow up when lowering textures twice
608- agx/lower_vbo: Handle nonzero component
609- agx: Allow loop headers without later preds
610- agx: Handle b2i8
611- agx: Convert 8-bit comparisons
612- agx: Implement imul_high
613- asahi: Advertise OpenGL ES 3.1!
614- asahi/decode: Turn assert into error
615- asahi: Report local_size from compiler
616- asahi: Use local_size from compiler directly
617- asahi: Pass layer stride in pixels, not elements
618- agx: Clear sample count after lowering MSAA
619- agx: Clear image_array after lowering
620- asahi: Preserve atomic ops when rewriting image to bindless
621- agx: Use 16-bit reg for pixel_coord
622- asahi: Generalize query logic
623- asahi: Simplify occlusion query batch tracking
624- asahi: Refactor agx_get_query_result
625- asahi: Only touch batch->occlusion_queries for occlusion
626- asahi: Sync when beginning a query
627- asahi: Add non-occlusion query tracking
628- asahi: Add get_query_address helper
629- agx/fence_images: Use intrinsics_pass
630- agx: Do not fence write-only images
631- asahi: Add missing LOD source for agx_meta's txfs
632- agx: Do some texture lowering early
633- agx: Add helper returning if a descriptor crawl is needed
634- nir,asahi: Remove texture_base_agx
635- asahi: Move UBO lowering into GL driver
636- asahi: Add sysval tables for each shader stage
637- asahi: Split out per-stage sysvals
638- asahi: Collapse grid_info
639- asahi: Extract agx_upload_textures
640- asahi: Upload a single draw_uniforms per draw
641- asahi: Add real per-stage dirty flags
642- asahi: Extract sampler upload
643- asahi: Put unuploaded uniforms on the batch
644- asahi: Decouple sysval lowering from uniform assignment
645- asahi: Use finer dirty tracking for blend constant
646- asahi: Use proper dirty tracking for VBOs
647- asahi: Dirty track VBOs + blend const separately
648- asahi: Dirty the shader stage when the shader changes
649- asahi: Fix shader stage dirtying
650- treewide: Use nir_shader_intrinsic_pass sometimes
651- treewide: Also handle struct nir_builder form
652- nir/lower_shader_calls: Fix warning with clang
653- nir: Add nir_before/after_impl cursors
654- treewide: Use nir_before/after_impl in easy cases
655- treewide: Use nir_before/after_impl for more elaborate cases
656- radv: Use before/after_cf_list for entrypoints
657- ci: Disable known broken Bifrost Vulkan job
658- ci: Disable WHL jobs
659- nir/opt_if: Simplify if's with general conditions
660- asahi: Fixes for clang-warnings
661- agx: Fix jmp_exec_none encoding
662- agx/validate: Print to stderr
663- agx: Annotate opcodes with a scheduling class
664- agx: Add schedule-specialized get_sr variants
665- agx: Include schedule class in the opcode info
666- agx: Schedule for register pressure
667- agx: Lower pack_32_4x8_split
668- asahi: Force translucency for ignored render targets
669- agx: Remove logical_end instructions
670- agx: Lower pseudo-ops later
671- agx: Expand nest
672- agx: Lower nest later
673- agx: Split nest instruction into begin_cf + break
674- agx: Add break_if_*cmp instructions
675- agx: Add agx_first/last_instr helpers
676- agx: Use agx_first_instr
677- agx: Detect conditional breaks
678- agx: Omit push_exec at top level
679- agx: Omit while_icmp without continue
680- agx: Add helper to determine if a NIR loop uses continue
681- agx: Only use nest by 1 for loops w/o continue
682- agx: Add pseudo-instructions for icmp/fcmp
683- agx: Generate unfused comparison pseudo ops
684- agx: Fuse conditions into if's
685- agx: Fuse compares into selects
686- agx: Add unit test for if_cmp fusing
687- agx: Add unit test for cmp+sel fusing
688- asahi: Translate cube array dimension
689- ail: Force page-alignment for layered attachments
690- agx: Handle cube arrays when clamping arrays
691- agx: Lower coordinates for cube map array images
692- agx: Run opt_idiv_const after lowering texture
693- asahi: Forbid linear 1D Array images
694- asahi: Handle linear 1D Arrays
695- asahi: Conditionally expose cube arrays
696- gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag
697- asahi: Skip LOD bias lowering for GLES
698- nir: Add nir_function_instructions_pass helper
699- nir: Add NIR_OP_IS_DERIVATIVE property
700- nir: Hoist nir_op_is_derivative
701- nir/opt_preamble: Use nir_op_is_derivative
702- nir/opt_gcm: Use nir_op_is_derivative more
703- nir/gather_info: Use nir_op_is_derivative
704- nir/opt_sink: Sink load_constant_agx
705- nir/opt_sink: Sink load_local_pixel_agx
706- nir/opt_sink: Sink frag coord instructions
707- nir/opt_sink: Do not move derivatives
708- nir/opt_sink: Move ALU with constant sources
709- nir/opt_sink: Also consider load_preamble as const
710- agx: Enable sinking ALU
711- treewide: Drop nir_ssa_for_src users
712- treewide: Remove remaining nir_ssa_for_src
713- nir: Remove nir_ssa_for_src
714- asahi: Clamp index buffer extent to what's read
715- agx: Align the reg file for 256-bit vectors
716- agx: Hoist sample_mask/zs_emit
717- agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED
718- agx: Augment if/else/while_cmp with a target
719- agx: Add jumps to block ends
720- agx: Add agx_prev_block helper
721- agx: Insert jmp_exec_none instructions
722- nir: Add layer_id_written_agx sysval
723- nir: Support arrays in block_image_store_agx
724- agx/nir_lower_texture: Allow disabling layer clamping
725- agx: Pack block image store dim correctly
726- agx: Handle layered block image stores
727- agx: Add pass to lower layer ID writes
728- asahi: Add helper to get layer id in internal program
729- asahi,agx: Select layered rendering outputs
730- agx: Support packed layered rendering writes
731- agx/tilebuffer: Support layered layouts
732- agx/lower_tilebuffer: Support spilled layered RTs
733- asahi: Use layered layouts
734- asahi: Expose VS_LAYER_VIEWPORT behind a flag
735- asahi: Account for layering for attachment views
736- asahi: Assume LAYER is flat-shaded
737- asahi: Add pass to predicate layer ID reads
738- asahi: Predicate layer ID reads
739- asahi: Write to cubes/etc attachments as 2D array
740- asahi: Use a 2D Array texture for array render targets
741- asahi: Generate layered EOT programs
742- asahi: Handle layered background programs
743- lima/pp: Do not use union undefined behaviour
744- nir: Add trivial nir_src_* getters
745- nir: Use set_parent_instr internally
746- nir: Use getters for nir_src::parent_*
747- nir: Assert the nir_src union is used safely
748- nir: Use a tagged pointer for nir_src parents
749- nir: Add ACCESS_CAN_SPECULATE
750- ir3: Set CAN_SPECULATE before opt_preamble
751- ir3: Model cost of phi nodes for opt_preamble
752- nir/opt_preamble: Walk cf_list manually
753- nir/opt_preamble: Preserve IR when replacing phis
754- nir/opt_preamble: Unify foreach_use logic
755- nir/opt_preamble: Move phis for movable if's
756- nir/opt_preamble: Respect ACCESS_CAN_SPECULATE
757- freedreno/ci: Minetest
758- r600/sfn: Handle load_global_constant
759- nir/opt_phi_precision: Work with libraries
760- nir/legalize_16bit_sampler_srcs: Use instr_pass
761- nir/print: Handle KERNEL
762- nir/lower_io: Use load_global_constant for OpenCL
763- nir/opt_algebraic: Reduce int64
764- nir/opt_algebraic: Optimize LLVM booleans
765- nir/trivialize_registers: Handle obscure load hazard
766- hasvk: Support builiding on non-Intel
767- crocus: Support building on non-Intel
768- meson: Add vulkan-drivers=all option
769- meson: Add gallium-drivers=all option
770- agx: Fix fragment side effects scheduling
771
772Amber (7):
773
774- ir3: make wave_granularity configurable
775- turnip: Add support for devices not supporting double thread size.
776- turnip: make sampler_minmax support configurable.
777- freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low
778- ir3: handle non-uniform case for atomic image/ssbo intrinsics
779- freedreno: Add support for devices not supporting double thread size.
780- turnip: Add debug option to allow non-conforming features.
781
782Andrew Randrianasulu (1):
783
784- nv50/ir: Remove few nvc0 specific defines from nv50-specific header.
785
786Antonio Gomes (9):
787
788- rusticl/kernel: Removing unnecessary clone in kernel launch
789- rusticl/kernel: Add CsoWrapper
790- rusticl/compiler: Add NirPrintfInfo
791- rusticl: Move Cso to Program
792- rusticl/compiler: Remove unnecessary functions
793- rusticl: Move NirKernelBuild to ProgramDevBuild
794- rusticl/program: New helper functions to NirKernelBuild
795- rusticl/core: Delete KernelDevState and KernelDevStateInner
796- rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader)
797
798Asahi Lina (29):
799
800- docs/tgsi: Specify that depth texture fetches are replicated
801- asahi: Add synctvb debug flag
802- asahi: Add smalltile debug option
803- asahi: Add nomsaa debug flag
804- asahi: decode: Add a params argument to pass through
805- asahi: Add extra CDM header block for G14X
806- asahi: wrap: Handle freeing shmems
807- asahi: decode: Refactor to always copy GPU mem to local buffers
808- asahi: decode: Add a function to construct decode_params from a chip_id
809- asahi: Add a shared library interface for decode
810- asahi: Add a noshadow debug flag
811- asahi: Do not overallocate BOs by more than 2x
812- asahi: Fix race in BO stats accounting
813- asahi: Always use resource size, not BO size
814- asahi: Print info about shadowed resources
815- asahi: Impose limits on resource shadowing
816- asahi: Force linear for SHARED buffers with no/implicit modifier
817- asahi: Enable explicit coherency for G14D (multi-die)
818- asahi: Handle non-written RTs correctly
819- asahi: Fix incorrect BO bitmap reallocations
820- asahi: Allocate staging resources as staging
821- asahi: cmdbuf: Identify call/ret bits
822- asahi: decode: Implement VDM call/ret
823- asahi: decode: Do not assert on buffer overruns
824- asahi: Fix VDM pipeline field width
825- asahi: Add scaffolding for supporting driconf options
826- asahi: Add and support the no_fp16 driconf flag
827- driconf: Disable fp16 for browsers
828- asahi: Allow no16 flag for disk cache
829
830Bas Nieuwenhuizen (16):
831
832- aco: fix nir_op_vec8/16 with 16-bit elements.
833- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
834- nir: Fix 16-component nir_replicate.
835- radv: Expose VK_EXT_external_memory_acquire_unmodified.
836- util/perf: Add gpuvis integration.
837- egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto.
838- vulkan: Add CPU tracing for vkWaitForFences.
839- docs: Add documentation for gpuvis.
840- vulkan: Add trace points for more Vulkan waiting functions.
841- radv: Use a double jump to limit nops in DGC for dynamic sequence count.
842- nir: Add AMD cooperative matrix intrinsics.
843- aco: Add WMMA instructions.
844- aco: Make RA understand WMMA instructions.
845- radv: Don't transparently use wave32 with cooperative matrices.
846- radv: Add cooperative matrix lowering.
847- radv: Expose VK_KHR_cooperative_matrix.
848
849Benjamin Cheng (10):
850
851- radv/video: use app provided hevc scaling list order
852- radv/video: copy from correct H264 scaling lists
853- anv/video: copy from correct H264 scaling lists
854- vulkan/video: add helper to derive H264 scaling lists
855- radv/video: use vk_video_derive_h264_scaling_list
856- anv/video: use vk_video_derive_h264_scaling_list
857- util/vl: extract gallium vl scanning data to shared code
858- radv/video: send h264 scaling list in raster order
859- anv/video: send h264 scaling list in raster order
860- radv/video: find SPS with pps_seq_parameter_set_id
861
862Benjamin Lee (1):
863
864- nvk: Fix segfault when opening DRI device file returns error
865
866Biswapriyo Nath (1):
867
868- radv/video: Match function definitions to declarations
869
870Boris Brezillon (1):
871
872- panfrost: Flag the right shader when updating images
873
874Boyuan Zhang (3):
875
876- virgl: Add vp9 picture desc
877- virgl: Implement vp9 hardware decode
878- radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0
879
880Caio Oliveira (134):
881
882- nir: Use instructions_pass() for nir_fixup_deref_modes()
883- meson: Ensure that LLVMSPIRVLib is not required for Clover
884- nir: Let nir_fixup_deref_modes() fix deref_casts when possible
885- nir: Add nir_opt_reuse_constants()
886- radv: Use nir_opt_reuse_constants()
887- compiler/types: Use ralloc for the key in array_types
888- compiler/types: Use smaller keys for array_types table
889- compiler/types: Extract get_explicit_matrix_instance() function
890- compiler/types: Use smaller keys for explicit_matrix_types table
891- anv/tests: Refactor state_pool_test_helper to not use macros for parametrization
892- anv/tests: Link a single anv_tests binary using gtest
893- anv/tests: Propagate failures to gtest
894- hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization
895- hasvk/tests: Link a single hasvk_tests binary using gtest
896- hasvk/tests: Propagate failures to gtest
897- util: Add convenience macros for linear allocator
898- compiler/types: Use right hash for function types
899- compiler/types: Don't duplicate empty string
900- compiler/types: Constify a couple of pointers in glsl_type
901- compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions
902- compiler/types: Move GLSL specific builtin structs into glsl/
903- glsl: Add missing glsl_types initialization to test_optpass
904- glsl: Don't create struct type builtins
905- compiler/types: Add extra level of macro to builtin_macros
906- compiler/types: Use designated initializer syntax to specify builtins
907- compiler/types: Move local cache details to implementation file
908- compiler/types: Add a mem_ctx for the glsl_type_cache
909- compiler/types: Use type cache mem_ctx for hash tables
910- compiler/types: Don't store a mem_ctx per type
911- compiler/types: Simplify clearing the glsl_type_cache
912- compiler/types: Move static asserts about glsl_type to a central place
913- compiler/types: Store builtin types directly as data
914- compiler/types: Use a linear (arena) allocator for glsl_types
915- compiler/types: Make struct glsl_type visible to C code
916- compiler/types: Add workaround to use builtin_type_macros.h in C
917- compiler/types: Move builtin type initialization to C
918- glsl: Annotate _mesa_glsl_error() with PRINTFLIKE
919- compiler/types: Fix array name dimension flipping for unsized arrays
920- compiler/types: Use Python to generate code for builtin types
921- compiler/types: Use glsl_get_type_name() to access the type name
922- compiler/types: Change glsl_type::name to be an uintptr_t
923- compiler/types: Use a string table for builtin type names
924- intel/compiler/xe2: Account for reg_unit() in TCS intrinsics
925- intel/compiler/xe2: Account for reg_unit() in TES intrinsics
926- intel/fs/xe2+: Update BS payload setup for Xe2 reg size.
927- intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size.
928- compiler: Use a meson dependency for libcompiler
929- meson: Remove unnecessary inc_compiler mentions
930- rusticl: Ensure NIR generated headers will be available
931- clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV
932- clover: Only compile/depend libclspirv and libclnir when using SPIR-V support
933- compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support
934- intel/compiler: Don't allocate memory for SIMD select error handling
935- microsoft/compiler: Fix printf formatting string issues
936- util: Add more PRINTFLIKE and MALLOCLIKE annotations
937- util: Remove ralloc_parent from linear_header
938- util: Use linear parent to (r)allocated extra nodes
939- util: Remove size from linear_parent creation
940- util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors
941- util: Use an opaque type for linear context
942- util: Remove usages of linear_realloc()
943- util: Remove linear_realloc()
944- util: Remove size information from child allocations
945- util: Remove per-buffer header in linear alloc for release mode
946- util: Add a few basic tests for linear_alloc
947- util: Fix bookkeeping of linear node sizes
948- intel/compiler: Don't store stage name and abbrev
949- intel/compiler/xe2: URB fence uses LSC now
950- intel/compiler/xe2: Fix URB writes in TCS
951- intel/compiler/xe2: Update TCS ICP handle code to support SIMD16
952- compiler/types: Add support for Cooperative Matrix types
953- nir: Add new intrinsics for Cooperative Matrix
954- nir: Handle cooperative matrix in various passes
955- spirv: Expose some memory related functions in vtn_private.h
956- spirv: Let vtn_ssa_value hold references to variables
957- spirv: Implement SPV_KHR_cooperative_matrix
958- compiler/types: Remove private related declarations
959- compiler/types: Remove use of new/delete
960- compiler/types: Remove use of references
961- compiler/types: Remove use of auto
962- compiler/types: Use C compatible cast syntax
963- compiler/types: Spell struct and enum in type names
964- compiler/types: Add void parameter to ensure these are valid C prototypes
965- intel/fs: Tweak default case of fs_inst::size_read()
966- compiler/types: Move the C++ inline functions in glsl_type out of the struct body
967- compiler/types: Move C declarations into glsl_types.h
968- compiler/types: Flip wrapping of base_type checks
969- compiler/types: Flip wrapping of various type identification checks
970- compiler/types: Flip wrapping of convenience accessors for vector types
971- compiler/types: Flip wrapping of basic "get type" functions
972- rusticl: Add Rust bindings for inline glsl_types functions
973- util: Add size to ralloc_header in debug mode
974- util: Add a canary to identify gc_ctx in debug mode
975- util: Add function print information about a ralloc tree
976- util: Avoid waste space when linear alloc'ing large sizes
977- spirv: Expose stage enum conversion in vtn_private.h
978- spirv: Change spirv2nir to use the shorter shader name abbreviations
979- spirv: List entry-points in spirv2nir when unsure what to use
980- spirv: Let spirv2nir find out the shader to use
981- intel/compiler: Don't emit calls to validate() in release build
982- compiler/types: Flip wrapping of "type contains?" predicate functions
983- compiler/types: Flip wrapping of array related functions
984- compiler/types: Flip wrapping of cmat related functions
985- compiler/types: Flip wrapping of CL related functions
986- compiler/types: Flip wrapping of size related functions
987- compiler/types: Flip wrapping of struct related functions
988- compiler/types: Flip wrapping of interface related functions
989- compiler/types: Flip wrapping of layout related functions
990- compiler/types: Flip wrapping of record_compare
991- compiler/types: Flip wrapping of get_instance()
992- compiler/types: Flip wrapping of texture/sampler/image get instance functions
993- compiler/types: Flip wrapping of various get instance functions
994- compiler/types: Flip wrapping of get row/column type helpers
995- compiler/types: Flip wrapping of remaining non-trivial type getters
996- compiler/types: Flip wrapping of remaining small data getters
997- compiler/types: Flip wrapping of numeric type conversion functions
998- compiler/types: Move remaining code from nir_types to glsl_types
999- rusticl: Add bindings for glsl_vector_type()
1000- compiler/types: Add more glsl_contains_*() functions and use them in C++
1001- compiler/types: Add glsl_get_mul_type() and use it in C++
1002- compiler/types: Add glsl_type_compare_no_precision() and use it in C++
1003- compiler/types: Add glsl_type_uniform_locations() and use it in C++
1004- compiler/types: Add glsl_get_std430_array_stride() and use it in C++
1005- compiler/types: Add glsl_get_explicit_*() functions and use them in C++
1006- compiler/types: Implement glsl_type::field_type() in terms of existing functions
1007- compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type()
1008- compiler/types: Add remaining type extraction functions and use them in C++
1009- compiler/types: Use C instead of C++ constants for builtin types
1010- compiler/types: Remove usages of C++ members in glsl_types.cpp
1011- compiler/types: Annotate extern "C" only once in glsl_types.cpp
1012- compiler/types: Rename glsl_types.cpp to glsl_types.c
1013- compiler/types: Remove warnings about potential fallthrough
1014- compiler/types: Move comments and reorganize declarations
1015- anv: Fix leak when compiling internal kernels
1016
1017Carsten Haitzler (2):
1018
1019- kmsro: Add hdlcd DPU
1020- panfrost: Add GPU variant of G57 to the set of known ids
1021
1022Charles Giessen (1):
1023
1024- panvk: Use 1.0 in ICD Manifest json
1025
1026Charmaine Lee (8):
1027
1028- svga: set clear_texture to NULL for vgpu9
1029- svga: fix stride used in vertex declaration
1030- svga: fix persistent mapped surface update to constant buffer
1031- svga: restrict use of rawbuf for constant buffer access to GL43 device
1032- svga: fix immediates used in rawbuf for constant buffer
1033- svga: use srv raw buffer for accessing readonly shader buffer
1034- svga: sync resource content from backing resource before image upload
1035- svga: ignore sampler view resource if not used by shaders
1036
1037Chia-I Wu (38):
1038
1039- radv: fix separate depth/stencil layouts in fb state
1040- radv: fix separate depth/stencil layouts in resolve meta
1041- radv: refactor depth clear in clear meta
1042- radv: fix separate depth/stencil layouts in clear meta
1043- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear
1044- radv: disable tc-compat htile for layered images on gfx8
1045- amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve
1046- winsys/amdgpu: fix a race between import and destroy
1047- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
1048- winsys/radeon: fix a race between bo import and destroy
1049- vulkan/runtime: add a helper for ETC2 emulation
1050- radv: use vk_tecompress_etc2 from the runtime
1051- vulkan/runtime: fix image type check for ETC2 emulation
1052- vulkan/runtime: fix a harmless typo for ETC2 emulation
1053- vulkan/runtime, radv: remove 1D support from ETC2 emulation
1054- radv: add radv_is_format_emulated
1055- radv: simplify view format override for emulated formats
1056- radv: hard code format features for emulated formats
1057- mesa: make astc_decoder.glsl vk-compatible
1058- radv, drirc: rename radv_require_{etc2,astc}
1059- anv: remove unused field from anv_image_view
1060- anv: add anv_image_view_{init,finish}
1061- anv: support image views with surface state stream
1062- anv: add anv_push_descriptor_set_{init,finish}
1063- anv: support alternative push descriptor sets
1064- anv: add anv_descriptor_set_write
1065- anv: add anv_cmd_buffer_{save,restore}_state
1066- anv: add anv_is_format_emulated
1067- anv: add a hidden plane for emulated formats
1068- anv: decompress on upload for emulated formats
1069- anv: fix up image views for emulated formats
1070- anv: fix up blit src for emulated formats
1071- anv: advertise emulated formats
1072- anv: add support for vk_require_astc driconf
1073- util: improve BITFIELD_MASK and BITFIELD64_MASK on clang
1074- anv: prep for gen9 astc workaround
1075- anv: add gen9 astc workaround
1076- radv: fix image view extent override for astc
1077
1078Chris Spencer (9):
1079
1080- radv: initialize result when pipeline cache creation fails
1081- anv/android: Fix importing hardware buffers with planar formats
1082- anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12
1083- anv: Advertise Vulkan 1.3 on Android 13
1084- anv: Don't reject Android image format if external props not supplied
1085- android: Add explanatory comment to u_gralloc
1086- anv/android: Enable shared presentable image support
1087- anv/video: use correct enum value for max level IDC
1088- radv/video: use correct enum value for max level IDC
1089
1090Christian Gmeiner (41):
1091
1092- nir/print: print instr pass_flags
1093- etnaviv: move nir texture lowerings into one pass
1094- nir: add enta specific intrinsic used for txs lowering
1095- etnaviv: nir: support intrinsic used for txs lowering
1096- etnaviv: nir: lower nir_texop_txs
1097- ci/etnaviv: update ci expectations
1098- etnaviv: make use of BITFIELD_BIT(..) macro
1099- etnaviv: name the enum used for pass_flags
1100- etnaviv: add is_dead_instruction(..) helper
1101- etnaviv: extend etna_pass_flags with source modifiers
1102- etnaviv: do not clear all pass_flags before RA
1103- etnaviv: nir: look at parent instr in lower_alu(..)
1104- etnaviv: nir: add etna_nir_lower_to_source_mods(..)
1105- etnaviv: nir: switch to etna_nir_lower_to_source_mods(..)
1106- etnaviv: nir: convert to new-style NIR registers
1107- freedreno/regs: remove double assignment of self.current_domain
1108- freedreno/regs: remove not used variable
1109- freedreno/regs: remove dead code
1110- freedreno/regs: python does not need ';'
1111- etnaviv: switch to log2f(..)
1112- etnaviv: switch to U_FIXED(..) macro
1113- etnaviv: switch to S_FIXED(..) macro
1114- etnaviv: fix null pointer dereference
1115- etnaviv: switch to float_to_ubyte(..)
1116- ci/etnaviv: update ci expectation
1117- etnaviv: unbreak cmdline compiler
1118- agx/lower_address: Use intrinsics_pass
1119- agx/lower_address: Remove not used has_offset
1120- isaspec: python does not need ';'
1121- docs: Move isaspec out of drivers/freedreno
1122- isaspec: Add support for templates
1123- isaspec: encode: Correct used regex
1124- isaspec: Add method to get all instrustions
1125- isaspec: Add support for custom meta information
1126- isaspec: Add BitSetEnumValue object
1127- spirv: Don't use libclc for rotate
1128- docs: update etnaviv extensions
1129- etnaviv: drm: Be able to mark end of context init
1130- etnaviv: Skip 'empty' cmd streams
1131- ci: Bump PyYAML to 6.0.1
1132- etnaviv: Don't leak disk_cache
1133
1134Collabora's Gfx CI Team (2):
1135
1136- Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1
1137- Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0
1138
1139Cong Liu (2):
1140
1141- r300: Fix out-of-bounds access in ntr_emit_store_output()
1142- virgl:Fix ITEM_CPY macro pointer copy bug
1143
1144Connor Abbott (83):
1145
1146- afuc: Rework and significantly expand README.rst
1147- tu: Fix vk2tu_*_stage flag type
1148- tu: Fix and simplify execution dependency handling
1149- tu, freedreno/a6xx: Remove has_ccu_flush_bug
1150- ir3: Handle GS stream "mixing" with non-point output primitives
1151- tu: Disable transformFeedbackPreservesProvokingVertex
1152- isaspec: Add "displayname" for altering {NAME} when decoding
1153- isaspec: Add support for "absolute" branches
1154- isaspec: Add support for function and entrypoint labels
1155- isaspec: Add "custom" field type
1156- isaspec: Add callback after decoding an instruction
1157- isaspec: Rename isa_decode() to isa_disasm()
1158- isaspec: Add initial decoding support
1159- afuc: Fix xmov lexer typo
1160- afuc: Convert to isaspec
1161- afuc: Add setbit/clrbit
1162- afuc: Fix writing $00
1163- freedreno/afuc: Initial a7xx support
1164- ir3: Parse (eq) flag
1165- ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD
1166- tu: Add missing last_baryf statistic
1167- freedreno, tu, ir3: Add last_helper statistic
1168- ir3: Gather pixlod status earlier
1169- ir3: Implement helper invocation optimization
1170- vk/graphic_state, tu: Use dynamic blend count from subpass
1171- freedreno/a7xx: Add CP_RESET_CONTEXT_STATE
1172- vk/graphics_state: Fix copying MS locations pipeline state
1173- tu: Remove MSAA draw state
1174- tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states
1175- tu: Merge PC_RASTER_CNTL into RAST draw state
1176- tu: Stop reusing base Vulkan dynamic state enums
1177- tu: Merge depth/stencil draw states
1178- tu: Rename PrimID-related registers
1179- tu, freedreno/a6xx: Don't use VS for PrimID passthru state
1180- tu: Pull entangled shader state into program config
1181- ir3: Add ir3_find_input_loc() helper
1182- tu: Split up tu6_emit_vpc()
1183- freedreno, ir3, tu: Constify various uses of ir3_shader_variant
1184- ir3: Add helper to determine when variant exceeds safe constlen
1185- tu: Split program draw state into per-shader states
1186- tu: Fix per-view viewport state propagation
1187- tu: Fix tu6_emit_*_fdm size call
1188- tu: Fix assert in FDM state emission
1189- tu: Actually emit patchpoint for viewports with FDM
1190- nir/lower_subgroups: Don't do multiple lowerings at once
1191- nir/spirv: Add inverse_ballot intrinsic
1192- amd: Use inverse ballot intrinsic if available
1193- tu: Create singleton "empty" shaders
1194- tu: Start tracking shaders independently of pipeline
1195- tu: Move FS-specific pipeline information to the shader
1196- tu: Use shader directly for VS/TCS output size and patch size
1197- tu: Rewrite tessellation modes handling
1198- tu: Rework passing shared consts
1199- tu: Decouple program state from the pipeline
1200- tu: Use pipeline feedback loop flag indirectly
1201- tu: Rewrite remaining pipeline LRZ handling
1202- tu: Don't reference pipeline for some draw states
1203- tu: Make compute dispatch use the shader
1204- tu: Don't use pipeline for dynamic draw states
1205- tu: Don't use pipeline for bandwidth validity
1206- tu: Don't use pipeline for per_view_viewport
1207- tu: Don't use pipeline for active stages
1208- tu: Remove pipeline from state
1209- zink: Rework color clamping and conversion
1210- freedreno/fdl: Use A8_UNORM HW format for sampling
1211- tu: Support clearing A8_UNORM
1212- freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx
1213- tu/clear_blit: Fix staging image view layer count
1214- tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount
1215- tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes
1216- tu: Implement vkCmdBindIndexBuffer2KHR
1217- tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR
1218- tu: Implement vkGetRenderingAreaGranularityKHR
1219- tu: Use new buffer usage flags
1220- tu: Support VkPipelineCreateFlags2CreateInfoKHR
1221- tu: Check for DEVICE_LOST in vkGetEventStatus()
1222- tu: Add maintenance5 properties
1223- freedreno/ci: Skip dEQP-VK.info.device_extensions
1224- tu: Expose VK_KHR_maintenance5
1225- freedreno/ci: Remove minetest trace
1226- v3d/ci: Remove minetest trace
1227- ir3/ra: Don't swap killed sources for early-clobber destination
1228- tu: Fix re-emitting VS param state after it is re-enabled
1229
1230Corentin Noël (16):
1231
1232- ci: Add locked flag to bindgen-cli installation
1233- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
1234- ci: disable Collabora's LAVA lab for maintenance
1235- llvmpipe: make sure to initialize the lp_setup_context slots with the default values
1236- virgl: Cover all the formats defined in the virgl definition
1237- mesa: Ensure that the baselevel will never exceed the maximal supported number
1238- ci: Uprev virglrenderer
1239- freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
1240- tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
1241- aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
1242- venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE
1243- util: Avoid the use of MESA_TRACE_BEGIN/END
1244- util/perf: Remove the tracing categories
1245- util: Remove MESA_TRACE_BEGIN/END
1246- mesa/bufferobj: ensure that very large width+offset are always rejected
1247- frontends/va: Remove wrong use of ProfileToPipe
1248
1249Daniel Schürmann (9):
1250
1251- nir/opt_move: fix handling of if-condition
1252- aco: append p_logical_end after monolithic RT shaders
1253- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
1254- aco: don't optimize cross-lane instructions across p_wqm
1255- aco: make p_wqm a marker instruction without Operands/Definitions
1256- aco: don't insert a copy when emitting p_wqm
1257- aco: insert a single p_end_wqm after the last derivative calculation
1258- aco/insert_exec_mask: Simplify WQM handling (1/2)
1259- aco/insert_exec_mask: Simplify WQM handling (2/2)
1260
1261Daniel Stone (23):
1262
1263- dri: Support 1555/4444 formats
1264- egl/dri2: Don't look up image extension twice
1265- egl/wayland: Always initialise fd_display_gpu
1266- egl/wayland: Add image loader extension for swrast
1267- egl/wayland: Never use DRI2_LOADER extension
1268- egl/wayland: Assume modern DRI interface versions
1269- egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER
1270- egl/drm: Assume modern DRI interface versions
1271- ci: Disable nouveau CI
1272- panfrost/vk: Use correct sampler dimensions for MSAA
1273- ci: Declare stages before jobs
1274- ci/radeonsi: Add new flake
1275- ci/d3d12: Add new flake
1276- ci/intel: Add new skqp flake
1277- ci/zink: Add new zink-lvp flakes
1278- ci/radeonsi: Skip more really slow tests
1279- ci/zink: Add another conversion fail on a618
1280- ci: Move farm-disable rules before anything else
1281- ci: Always set user container jobs to manual
1282- ci: Use container rules for containers
1283- ci: Only look at file changes for MRs
1284- ci: Fix pre-merge pipelines with no code changes
1285- ci: Try really hard to print final result string
1286
1287Daniel van Vugt (1):
1288
1289- glx: Increment dpy->request before issuing an error that had no request
1290
1291Danylo Piliaiev (71):
1292
1293- freedreno/cffdec: Decode CP_DRAW_AUTO
1294- freedreno, turnip: Clarify some RB_CCU_CNTL fields
1295- freedreno,turnip: Make number of VSC pipes configurable
1296- freedreno,turnip: Make CS shared memory size configurable
1297- freedreno,turnip: Make VS input attr/binding count configurable
1298- freedreno: Add A605, A608, A610, A612 GPUs definition
1299- turnip: Make multiview support configurable per generation
1300- ir3: Make FS tex prefetch optimization optional
1301- ir3: Use NIR info to enable per sample shading
1302- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
1303- ir3: Fix FS quad ops returning wrong values from helper invocations
1304- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
1305- radv: fix unused non-xfb shader outputs not being removed
1306- vulkan/nir: Add common helper to check if output is XFB
1307- radv: Use common nir_vk_is_not_xfb_output
1308- turnip: Use common nir_vk_is_not_xfb_output
1309- freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields
1310- freedreno/registers: Refactor gen_header.py to allow more options
1311- freedreno/registers: Generate python files with reg offsets
1312- freedreno: Add a list of raw magic regs
1313- freedreno: Fully define a730 and a740 device properties
1314- ir3/tests: Use fd_dev_info to infer GPU generation
1315- freedreno/computerator: Fix remaining issues with A7XX
1316- isaspec: Make possible to obtain gpu_id in <expr> blocks
1317- ir3/a7xx: cat5 mode1 has swapped tex/samp ids
1318- ir3/a7xx: Don't multiply global mem instruction's offset by 4
1319- ir3/a7xx: insert lock/unlock at the end of every compute shader
1320- ir3/a7xx: Add ccinv instruction
1321- ir3/a7xx: Use ccinv for data synchronization
1322- ir3/a7xx: Disable shared consts for a7xx
1323- tu/common: Generalize TU_GENX macro
1324- tu: Basic a7xx support
1325- freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx
1326- tu/a7xx: Fix geometry shaders
1327- tu/a7xx: Fix tesselation shaders
1328- tu/a7xx: Fix multiview
1329- tu/a7xx: Fix flat shading
1330- tu/a7xx: Fix occlusion query
1331- tu/a7xx: Fix 3d blits after multiview usage
1332- tu/a7xx: Fix CmdDrawIndirectByteCountEXT
1333- tu/a7xx: Disable LRZ
1334- ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset
1335- tu: Exclude SP_UNKNOWN_AE73 from reg stomping
1336- tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen
1337- freedreno/replay: Add limited support for KGSL
1338- freedreno/rddecompiler: Update to handle a7xx
1339- freedreno/replay: Add "print" instr to ir3 asm to be used in replay
1340- freedreno/replay: Add "gpu_print" function for command streams
1341- tu/perfetto: Remove now unnecessary tu_perfetto_util
1342- tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit
1343- tu/kgsl: Fix memory leak of tmp allocations during submissions
1344- tu/kgsl: Support u_trace and perfetto
1345- tu/a7xx: Correctly record timestamps for u_trace
1346- tu/virtio: Fix incorrect call to tu_perfetto_submit
1347- ci: Compile Turnip's virtio kmd in debian-arm64
1348- freedreno/registers: Refine a7xx push consts registers
1349- ir3,tu: Refactor push consts info plumbing
1350- freedreno: Make possible to specify A7XX feature flags
1351- turnip,ir3: Implement A7XX push consts load via preamble
1352- tu: Add push_consts_per_stage debug option
1353- tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc
1354- tu/kgsl: Fix field order in kgsl_command_object init
1355- tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering
1356- tu: Zero init tu_render_pass and tu_subpass for dynamic rendering
1357- tu: Disable preamble push consts when they are not used
1358- ir3: Fix values of #wrmask not being compatible with ir3 parser
1359- tu: Count a whole push consts range in constlen for PREAMBLE push consts
1360- freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm
1361- freedreno/rddecompiler: Decompile repeated IBs
1362- freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH
1363- tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0
1364
1365Dave Airlie (163):
1366
1367- ci: remove binding model from the asan skips for lavapipe.
1368- gallivm: fix atomic global temporary storage.
1369- llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
1370- nvk: add missing finish calls
1371- nvk: add some initial wsi framework.
1372- nvk: fix header guards to be less generic.
1373- nvk: add bind buffer memory
1374- nvk: Add initial queue
1375- nvk: add cmd buffer framework
1376- nvk: Reset pushbufs on command buffer reset
1377- nvk: reindent descriptor sets to mesa std.
1378- nvk: add initial descriptor pool framework.
1379- nvk: some boilerplate for descriptor sets
1380- nvk: add descriptor set bo allocation.
1381- nvk: implement buffer address.
1382- nvk: descriptor set freeing fix
1383- nvk: move to new command stream generator.
1384- nvk: port the blit and copy code to new command submission.
1385- nouveau/ws: drop the old push generators.
1386- nvk: link in codegen without gallium bits.
1387- nvk: Initial wiring in of the compiler
1388- nvk: Basic descriptor binding
1389- nouveau/vk: add support for compute classes to generator.
1390- nvk: retrieve gpc/mp counts from kernel.
1391- nvk: add support for preamble and tls allocation.
1392- nvk: add record result to cmd_buffer.
1393- nvk: add command stream upload buffer.
1394- nouveau/winsys: Add m2mf/compute objects
1395- nvk: add some basic format wrapping framework
1396- nvk: add some compute limits
1397- nvk: add basic nve4+ compute support.
1398- nvk: fix empty cmd submission.
1399- nouveau/ws: add a push reset just for references.
1400- nouveau/classes: add 906f header support.
1401- nvk: add initial 8/16 byte clears.
1402- nvk: fix pipeline pushbuf sizing
1403- nvk: increase graphics cpu push buffer
1404- nvk: fix depth emission ordering.
1405- nvk: add some limits/features from binary driver.
1406- nvk: add indexed draw support.
1407- nvk: assign vertex locations according to input attrib index
1408- nvk: lower io to temps to avoid output reads in vertex shaders
1409- nvk: handle NULL to destroy descriptor pool
1410- nvk: add basic primitive restart
1411- nvk: fix copy lower address extraction
1412- nvk: fix multiple pipelines failure allocation case.
1413- nvk: init dev->physical_device earlier.
1414- nvk/winsys: store device ptr into bo instead of ptr
1415- nvk: set the device fd
1416- nil: Fix image align and size constraints
1417- nvk: Report image alignments from NIL
1418- nouveau/winsys: allocate unique object handles across channels.
1419- nvk/nil: don't ask for compressed image kind
1420- nvk/barrier: handle host bit.
1421- nvk: add compute support for ampere
1422- nvk: add min_lod to spirv caps.
1423- nvk: fix r32_sint format support
1424- nvk: expose EXT_sampler_filter_minmax
1425- nvk: fix transform feedback crash when optimiser removes things.
1426- nvk: merge tess info between tcs/tes.
1427- nvk: introduce an optimisation loop.
1428- nvk: add support for D32_SFLOAT_S8_UINT
1429- nvk/query: fix push buffer size for copy pool results.
1430- nvk: init image fields for requirements
1431- nvk: handle alignments in device memory
1432- nvk/tess: don't emit patch control points in pipeline
1433- nvk: align geometry clip setting with nvc0
1434- nvk: fix independent color write masks.
1435- nvk: enable rgb32 texel buffer support
1436- nvk: enable EXT_depth_clip_control
1437- nvk: enable EXT_depth_clip_enable
1438- nvk: always sync internal cmd bufs for vma lifetimes.
1439- nouveau/winsys: add support for the vma bind interfaces
1440- nvk: Add support for sparse buffers
1441- nvk: Add support for sparse images
1442- nvk/queue: add support for syncobjs and sparse binds
1443- nvk: Handle pre-turing indirect buffers with sparse
1444- nvk: enable sparse features
1445- nvk: enable a bunch of external fence/semaphore bits
1446- nvk: enable sparse residency buffer on maxwell+
1447- nvk: add new internal bo allocation flag.
1448- docs: add two nvk exts to features.txt
1449- zink: use fprintf instead of printf to align the requirements warnings
1450- nvk: align sampler allocation counts with nvidia.
1451- zink: turn off threaded cpu access if not visible.
1452- nvk: add gart forced cmd pool side buffer.
1453- nvk: add cond render upload buffer.
1454- nvk: enable KHR_shader_clock.
1455- nvk: NOUVEAU_WS_BO_LOCAL is a trap.
1456- gallivm: drop unused info parameter
1457- llvmpipe/fs: drop cbuf 0 since it's lowered now.
1458- gallivm/nir: avoid using params->info
1459- llvmpipe/fs: move some tgsi checks in nir path to nir code.
1460- llvmpipe/cs: convert to using tgsi->nir
1461- llvmpipe/cs: drop tgsi for compute/mesh/task shader internals.
1462- lavapipe: use vk_buffer common code.
1463- lavapipe: use vk_buffer_range common code.
1464- llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi
1465- llvmpipe/analyse: drop TGSI path.
1466- llvmpipe/fs: start using nir info in some places.
1467- llvmpipe/fs: drop the simple shader logic
1468- llvmpipe/fs: rewrite output finding using nir.
1469- nvk: add build_id linker argument.
1470- nir/gather: add support for fbfetch and bindless image loads.
1471- llvmpipe/cs: further cleanups after tgsi removal.
1472- llvmpipe: move to nir lowering for fquantize2f16
1473- rusticl: don't store ptrs to nir_variables across opt passes.
1474- llvmpipe: enable f16 paths on aarch64.
1475- clover/llvm: move to modern pass manager.
1476- nir: use a _clone so users calling their variable clone don't get a warning
1477- nir: rename nir_inline_functions.c to nir_functions.c
1478- nir: use nir_function_instructions_pass in the inliner.
1479- nir: move the libclc lowering over to functions file.
1480- nir/functions: use helper to get function for a name.
1481- nir/functions: put link state into a struct
1482- nir/functions: move linker pass to new helper
1483- nir: add nir function clone
1484- nir: don't inline linked functions
1485- gallivm/nir: split prepasses out to make per-function work easier.
1486- gallivm: rework translator to allow per-impl work.
1487- spirv/nir: parse function control and store in nir.
1488- nir: add driver_functions option to avoid inlining.
1489- nir: add a function usage tracker
1490- rusticl: use cleanup funcs
1491- gallivm: add support for function calling
1492- llvmpipe/cs: add support for function calls.
1493- llvmpipe: enable driver functions.
1494- radv: don't emit event code on video queues.
1495- spirv: use a pointer sized int type for opencl event_t
1496- clover: fix parameter arguments since recent translator changes.
1497- radv/video: take db alignment into account when allocating images.
1498- ac,radeonsi: move vcn enc structs to common
1499- ac,radeonsi: move vcn enc av1 default cdf file to common
1500- nir: add a deref slot counter that handles compact
1501- llvmpipe/linear: drop tgsi path.
1502- gallivm: drop tgsi aos paths.
1503- llvmpipe/nir: call gather info to update inputs read properly
1504- llvmpipe/fs: start converting interp/input paths to nir.
1505- llvmpipe/fs: start converting dervied state to nir based.
1506- llvmpipe/linear: convert to using nir for output.
1507- llvmpipe/linear: move to nir inputs
1508- draw/mesh: reset some user state values on mesh draws.
1509- llvmpipe/fs: fix regression in sample mask handling from tgsi removal.
1510- llvmpipe: reset viewport_index_slot in fb bind
1511- llvmpipe/cs: migrate to generic jit texture from pipe code.
1512- llvmpipe/cs: migrate cs image handle to common jit code.
1513- lavapipe: fix some whitespace in advance of other changes.
1514- lavapipe: fix subresource layers asserts
1515- lavapipe: support host image copying on compressed texture formats
1516- llvmpipe: don't create texture functions for planar textures.
1517- lavapipe: don't emit blit src/dst for subsampled formats.
1518- llvmpipe: don't support planar formats for buffers.
1519- lavapipe: convert sampler to use vk base class.
1520- lavapipe: cleanup copy code to use a local region variable.
1521- lavapipe: start introducing planes structure.
1522- lavapipe: allocate image and image view planes.
1523- lavapipe: handle planes in copies
1524- lavapipe: handle planes in get image sub resource
1525- lavapipe: add descriptor sets bindings for planar images
1526- lavapipe: handle planes in texture lowering.
1527- lavapipe: expose planar ycbcr formats and new ycbcr features
1528- lavapipe + docs: update ycbcr extension enables.
1529- intel-clc: avoid using spirv-linker.
1530
1531David Heidelberg (82):
1532
1533- ci/freedreno: update a530 flakes
1534- ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI
1535- ci: update kernel to 6.3.13
1536- ci/freedreno: add fails introduced by upreving to 6.3.13
1537- Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts"
1538- ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13
1539- ci/lima: EGL testing was disabled when fp16 fail was removed
1540- ci/freedreno: fix unexpectedpass flake on a630
1541- ci/freedreno: add another a530 flakes
1542- ci: add quirk for GitLab assuming changes is always true for scheduled runs
1543- ci/microsoft: when re-enabling Windows Farm, always run the container
1544- ci/freedreno: add a530 flakes, remove one fail which recently started passing
1545- ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook
1546- ci/freedreno: cover all texture gather flakes
1547- ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4
1548- ci/farms: always compare the code against main repository
1549- Revert "ci/farms: always compare the code against main repository"
1550- ci/kernel: add amd patch to prevent crashes when starting X
1551- ci/kdl: remove extra-verbose ls command
1552- ci/nouveau: add 20 minutes timeout to gk20a and align gm20b
1553- ci/freedreno: document another mapbuffer flake on a530
1554- ci/amd: fix timeouting radeonsi-raven-va-full job
1555- docs/ci: default to port 80 for the caching proxy
1556- docs/ci: update to systemd and used version of the trace for testing
1557- docs/ci: remove default nginx config, which we don't need for proxy
1558- bin/ci: handle errors more gracefully in update_traces_checksum script
1559- ci/freedreno: document another flakes on Adreno 530
1560- ci: add perfetto into mesa git-cache
1561- ci/panfrost: re-enable t760 and t860 traces as a nightly job
1562- CI: Re-enable G52 Vulkan testing
1563- ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1
1564- ci/zink: Add flake seen in the wild
1565- ci/build: limit debian-build-testing to 30 minutes
1566- ci/amd: add glx\@glx-visuals-depth flake to raven
1567- ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530
1568- ci/farms: enabled Microsoft job only when conditions are met
1569- ci/deqp: really remove the uncompressed results.csv file
1570- ci/baremetal: do not install curl, it's already there
1571- ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only
1572- ci/freedreno: document another a530 flake batch
1573- ci: remove LAVA prefix from variables which can be used also elsewhere
1574- ci/zink: drop a630, which we currently have very low amount available
1575- ci/freedreno: the tag belongs to the apq8016 only
1576- ci/freedreno: switch references, the farm-rules takes care about this
1577- ci/freedreno: handle disabling farm properly for each FD/Collabora farm
1578- ci/freedreno: another batch of Adreno 530 flakes
1579- gtest: backport ansi color fix
1580- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
1581- panfrost/ci: revert Disable T720
1582- ci/piglit: add extra space on top to prevent single quote getting into URL
1583- ci/freedreno: There is only one King of Town.
1584- ci: switch to 6.4 kernel, improving Adreno 660 reliability
1585- ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout
1586- ci/panfrost: add G52 flakes
1587- ci/panfrost: we have enough device, parallelize Vulkan tests
1588- ci/virgl: flakes in functional.draw_buffers_indexed group
1589- ci/freedreno: add another a530 flake
1590- ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake
1591- panvk: architecture isn't invalid, just unsupported
1592- panvk: catch unsupported arch in the panvk_physical_device_init
1593- Revert "ci: disable a660 jobs"
1594- docs: add LAVA farm informations
1595- ci: disable Google Freedreno farm, currently timeouting on all jobs
1596- Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs"
1597- ci/farms: no need to check RUNNER_TAG for Collabora farm
1598- ci/traces: extend no-output timeout by 5 minutes
1599- ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake
1600- iris: do not mention specifically clover for OpenCL support
1601- ci/freedreno: disable broke cheza (Adreno 630) runners
1602- ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data
1603- ci/bare-metal: drop unused imports, sort, use SPDX license
1604- ci/lima: farm is down, disable for now
1605- ci: do not report failed job when flakes reporting fails
1606- ci/freedreno: re-enable Cheza (Adreno 630) runners
1607- ci/traces: upload only missing trace images
1608- ci/traces: keep images for every job except the performance testing
1609- ci/traces: rename upload function to reflect it works with S3
1610- ci/traces: always export piglit EXTRA_ARGS
1611- ci: ci_marge_queue.py
1612- ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline
1613- ci/freedreno: disable Adreno 660 Vulkan pre-merge
1614- ci/traces: drop the freedoom-phase2-gl-high.trace
1615
1616David Rosca (70):
1617
1618- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
1619- frontends/va: Skip processing buffers already converted with EFC
1620- frontends/va: Don't use EFC with scaling or filtering enabled
1621- radeonsi/vcn: Don't use chroma in AV1 encode with RGB input
1622- frontends/va: Parse H264 SPS for video signal parameters
1623- frontends/va: Parse HEVC SPS for video signal parameters
1624- frontends/va: Add postproc support for converting to full range
1625- radeonsi/vcn: Set H264 video signal parameters in bitstream
1626- radeonsi/vcn: Set HEVC video signal parameters in bitstream
1627- radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1
1628- radeonsi/vcn: Fix setting color range in AV1 bitstream
1629- gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix
1630- gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv
1631- gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv
1632- frontends/va: Add YUV420 to NV12 postproc conversion
1633- gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer
1634- gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave
1635- frontends/va: Also map VAImageBufferType for reading
1636- frontends/va: Alloc interlaced surface for interlaced pics
1637- frontends/vdpau: Alloc interlaced surface for interlaced pics
1638- radeonsi: Don't prefer interlaced for video decode
1639- ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test
1640- frontends/va: Don't allow multi-plane derive without driver support
1641- frontends/va: Init view_resources array in vlVaPut/GetImage
1642- radeonsi: Copy all planes with multi-plane staging textures
1643- radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP
1644- ci/amd: Skip all VAAPI tests that creates too many huge surfaces
1645- radeonsi/vcn: Update rate control when framerate changes with HEVC
1646- frontends/va: Ignore requested size when creating VAEncCodedBufferType
1647- gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer
1648- radeonsi/vcn: Fix leaking fences in decode
1649- gallium/auxiliary/vl: Add BT.709 full csc matrix
1650- frontends/va: Set csc matrix in postproc
1651- gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer
1652- frontends/va: Add BT.709 as supported postproc color standard
1653- Revert "radeonsi/vcn: add an exception of field case for h264 decoding"
1654- gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter
1655- gallium/auxiliary: Fix util_compute_blit half texel offset with scaling
1656- gallium/auxiliary/vl: Map range when updating constants
1657- gallium/auxiliary/vl: Clamp coordinates in compute shaders
1658- gallium/auxiliary/vl: Support chroma sample location in compute shaders
1659- frontends/va: Support chroma sample location in postproc
1660- frontends/va: Flush after unmapping VAImageBufferType
1661- frontends/va: Parse chroma sample location in H264/HEVC SPS
1662- radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream
1663- radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input
1664- frontends/va: Track surfaces in context
1665- frontends/va: Destroy fences when destroying surface or context
1666- radeonsi/vcn: Implement destroy_fence vfunc
1667- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
1668- frontends/va: Set default rate control values once when creating encoder
1669- gallium/auxiliary/vl: Add RGB to YUV compute shader
1670- gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader
1671- gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing
1672- gallium/auxiliary/vl: Only map the shader constants buffer in render
1673- frontends/va: Add High Quality preset mode
1674- radeonsi/vcn: Add High Quality encoding preset for AV1
1675- radeonsi: Fix plane size in si_copy_multi_plane_texture
1676- frontends/va: Implement vaMapBuffer2
1677- frontends/va: Fix locking in vlVaBeginPicture
1678- frontends/va: Parse H264 SPS for max_num_reorder_frames
1679- util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes
1680- frontends/va: Fix parsing packed headers without emulation bytes
1681- radeonsi/vcn: Add encode support for H264 B-frames
1682- frontends/va: Map decoder and postproc surfaces for reading
1683- radeonsi: Fix offset for linear surfaces on GFX < 9
1684- gallium/auxiliary/vl: Fix coordinates clamp in compute shaders
1685- gallium/auxiliary: Fix coordinates clamp in util_compute_blit
1686- gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane
1687- util/rbsp: Fill bits twice if reading more than 16 bits
1688
1689Derek Foreman (2):
1690
1691- vulkan/wsi: Allow binding presentation_timing when software rendering
1692- vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt
1693
1694Dmitry Baryshkov (3):
1695
1696- gallium: move kmsro definition to the bottom of the file
1697- gallium: unbreak kmsro/freedreno case
1698- tu: Pass real size of prime buffers to allocator
1699
1700Dmitry Osipenko (3):
1701
1702- util/cache_test: Re-add test for disabled cache
1703- util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT
1704- util/cache_test: Add test for get/put() with disabled cache
1705
1706Dor Askayo (1):
1707
1708- nouveau: add exported GEM handles to the global list
1709
1710Dr. David Alan Gilbert (6):
1711
1712- rusticl/core: Add profiling time storage (queued) to event
1713- rusticl: Wire the 'queued' profiling time up
1714- rusticl: Wire the 'submit' profiling time up
1715- rusticl: Wrap pipe queries
1716- rusticl: Wrap pipe query reads
1717- rusticl: Wire the 'start' and 'end' profilng times up
1718
1719Dylan Baker (4):
1720
1721- VERSION: bump to 23.3.0-devel
1722- docs: Update release calendar for 23.2.0-rc1
1723- docs: truncate feature list for 23.3-devel
1724- meson: use a single dependency call for lua
1725
1726Echo J (5):
1727
1728- nvk: Fix some cast defines
1729- nvk: Add A8B8G8R8_*_PACK32 format support
1730- nvk: Add bufferImageGranularity limit
1731- nvk: Reset offset value in ResetDescriptorPool
1732- nil: Add A4B4G4R4_UNORM format support
1733
1734Emma Anholt (111):
1735
1736- ci/radv: Clarify when the ANGLE GS failures started happening.
1737- ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection")
1738- ci/tgl: Improve the info for ANGLE's MSAA regression on TGL.
1739- ci/tu: Add more crash cases for the multithreading bugs caught on a630.
1740- ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too.
1741- ci/tu: Drop some xfails for !24086
1742- tu: Fix data race in userspace VMA management.
1743- ci/a5xx: Add another GPU hanging piglit test to the skips.
1744- Revert "ci: Disable nouveau CI"
1745- nvk: Avoid strict aliasing warning in the pushbuffer encoding.
1746- nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY.
1747- nvk: Fix unused result warnings in pushbuf resets.
1748- nvk: Remove duplicate (disabled) point sprite setup.
1749- nvk: Fix missing init of the stages to sync against.
1750- nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP.
1751- nouveau/winsys: Fix an undefined use in the error path.
1752- nvk: Quiet a compiler warning.
1753- nvk: Clean up redundant vendor checking for physical device creation.
1754- nvk: Add support for probing as a platform device.
1755- nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell.
1756- freedreno/a5xx: Fix border color structure size.
1757- freedreno/a5xx: Skip emitting unused texture descriptors for images.
1758- freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler.
1759- ci/freedreno: Drop a bunch of stale a530 xfails.
1760- ci/freedreno: Sort another a530 xfail with its friends.
1761- ci/freedreno: Update comments for some a530 xfails.
1762- ci/freedreno: Add some more db820c xfails.
1763- freedreno/devices: Move fibers_per_sp to the common info struct.
1764- freedreno/devices: Set num_sp_cores explicitly for pre-gen6.
1765- freedreno/a6xx: Move pvtmem allocation to ir3_gallium.
1766- freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs.
1767- freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function.
1768- freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly.
1769- freedreno/a5xx: Add private mem support.
1770- freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT
1771- ci/freedreno: Add a regression test for decoding a540 blob's compute shaders.
1772- freedreno: Fix crashdec pre-a6xx.
1773- freedreno/a5xx: Skip SSBO emit when none are enabled.
1774- vulkan/util: Make multialloc succeed with 0 allocations.
1775- turnip: Track the first/last subpass an attachment is used in.
1776- turnip: Skip emitting empty CP_COND_REG_EXEC.
1777- turnip: Save the renderpass's clear values in the cmdbuf state.
1778- turnip: Move gmem clears and loads to the first subpass that uses them.
1779- turnip: Move sysmem clears to the first subpass that uses them.
1780- ci/freedreno: Skip some tests on a5xx that destabilize other tests.
1781- freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads.
1782- ci/turnip: Add a660 VK coverage.
1783- disk_cache: Disable the "List" test for RO disk cache.
1784- blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too.
1785- intel/fs: Move defin/defout setup to the start of the loop.
1786- intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup.
1787- intel/fs: Simplify compute_start_end().
1788- ci/freedreno: Add another excessive-constlen UBO skip.
1789- ci/anv: Drop DEQP_VER:vk setting.
1790- ci/anv: Drop "-vk" from the job name.
1791- ci/anv: Add a manual full VK run for TGL.
1792- ci/anv: Add testing on JSL.
1793- freedreno: Build drm subdir before perfcntrs, which uses it.
1794- ci/intel: Add various updates from our nightly runs.
1795- ci/virgl: Disable virgl-iris-traces.
1796- ci/zink: Add a few updates for anv/tgl from the nightly runs.
1797- ci/fastboot: Use a case insensitive match for a fastboot line.
1798- ci/etnaviv: Skip some tests that hang the GPU and knock out other tests.
1799- ci/etnaviv: Drop some gc2k flakes that I think are resolved.
1800- ci/anv: Drop incorrect xfail addition for TGL
1801- ci/anv: Drop the 16bit.scalar.13 skip.
1802- ci/etnaviv: Minor xfail/flake polishing.
1803- ci/etnaviv: Skip a GLES2 test that times out the asan job.
1804- ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts.
1805- ci/docker: Clear the results file before starting a new deqp test run.
1806- ci/crocus: Add a related flake to a known one.
1807- ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake
1808- ci/crocus: Add known piglit flakes
1809- ci/hasvk: Add a bunch of new CTS border color fails.
1810- i915: Re-clang-format and enforce it in CI.
1811- i915: Print the relevant counts vs limits when throwing errors.
1812- i915: Don't log I915_DEBUG=fs output for blit shaders.
1813- i915: Save fragment program compile error messages in the fragment shader.
1814- i915: Do a test compile at glLinkShader() time.
1815- i915: Make exceeding tex indirect count fatal.
1816- i915: Use nir_group_loads() to reduce texture indirection phases.
1817- ci/crocus: Generalize the drawarrays-vertex-count flakes.
1818- ci/zink: Skip 3-minute-long glx-visuals timeouts.
1819- ci/zink: Skip dmat[34] op tests in general, as well
1820- ci/crocus: Disable flaky unvanquished-ultra trace
1821- nir/print: Decode system values in the variable declarations.
1822- ci/zink: Add a TGL flake that's showed up in nightlies recently.
1823- ci/radeonsi: Drop an xfail for vangogh.
1824- i915: Make I915_DEBUG=fs log shaders that fail to link due to CF.
1825- nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF.
1826- glsl: Remove lower_discard().
1827- ci/zink: Only test half of piglit pre-merge on anv.
1828- ci: Stop doing internal retries in bare-metal.
1829- ci/bare-metal: Drop the 2 vs 1 exit code from poe_run.
1830- ci/bare-metal: Default our boards to a 20-minute timeout for the whole job.
1831- ci/iris: Drop parallel on kbl piglit to 2.
1832- ci/freedreno: Fold a630_egl into a630_gl.
1833- ci/freedreno: Move skqp testing to a618.
1834- ci/zink: Cut zink-lvp coverage in half.
1835- ci/freedreno: Generalize the implicit_unmap timeouts.
1836- ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time.
1837- glx: Delete support for GLX_OML_swap_method.
1838- ci: drop skip for glx-swap-copy.
1839- dri: Drop a duplicate mesa vs pipe format table.
1840- docs/ci: Drop old instructions for farm disabling
1841- docs/ci: Add some links in the CI docs to how to track job flakes
1842- glsl: Remove int64 div/mod lowering.
1843- llvmpipe: Set nir_lower_dround_even.
1844- nir: Add nir_lower_dsign as 64-bit fsign lowering.
1845- glsl: Retire dround lowering.
1846- ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines.
1847
1848Eric Engestrom (194):
1849
1850- ci: avoid running hardware jobs if lint fails - now on LAVA too!
1851- ci: avoid running hardware jobs if lint fails - now on Windows too!
1852- ci: replace copy of nouveau rules with reference
1853- ci: drop leftover kernel configs
1854- ci: use !reference for scheduled_pipeline retry rule
1855- ci: add .llvmpipe-manual-rules and use it
1856- ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor
1857- ci: replace llvmpipe_file_list anchor with reference
1858- ci: replace softpipe_file_list anchor with reference
1859- ci: replace lavapipe_file_list anchor with reference
1860- ci: replace iris_file_list anchor with reference
1861- ci: replace radv_file_list anchor with reference
1862- ci: replace radeonsi_file_list anchor with reference
1863- ci: replace virgl_file_list anchor with reference
1864- ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml
1865- ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml
1866- ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml
1867- ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml
1868- ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml
1869- ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml
1870- ci: move amd files rules to src/amd/ci/gitlab-ci.yml
1871- ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml
1872- ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml
1873- ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml
1874- ci: move intel files rules to src/intel/ci/gitlab-ci.yml
1875- ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml
1876- ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml
1877- ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml
1878- ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml
1879- ci: delete install.tar after extracting it to avoid re-uploading it
1880- docs: add release notes for 23.1.4
1881- docs: add sha256sum for 23.1.4
1882- docs: update calendar for 23.1.4
1883- asahi: drop unused include paths
1884- ci/lint: deduplicate formatting check jobs
1885- ci/lint: also print a diff for rust format issues
1886- ci: allow hw jobs even if lint jobs fail for non-Marge pipelines
1887- ci: print rustfmt's version
1888- ci: print clang-format's version
1889- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match
1890- lavapipe/ci: use tighter changes: rules
1891- ci: add a 10min job timeout to formatting checks
1892- ci: reduce bare-metal retries of poe_run to only 3 attempts
1893- broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h)
1894- broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h)
1895- broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h)
1896- broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h)
1897- ci: add .core-rules to .gallium-core-rules
1898- ci: drop rule for non-existent src/include/
1899- docs: add release notes for 23.1.5
1900- docs: add sha256sum for 23.1.5
1901- docs: update calendar for 23.1.5
1902- ci: include some timing information in the git cache download script
1903- docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS
1904- docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that
1905- docs/ci: drop mention of build systems variants in the CI
1906- docs/ci: expand the description of test suites
1907- bin: add wrapper to run scripts in a python venv
1908- bin/ci/ci_run_n_monitor: use venv wrapper
1909- bin/ci/gitlab_gql: use venv wrapper
1910- bin/ci/update_traces_checksum: use venv wrapper
1911- bin/pick-ui: use venv wrapper
1912- ci: include mold in x86_64_test-base & rootfs images
1913- ci: use mold to build deqp
1914- zink/ci: set the default timeout for zink jobs to 30min instead of 1h
1915- egl: make _eglFilterConfigArray static
1916- egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper
1917- ci: build nvk
1918- ci: document max image tag length
1919- docs/radv: mark VK_EXT_tooling_info as implemented
1920- docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented
1921- git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it
1922- git-blame-ignore-revs: add radv formatting commit
1923- git-blame-ignore-revs: add pvr formatting commit
1924- meson: fix indentation
1925- docs/v3dv: mark direct display extensions as implemented
1926- ci: reorder vk drivers alphabetically in debian-vulkan job
1927- ci: build hasvk in debian-vulkan job
1928- ci/zink+radv: set a timeout of 2x the normal runtime
1929- amd/ci: drop duplicate test expectations
1930- panfrost: upcast uint8/uint16 before shifting them beyond their range
1931- ci/a530: document piglit flake
1932- docs: add release notes for 23.1.6
1933- docs: add sha256sum for 23.1.6
1934- docs: update calendar for 23.1.6
1935- docs: add one more 23.1.x release
1936- ci: rename \*.log to \*.txt to work around gitlab bug
1937- ci/freedreno: reuse freedreno_gl_file_list instead of re-definining it
1938- egl: bump extension string length
1939- vc4: drop duplicate .lower_ldexp
1940- zink: fix format in zink_make_{image,texture}_handle_resident()
1941- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
1942- v3dv: fix copy/pasted type of \`sample`
1943- v3dv: fix shader stage name in error message
1944- v3d/qpu: fix type of function argument
1945- ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_*
1946- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
1947- bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed
1948- ci: document farm rules
1949- ci/b2c: skip install.tar extraction if the tarball is not present
1950- ci/b2c: don't allow failures in test script preparation
1951- ci/b2c: assert that install folder is present whether or not the tarball was extracted
1952- ci/amd: split the polaris10 rules into one for each farm
1953- ci: skip containers & build jobs when disabling a farm
1954- docs: add release notes for 23.1.7
1955- docs: add sha256sum for 23.1.7
1956- docs: update calendar for 23.1.7
1957- docs: add one more 23.1.x release
1958- ci: taking igalia farm offline
1959- ci/b2c: drop logic to remove install.tar
1960- ci: drop clover leftover
1961- Revert "ci: taking igalia farm offline"
1962- bin/ci_run_n_monitor: print in which repo we're looking for the pipeline
1963- bin/ci_run_n_monitor: automatically pick MR pipelines when they exist
1964- ci: remove duplicate fork pipeline in MRs
1965- ci_run_n_monitor: add comment to explain "MR > fork" logic
1966- ci: don't run everything just because a farm gets re-enabled
1967- ci/windows: centralize definition of windows runners tags
1968- ci/windows: add windows docker runner tags to .windows-docker-vs2019
1969- ci/windows: drop build rules from test jobs
1970- ci: document which image tags need to be bumped when updating piglit
1971- ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64
1972- ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs
1973- ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled"
1974- ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink`
1975- docs: add release notes for 23.1.8
1976- docs: add sha256sum for 23.1.8
1977- docs: update calendar for 23.1.8
1978- docs: add another 23.1.x
1979- ci: limit build jobs to 30min so that they can retry when they go wrong
1980- docs: drop outdated and redundant note about the minimum meson version
1981- ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm
1982- ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes
1983- docs: add calendar for 23.3
1984- ci: unify container and build jobs rules
1985- docs/meson: drop mention that our meson is ready
1986- ci/docs: drop extra overwritten rules
1987- ci/zink+radv: document flake
1988- docs: document the merging process and what is allowed or not
1989- ci: drop unused shader-db clone + build from alpine image
1990- ci: drop unused shader-db clone + build from fedora image
1991- ci: move shader-db clone/build into its own script
1992- ci/deqp-runner: fix indentation
1993- ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code
1994- ci: fix shebang in build-deqp-runner.sh
1995- docs: add release notes for 23.1.9
1996- docs: add sha256sum for 23.1.9
1997- docs: update calendar for 23.1.9
1998- ci: drop unused ephemeral packages in alpine image
1999- docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant
2000- ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone
2001- pick-ui: use assignment expressions
2002- pick-ui: use more expressive variable names
2003- pick-ui: add \`Backport-to: XX.Y` nomination
2004- v3d/ci: move traces job to wayland
2005- ci: print deqp version in the job log
2006- ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo
2007- ci/b2c: use latest mesa-trigger image
2008- include/dri_interface.h: restore define mistakenly removed in !25587
2009- ci_run_n_monitor: dependency jobs must always be started
2010- util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported
2011- util/xmlconfig: inline datadir
2012- ci/b2c: change artifacts path to match baremetal and LAVA
2013- VERSION: bump for rc1
2014- .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c
2015- .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac
2016- .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288
2017- .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52
2018- VERSION: bump for 23.3.0-rc2
2019- .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec
2020- .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated
2021- .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb
2022- .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18
2023- .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500
2024- .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated
2025- VERSION: bump for 23.3.0-rc3
2026- .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1
2027- .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated
2028- .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee
2029- .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33
2030- .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated
2031- VERSION: bump for 23.3.0-rc4
2032- .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8
2033- egl/dri2: increase NUM_ATTRIBS to fit all the attributes
2034- .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d
2035- .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b
2036- VERSION: bump for 23.3.0-rc5
2037- .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab
2038- intel/perf: fix regex escaping
2039- intel/ci: fix .hasvk-manual-rules
2040- VERSION: bump for 23.3.0
2041- Revert "VERSION: bump for 23.3.0"
2042- docs: add release notes for 23.3.0
2043- Revert "docs: add release notes for 23.3.0"
2044
2045Erico Nunes (10):
2046
2047- lima/ppir: don't optimize loads with different block successors
2048- lima/ppir: convert to nir_legacy
2049- lima/gpir: switch to register intrinsics
2050- egl/drm: fix EGL_EXT_buffer_age with gbm contexts
2051- lima: fix plbu block stride calculation
2052- ci: disable lima LAVA lab for maintance
2053- Revert "ci: disable lima LAVA lab for maintance"
2054- v3dv: allow headless device without display device
2055- Revert "ci/lima: farm is down, disable for now"
2056- v3dv: Rework to remove drm authentication for wsi
2057
2058Erik Faye-Lund (30):
2059
2060- meson: report with_glvnd in summary
2061- docs: upgrade bootstrap to 5.3.1
2062- docs: expand mobile-menu without js
2063- panfrost: delete stale editorconfig file
2064- docs/panfrost: link to lima
2065- docs/panfrost: use code-blocks with wrapping for long blocks
2066- docs/panfrost: use math-role to denote powers of two
2067- docs: fix linkcheck
2068- docs: update a few links to https
2069- docs: update anchor for link
2070- docs: update link to git-wiki
2071- docs: link to upstream etnaviv
2072- docs: apply some trivial redirects
2073- docs: use doc-role when linking to lists article
2074- docs: keep up with intels ever-moving documentation
2075- docs: mark some redirects as allowed
2076- docs: only link to old docs from html
2077- docs: use html_static_path for static files
2078- ci/etnaviv: update ci expectation
2079- ci/etnaviv: allow failure on failing test
2080- zink: fix wording of warning
2081- ci/etnaviv: move failure to flake
2082- meson: add wayland-protocols from meson wrapdb
2083- util/xmlconfig: add an env-var for overriding drirc search dir
2084- meson: add src/util to the drirc search path
2085- docs/relnotes: remove cruft from end of lines
2086- docs/ci: escape at-symbols
2087- docs/relnotes: escape some at-symbols
2088- bin/gen_release_notes: escape at-symbols
2089- panfrost: use perf_debug instead of open-coding
2090
2091Faith Ekstrand (809):
2092
2093- nv50/ir: Convert to new-style NIR registers
2094- nv50/ir: Support vector movs
2095- intel/fs: Add support for new-style registers
2096- intel/vec4: Assume get_nir_dest() provides a sane write-mask
2097- intel/vec4: Add support for new-style registers
2098- intel: Switch to intrinsic-based registers
2099- intel/fs: Drop support for nir_register
2100- intel/vec4: Drop support for nir_register
2101- anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus
2102- nir: Properly handle divergence for load_reg
2103- nir/trivialize: Maintain divergence information
2104- nir/trivialize: Trivialize cross-block loads
2105- vc4: Convert to new-style NIR registers
2106- nir/schedule: Support load/store_reg
2107- broadcom/compiler: Convert to new-style NIR registers
2108- intel/fs: Use write masks from store_reg intrinsics
2109- intel/fs: Rework the overlapping mov/vec case
2110- intel/fs: Assume NIR is in SSA form
2111- nir: Add a backend_flags field to nir_tex_instr
2112- intel/fs: Add a parameter to speed up register spilling
2113- nir/builder: Allow tex helpers on image types
2114- nir/builder: Add a nir_txs_deref() helper
2115- vulkan: Add a core vk_buffer_view struct
2116- vulkan: Add a more direct way to use a NIR shader
2117- vulkan: Add a vk_query_pool base object
2118- vulkan: Add common vkCmdBegin/EndQuery wrappers
2119- vulkan/format: Add the remaining 1-plane YCbCr formats
2120- vulkan: Add a core vk_sampler struct
2121- nv50/nir: Lower to scratch AFTER optimization
2122- nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS*
2123- nouveau/nir: Implement support for compact arrays
2124- nouveau/codegen: Handle/indirect goes before sample index
2125- nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images
2126- nouveau/codegen: Don't convertSurfaceFormat for unknown formats
2127- nv50/ir: Run nir_divergence_analysis before out-of-SSA
2128- anv: Use vk_sampler
2129- anv: Use vk_buffer_view
2130- vulkan: Add init/finish helpers for vk_query_pool
2131- anv: Use vk_query_pool
2132- anv: Use the common versions of vkBegin/EndQuery()
2133- nir/builder: Don't assume we have compiler options
2134- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h"
2135- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h"
2136- vulkan: Use VkBufferUsageFlags2 in vk_buffer
2137- clang-format: Set ColumnLimit to 78
2138- nvk: Implement EnumerateInstanceVersion
2139- nvk: Add stub implementations of VkImage and VkImageView
2140- nvk: Add stub implementation of VkSampler
2141- nvk: Add a stub implementation of VkBuffer
2142- nvk: Implement VkDescriptorSetLayout
2143- nvk: Implement VkPipelineLayout
2144- nvk: Add initial descriptor set lowering
2145- nvk: Implement vkUpdateDescriptorSets
2146- nvk: Expose nvk_descriptor_stride_align_for_type
2147- nvk: Re-format descriptor set layouts
2148- nvk: Re-format pipeline layouts
2149- nvk: Re-format descriptor sets some more
2150- nvk/buffer: Take an offset in nvk_buffer_address
2151- nvk/buffer: Add a push_buffer_ref helper
2152- nvk/copy: Use nvk_buffer_address in CmdCopyBuffer
2153- nvk/image: Add image address helpers
2154- nvk/copy: Use nvk_image_base_address()
2155- nvk: Add an nvk_device_physical helper
2156- nvk: Add a skeleton for pipelines
2157- nvk: Re-arrange nvk_descriptor_set.h a bit
2158- nvk: Reformat nvk_nir_lower_descriptors
2159- nvk: Add a couple descriptor set address helpers
2160- nvk: Move nvk_cmd_pool cast definitions
2161- nvk: Rework whitespace in nvk_cmd_buffer.c
2162- nvk: Add a root descriptor table
2163- nvk: Fetch descriptor set addresses from the root table
2164- nvk: Re-arrange nir_lower_explicit_io a bit
2165- nvk: Lower load_global_constant_offset
2166- nvk: Drop image_view_init
2167- nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable
2168- nvk: Allow R32_UINT
2169- nvk: Mark nvk_push_descriptor_set_ref() inline
2170- nvk: Add a descriptor table data structure
2171- nvk: Copy in the nouveau TIC format table
2172- nvk/image_view: Reformat and fix Create/DestroyImageView
2173- nvk: Add an image descriptor table to the device
2174- nvk: Fill out TIC table entries for image views
2175- nvk: Set b->cursor when lowering image intrinsics
2176- nvk: Unify descriptor loading in lower_descriptors
2177- nvk: Re-format nvk_image_view.h a bit
2178- nvk: Re-format nvk_buffer.c a bit
2179- nvk: Add a stub implementation of buffer views
2180- nvk: Make texture descriptors a bit more acceptable to codegen
2181- nvk: GART os host-cache-coherent
2182- nvk: Reserve a null image descriptor
2183- nvk: Rework descriptor writes
2184- nouveau: Add stubs for an image layout library called NIL
2185- nil: Create images
2186- nil: Add the TIC format table from nouveau
2187- nil: Add a nil_view and code to fill out TIC entries
2188- nvk: Add an nvk_get_format helper
2189- nvk: Use helpers for push_ref
2190- nvk: Align arguments consistently in copy/blit code
2191- nvk: Move Fill/UpdateBuffer to nvk_cmd_copy
2192- Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable"
2193- nvk: Manually offset for array layers in copy/blit
2194- nvk: Convert to using NIL for image layout
2195- nvk: Re-indent image entrypoints
2196- nvk: Implement VkGetImageSubresourceLyout
2197- nvk: Reset and properly clean up command buffer upload areas
2198- nvk: Rework format features queries
2199- nvk: Add a more competent GetPhysicalDeviceImageFormatProperties
2200- nvk: Support compressed images in copy commands
2201- nvk: Drop vk_sync BO refs after push_submit
2202- nil: Drop miptail support for now
2203- nil: Don't minify image dimensions when setting up TIC
2204- nil: Refactor TIC image extent setup
2205- nil: Fix image array layer alignments
2206- nvk: Teture pool sizes are maximums not sizes
2207- nvk: Re-format nvk_sampler.c
2208- nvk: Implement samplers
2209- nil: Add a helper for filling out buffer TIC entries
2210- nvk: Move is_storage_image_format to nvk_format.c
2211- nvk: Implement buffer views
2212- nvk: Advertise KHR_dedicated_allocation
2213- nvk: Use the correct root descriptor table size for CmdDispatch
2214- nvk: Add support for dynamic buffers
2215- nvk: Better advertise image format features
2216- nvk: Advertise descriptor array indexing
2217- nvk: Advertise non-zero descriptor set limits
2218- nvk: Use a descriptor type instead of a hand-rolled thing
2219- nvk: Handle cube storage images properly
2220- nvk: Load the requested descriptor size
2221- nvk: Implement push constants
2222- nvk: Properly indent a comment
2223- nvk: Fix descriptor offset alignment
2224- nvk: Use a switch for descriptor types in load_descriptor
2225- nvk: Support inline uniform blocks
2226- nvk: Delete the storage TIC in nvk_image_view_destroy
2227- nvk: Assert that we don't double-free descriptors
2228- nvk: Initial vkCmdClearImage support
2229- nvk: Unconditionally zero image format properties
2230- nvk: No-op sparse image format properties
2231- nvk: Advertise minUniformBufferOffsetAlignment
2232- nvk: Rework OOM handling for descriptor pools
2233- nvk: Bind immutable samplers on descriptor set creation
2234- nvk: Padd shader BOs by 4K to avoid I-cache overflow
2235- nvk: Include nvk_private.h in everything
2236- nvk: Make image/buffer address helpers const
2237- nouveau/push: Add a P_INLINE_FLOAT helper
2238- nvk: Init WSI after setting up supported_sync_types
2239- nouveau/parser: Fix an integer overflow and a typo
2240- nouveau/parser: Properly dump most arrays used by 3D
2241- nouveau/parser: Better dump float data
2242- nouveau/parser: Handle arrays properly in P_IMMD()
2243- nouveau/push: Make P_IMMD more versatile
2244- nouveau: Null terminate the debug flag list
2245- nouveau: Generate 3D headers
2246- nvk: Add graphics state to command buffers
2247- nvk: Split pipeline binding into helpers
2248- nvk: Switch to vk_pipeline_shader_stage_to_nir
2249- nvk: Don't free the NIR in nvk_compile_nir
2250- nvk: Add an nvk_shader_address helper
2251- nvk: Free pipeline shader BOs
2252- nvk: Expose pipeline alloc/free functions
2253- nvk: Make shader_upload take an nvk_device
2254- nvk/shader: Assign I/O locations and gather info
2255- nvk/shader: Populate headers for vertex and fragment shaders
2256- nvk: Add a nvk_cmd_buffer_device() helper
2257- nvk: Import 3D context init code from nouveau
2258- nil/format: Add helpers for render formats
2259- nvk: Add boilerplate for Begin/EndRendering
2260- nvk: Misc. additional state setup
2261- nvk: Emit dynamic graphics state
2262- nvk: Implement push constants and descriptors for graphics
2263- nouveau: Add CPU push buffers
2264- nvk: Graphics pipelines
2265- nvk: Implement vkCmdDraw()
2266- nvk: Color attachments clears via image clears
2267- vulkan/meta: Add the start of a meta framework
2268- vulkan/meta: Add an object tracking list
2269- vulkan/meta: Add a concept of rect pipelines
2270- vulkan/meta: Implement attachment clears
2271- vulkan/meta: Implement start-of-rendering clears
2272- vulkan/meta: Add implementations of Clear*Image
2273- nvk: Add an attachment format even for secondaries
2274- nvk: Add an addr field to nvk_buffer
2275- nvk: Expose a bind_vertex_buffer helper
2276- nvk: Use vk_meta for CmdClearAttachments
2277- nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end()
2278- nvk: Enable all the dynamic state features
2279- nouveau: Fix pushbuf ref reset for user command buffers
2280- nvk: add linear image creation support.
2281- nvk: Use max alignment for descriptor pool sizes
2282- nil: Switch to using the new headers for TIC entries
2283- nvk: Use meta for CmdClear*Image
2284- nvk: Zero client memory objects
2285- nvk: Bind texture and sampler header pools for 3D
2286- nvk: Use the new headers for samplers
2287- nvk: Implement nir_intrinsic_load_frag_coord
2288- vulkan/meta_clear: Populate VkRenderingInfo::renderArea
2289- nvk: Don't assert when there are no attachments
2290- nvk: Track and reference all device memory objects
2291- vulkan: Allow scissors or viewports to be set without counts
2292- nvk/copy: Mape bpp part of nouveau_copy_buffer
2293- nvk: Implement copies for D24_UNORM_S8_UINT images
2294- nvk: Drop sample locations structs
2295- nvk/meta: Save and restore VI state
2296- nvk: Re-initialize dynamic_graphics_state.vi when recycling
2297- nvk: Move the vertex format table into nvk_format.h
2298- nvk: Advertise vertex buffer format featues
2299- nvk: Clean up try_create_physical_device error handling
2300- nouveau/parser: Dump more fields as float
2301- nvk: Depth bounds need fui()
2302- nouveau: Add class information to nouveau_ws_device
2303- nil: Properly depend on nouveau winsys and nvidia-headers
2304- nil: Use nvidia headers for texture format enums
2305- nil: Use the nvidia headers for render target format enums
2306- nil: Use nvidia headers for ZS format enums
2307- nil: Rename rt to czt in the format info struct
2308- nil: Rename rendering to color_target
2309- nil: Re-introduce the format capabilities
2310- nil: Add more format support helpers
2311- nvk: Advertise more format features
2312- nvk: Clear dynamic state dirty after flushing it all
2313- vulkan/meta: Make stencil reference dynamic for clears
2314- nvk: Depth buffers don't allow Z-tiling
2315- nvk: Disable sparse Z on Maxwell+
2316- nil: Compute PTE kinds and tile modes for images
2317- nouveau: Add a function to allocate a tiled buffer
2318- nvk: Add internal helpers for device memory allocation
2319- nvk: Do internal dedicated allocations for ZS images
2320- nvk: Fix depth/stencil render pass clears
2321- nvk: Fix viewport Z scale
2322- nvk: Enable two-sided stencil
2323- nvk: Flip the front-face setting
2324- nvk: Advertise depth/stencil support
2325- nvk: Don't destroy NULL descriptor pool BOs
2326- nvk: Call nir_lower_input_attachments
2327- nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly
2328- nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails
2329- nil: Add a PTE kind for Z32_FLOAT
2330- nvk: Add nvk_queue_init/finish() helpers
2331- nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT
2332- nvk: Re-flow a couple function prototypes
2333- nvk: Assert samples == 1
2334- nvk: Allocate descriptors for input attachments
2335- nvk: Wire up early z and post depth coverage
2336- nvk: Save/restore push constants around meta ops
2337- nouveau/parser: Add array and float tags for clear values
2338- nvk: Use hardware clears for attachment clears
2339- nvk: Add image_view_init/finish functions
2340- nvk: Implement vkCmdClear*Image directly
2341- nvk: Use a UINT format to clear non-renderable images
2342- nvk: Don't advertise tiling on non-power-of-two formats
2343- nvk: Fix max anisotropy
2344- nvk: Assert on CmdExecuteCommands
2345- nvk: VkSamplerCreateInfo::mipLodBias is signed
2346- nvk: Fix border color alpha
2347- nil/format: Depth/stencil formats appear as red
2348- nil: Fix max mip level
2349- nil: Fix nonnormalized coordinates
2350- nvk: Set up clip and cull distances
2351- nvk: Fix dynamic buffer descriptor copies
2352- nvk: Inline nouveau_copy_linear
2353- nvk/copy: Rename push to p
2354- nvk/blit: Rename push to p
2355- nvk/dispatch: Rename push to p
2356- nvk: Drop most buffer tracking
2357- nvk: Rework TLS/SLM and image/sampler table handling
2358- nvk: Invalidate texture header and sampler caches each submit
2359- nvk/sampler: Free descriptor table entries
2360- nvk: Rework nvk_descriptor_table_add/remove
2361- nvk: Implement descriptor table growing
2362- nvk: Zero unused descriptors
2363- nvk: Add some asserts for nv50 compiler image restrictions
2364- nvk: Update to the new command buffer infrastructure
2365- nvk: Split nvk_queue into its own file
2366- nvk: Start every command buffer with a nop
2367- nvk: Initialize fixed draw/default state once
2368- nouveau/parser: Convert to mako
2369- nouveau/parser: Use more idiomatic python
2370- nouveau/parser: Put the dump helpers in C files
2371- nvk: Use f for extension features
2372- nvk: Drop a TODO
2373- nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears
2374- nvk: Increase the graphics pipeline push space
2375- nil: Don't claim texture support for 2-bit SNORM
2376- nouveau/push: Fix a void pointer arithmetic bug
2377- nouveau/parser: Add more arrays
2378- nouveau/mme: Add basic structures for the Turing+ MME
2379- nouveau/mme: Add isaspec XML for the Turing+ MME
2380- nouveau/mme: Add an assembler and disassembler for the Turring+ MME
2381- nouveau/mme: Add a builder for the Turing+ MME
2382- nouveau/mme: Add a tiny simulator for the Turing+ MME
2383- nouveau/mme: Add an isaspec-based dumper
2384- nouveau/mme: Make the winsys headers C++ safe
2385- nouveau/mme: Add unit tests for the Turing+ MME simulator
2386- nvk: Add MME infrastructure
2387- nvk: Use MME for clears
2388- nouveau/mme: Add helper macros for setting fields
2389- nvk: Use MME for vkCmdDraw[Indexed]()
2390- nvk: Implement vkCmdDraw[Indexed]Indirect()
2391- nvk: Use p for the nouveau_ws_push_buffer in zero_vram
2392- nouveau: Add an nv_push struct
2393- nouveau: Rename the fields of vk_push
2394- nouveau: Move nv_push and helpers to their own header
2395- nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA
2396- nouveau: Move push validate to nv_push.c
2397- nouveau: Move push dumping to nv_push.c
2398- nvk: Use nv_push directly for graphics pipelines
2399- nouveau: Add a nouveau_ws_bo_new_mapped helper
2400- nvk: Use bo_new_mapped for the zero page
2401- nvk: Always allocate empty_push
2402- nvk: Move queue_sumbit to nvk_queue_drm_nouveau.c
2403- nvk: Submit pushbufs directly
2404- nvk: Use a regular BO for the empty push
2405- nvk: Use a regular BO for the queue state push
2406- nvk: Add an nvk_queue_submit_simple helper
2407- nvk: Initialize the queue later in device setup
2408- nvk: Use submit_simple for draw state init
2409- nvk: Use queue_submit_simple for zero_vram
2410- nvk: Break nvk_cmd_pool into its own file
2411- nvk: Use cmd instead of cmd_buffer
2412- nvk: Add BO recycling to the command pool
2413- nvk: Return VkResult from nvk_cmd_buffer_upload_alloc
2414- nvk: memcpy root descriptors for compute instead of doing a DMA
2415- nvk: Fully populate QMDs before uploading
2416- nvk: Constant buffer alignment is actually 64B
2417- nvk: Rework side-band data upload
2418- nvk: Add an nvk_cmd_buffer_push helper
2419- nvk: Add an nvk_cmd_buffer_ref_bo helper
2420- nvk: Allocate upload buffers from the command pool
2421- nvk: Use nvk_cmd_bo for push bufs
2422- nvk: Implement vkCmdExecuteCommands()
2423- nvk: Remove remaining references to nouveau_push.h
2424- nouveau: Use DRM interfaces directly in MME tests
2425- nouveau: Drop nouveau_ws_push
2426- nvk: Re-indent vk_instance.c
2427- nvk: Use vk_object_zalloc/free for descriptor pools/sets
2428- nvk: Fix up whitespace in nvk_descriptor_set.c
2429- nvk: Implement VK_KHR_push_descriptor
2430- nvk: Reference descriptor set layouts in the sets themselves
2431- nvk: Embed a nv_device_info in nvk_physical_device
2432- nvk: Add an nvk_queue_submit wrapper
2433- nvk: Also store the push BO map in nvk_queue_state
2434- nvk: Bring back push sync and dumping
2435- nvk: drop nvk_nir.h
2436- nvk: Add lowering for load_global_constant_bounded
2437- nvk: Properly implement robustBufferAccess
2438- vulkan/meta: Add key types
2439- vulkan/meta: Add a helper for image view types
2440- vulkan/meta: Add a create_sampler helper
2441- vulkan/meta: Fixes for clear
2442- vulkan/meta: Implement vkCmdBlitImage()
2443- nvk: Support load_layer_id
2444- nvk/meta: Save/restore descriptor set 0
2445- nvk: Use meta for doing blits with the 3D hardware
2446- nvk: WFI in pipeline barriers
2447- util/vma: Allow initializing zero-size heaps
2448- nvk: Rework nvk_queue_submit_simple()
2449- nvk: Add a heap data structure
2450- nvk: Return a VkResult from nvk_shader_upload()
2451- nvk: Add a shader heap to nvk_device
2452- nvk: Allocate shaders from a heap
2453- nvk: Rework whitespace in nvk_device_memory.c
2454- nvk: Style fixes in nvk_physical_device.c
2455- nvk: Reset semaphore syncs on wait
2456- nvk/wsi: Style fixes
2457- nvk/wsi: Use the common present implementation
2458- nouveau/parser: Parse all fields in each method
2459- nvk: Add a query pool object
2460- nvk: Implement timestamp queries
2461- nvk: Implement pipeline statistics and occlusion queries
2462- nouveau/mme: Allow ZERO as the destinatio nof mme_load_to
2463- nouveau/mme: Assert on OOB registers
2464- nouveau/mme: Add support for freeing registers
2465- nouveau/mme: Add a couple helpers for working 64-bit addresses
2466- nouveau/mme: Add a helper for MME_DMA_READ_FIFOED
2467- nvk: Use mme_tu104_read_fifoed()
2468- nvk: Implement vkCmdCopyQueryPoolResults()
2469- nvk: Handle large command buffer uploads better
2470- nvk: Use a normal DMA for CmdUpdateBuffer
2471- nouveau/parser: Handle 6F methods
2472- nvk: Use mme_load_addr64()
2473- nvk: Use poll for BO waits
2474- nvk: Events
2475- nvk: Don't crash if we fail to allocate a push BO
2476- nvk: Stop leaking command pool BOs
2477- nvk: Enable VK_KHR_create_renderpass2
2478- nvk: Advertise VK_KHR_imageless_framebuffer
2479- nvk: Flush the current pushbuf before allocating a new one
2480- nvk: Advertise VK_KHR_separate_depth_stencil_layout
2481- nvk: Tell WSI we don't support legacy scanout
2482- nouveau: Add PCI information to nv_device_info
2483- nvk: Implement VK_EXT_pci_bus_info
2484- nvk: Bind 3D images as 3D for clears
2485- nvk: Support copies between 3D and 2D images
2486- nil: Add a helper for getting 2D views of 3D images
2487- nvk: Support 2D views of 3D images
2488- nvk: Advertise VK_KHR_maintenance1
2489- nvk: Use 2D array views for 3D storage images
2490- nil: Fix include guards in nil_image.h
2491- nvk: Advertise custom border color features
2492- vulkan: Add a helper for swizzling color values
2493- nvk: Implement VK_EXT_border_color_swizzle
2494- nvk: Advertise VK_EXT_extended_dynamic_state3
2495- nvk: Move more states to dynamic
2496- nvk: Advertise VK_KHR_storage_buffer_storage_class
2497- nvk: Add a helper for pushing descriptors
2498- nouveau/headers: Add generated headers to dependencies
2499- nvk: Implement VK_EXT/KHR_buffer_device_address
2500- nvk: Break the guts of CmdDispatch into a helper
2501- nvk: Implement DispatchIndirect
2502- nouveau/mme: Add a mul64 helper
2503- nvk: Implement CS invocations statistics queries
2504- nil: Use ONE for the anixotropic coarse spread function
2505- nil: Properly support MSAA
2506- nil: Add an offset4d struct and some helpers
2507- nouveau/parser: Sort METHOD_ARRAY_SIZES
2508- nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS
2509- nvk: Stop asserting on MSAA
2510- nvk: Handle zero color attachments better
2511- nvk: Handle multisampled render targets properly
2512- nvk: Support copies of MSAA images
2513- nvk: Use the right view format for stencil texturing
2514- nvk: Pass through a shader key for fragment shaders and MSAA
2515- nvk: Set correct multisample regs for graphics pipelines
2516- nvk: Stop creating a new upload BO every time
2517- nvk: Fill out sample locations on Maxwell B+
2518- vulkan/meta: Bind whole LODs of 3D blit destinations
2519- vulkan/meta: Add a helper for building texture ops
2520- vulkan/meta: Break the guts of blit into a helper
2521- vulkan/meta: Support writing stencil as iterative discard
2522- vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c
2523- vulkan/meta: Add support for MSAA resolves
2524- nvk/meta: Fix restore for descriptor set 0
2525- nvk: Use meta for MSAA resolves
2526- nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy)
2527- nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts
2528- nvk: Advertise MSAA via image format properties
2529- nvk: Advertise VK_KHR_depth_stencil_resolve
2530- nvk: Assert that descriptor buffer access stays in-bounds
2531- nvk: Add a bo size to nvk_descriptor_set
2532- nvk/format: Style fix for VkFormatProperties3KHR
2533- nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers
2534- nvk: Add a devenv ICD json file
2535- nvk: Advertise EXT_vertex_attribute_divisor
2536- nvk: Lower image_size to txs
2537- nvk: Fix a comment
2538- nvk: Add an nvk_buffer_addr_range helper
2539- nvk: Use nvk_buffer_addr_range for buffer descriptors
2540- nvk: Re-order Vulkan 1.0 feature bits
2541- nvk: Enable inheritedQueries
2542- nvk: Enable VK_EXT_provoking_vertex
2543- nvk: Advertise samplerMirrorClampToEdge via 1.2 features
2544- nvk: Advertise VK_KHR_bind_memory2
2545- nvk: Enable KHR_dynamic_rendering
2546- nvk: Advertise KHR_uniform_buffer_standard_layout
2547- nvk: Advertise EXT_index_type_uint8
2548- nvk: Advertise VK_EXT_separate_stencil_usage
2549- nvk: Capitalize NVK in user exposed strings
2550- nvk: Rename grid_size to group_count
2551- nvk: Lower load_num_workgroups ourselves
2552- nvk: Drop block_size from the root descriptor table
2553- nvk: Add a helper for loading resource_index-based descriptors
2554- nvk: Set maxMemoryAllocationCount
2555- nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new()
2556- nouveau/winsys: Add an info to nouveau_ws_device
2557- nouveau/winsys: Move device type into nv_device_info
2558- nouveau/nil: Take an nv_device_info for image functions
2559- nouveau/nil: Use nv_device_info for format queries
2560- nouveau/mme: Invoke SET_OBJECT in the tests
2561- nouveau/mme: Make alu_op_to_str static
2562- nouveau/mme: Move mme_value into its own header
2563- nouveau/mme: Add a mme_reg_alloc struct
2564- nouveau/mme: Add an intermediate MME_ALU_OP enum
2565- nouveau/mme: Add an intermediate MME_CMP_OP enum
2566- nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources
2567- nouveau/mme: Make helpers less Turing specific
2568- nouveau/mme: Break the Turing builder guts into a separate header
2569- nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c
2570- nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c
2571- nouveau/mme: Drop the implicit_imm parameter from mme_alu_to()
2572- nouveau/mme: Move the cf_stack struct to mme_builder.h
2573- nouveau/mme: Prepare the builder for multiple GPU generations
2574- nouveau/mme: Take an nv_device_info in mme_builder_init
2575- Support immediates in MERGE
2576- Add add immediate optimizations
2577- nvk: Add support for contiguous heaps to nvk_heap
2578- nvk: Use a contiguous shader heap pre-Volta
2579- nvk: Disable indirect draw/dispatch and query copy MMEs for now
2580- nvk: Free a couple regs in nvk_mme_build_draw_*()
2581- nvk: Properly align root descriptor tables for pre-Pascal
2582- nvk: Compile all NIR before running codegen
2583- vulkan/meta: Insert a geometry shader when needed
2584- nvk: Use a GS for layerered rendering pre-MaxwellB
2585- nvk: Handle zero-size index and vertex buffers pre-Turing
2586- nvk: Cosmetic clean-ups to Create/DestroyDevice
2587- nil: Only choose a PTE kind for tiled images
2588- nouveau/mme: Fix is_int18 for negative numbers
2589- nouveau/mme: Don't swap x and y in mme_fermi_merge_to()
2590- nouveau/mme: Take a const nv_device_info in mme_builder_init
2591- nouveau/mme: Unify some of the test framework
2592- nouveau/mme: Add some generic builder tests
2593- nouveau/mme: Add builder tests for SUB
2594- nouveau/mme: Use a uint32_t for size in mme_fermi_bfe()
2595- nouveau/mme: nouveau/mme: Add builder tests for SLL and SRL
2596- nvk/drm: Take a byte offset/range in push_add_push
2597- nvk: Rework nvk_cmd_push a bit
2598- nvk: Add a helper for pushing indirect data
2599- nvk: Make some MME builder names more consistent
2600- nouveau/mme: Don't allow WaW dependencies in the same Turing instruction
2601- nvk: Reduce register pressure in nvk_mme_build_draw*()
2602- nouveau/push: Add an NV_PUSH_MAX_COUNT #define
2603- nvk: Implement Draw*Indirect on pre-Turing
2604- vulkan/meta: Use the new NIR texture helpers
2605- nvk: Add a build test for MMEs
2606- nvk: Don't over-size push descriptor sets
2607- nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA
2608- nvk: Bump init context batch size
2609- nouveau/mme: Fix nested while instructions on Turing+
2610- nouveau/mme: Add a helper to dump instructions
2611- nvk: Rework extension enables
2612- nvk: Rework features enables
2613- nvk: Advertise shaderImageGatherExtended
2614- nouveau/mme: Add a bfe helper
2615- nouveau/mme: Ensure that zero-initizlied mme_value is ZERO
2616- nvk: De-duplicate MME code for setting draw params
2617- nvk: Clamp viewport clip to max range
2618- nvk: Use the same lock for the submit and the memory objects list
2619- nvk: Advertise ICD/loader interface version 4
2620- nvk: Add instace WSI entrypoints
2621- nouveau/mme: Use ADD for ine with an immediate
2622- nouveau/mme: Fix while loops pre-Turing
2623- nvk: Add begin to mme_scratch
2624- nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW
2625- nouveau/mme: Add a helper for re-allocating registers
2626- nvk: Rework spill helpers and DRAW_COUNT spilling
2627- nvk: Spill DRAW_IDX pre-Turing
2628- nvk: Break the inner MME draw loop into a helper
2629- nvk: Increase the push runout to 512 dwords
2630- nil: Add a nil_image_for_level helper
2631- nil: Add an image_level_as_uncompressed helper
2632- nvk: Implement uncompressed views of compressed images
2633- nvk: Set pointClippingBehavior
2634- nvk: Expose VK_KHR_maintenance2
2635- nvk: Add a separate #define for SSBO alignment
2636- nvk: Set spirv_to_nir_options::min_*_alignment
2637- nvk: Use vk_device_memory
2638- nvk: Implement VK_KHR_map_memory2
2639- nvk: Sort SPIR-V caps
2640- nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+
2641- nvk: Only use view_id for layer in multiview
2642- nvk/heap: Set the right pitch for heap resize copies
2643- nvk: Advertise shaderStorageImageReadWithoutFormat
2644- nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect
2645- nvk: Advertise KHR_spirv_1_4
2646- nvk: s/device/dev in nvk_image.c
2647- nvk: Add helpers for binding image planes
2648- nvk: Take an nvk_image_plane in nouveau_copy_rect_image
2649- nvk: Use the max descriptor alignemtn in GetDescriptorSetLayoutSupport
2650- nvk: Use NVIDIA_VENDOR_ID in pdev try_create()
2651- nvk: Use abbreviated names in nvk_device_memory.c
2652- nvk: Add device and driver UUIDs
2653- nvk: Add external memory queries
2654- nvk: Dedicated allocations override internal
2655- nvk: Require dedicated allocations for external images
2656- nouveau/winsys: Add dma-buf import support
2657- nvk: Support dma-buf import
2658- nvk: Support dma-buf export
2659- nvk: Enable external memory extensions
2660- nvk: Reformat nvk_buffer.c
2661- nvk: Add a buffer alignment helper
2662- nvk: Add an addr field to nvk_image_plane
2663- nvk: Use canonical variable names in nvk_physical_device.c
2664- nvk: Use canonical variable names in nvk_shader.c
2665- nvk: Use canonical variable names in nvk_bo_sync.c
2666- nvk: Use canonical variable names in nvk_sampler.c
2667- nvk: Drop nvk_physical_device::instance
2668- nvk: Only advertise EXT_pci_bus_info on discrete GPUs
2669- nouveau: Put PCI info in a pci substruct in nv_device_info
2670- nouveau: Stop using hex for SM numbers
2671- nvk: Set deviceType based on nv_device_info::type
2672- nouveau: Move more stuff into nv_device_info
2673- nouveau: Move gart_size to nv_device_info
2674- nvk: Use nv_device_info for class checks
2675- nvk: Rename nvk_device::ctx to ws_ctx
2676- nvk: Add a ws_dev to nvk_device and use it
2677- nvk: Move the winsys device to nvk_device
2678- nvk: Don't enumerate pre-Kepler GPUs
2679- nvk: Implement VK_EXT_physical_device_drm
2680- nvk: Require an environment variable for poorly tested hardware
2681- nvk: Use the new core vk_sampler struct
2682- Revert "vulkan: Allow scissors or viewports to be set without counts"
2683- vulkan/meta: Add a get_pipeline_layout helper
2684- vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve
2685- nvk: Bind 3D depth/stencil images as 2D arrays
2686- nvk: Flush more state on VI_BINDINGS_VALID dirty
2687- nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport
2688- docs: Add a docs page for NVK
2689- docs: Add NVK to features.txt
2690- docs/relnotes: Stick something about NVK in new_features.txt
2691- nouveau: Drop GART size from nv_device_info
2692- nil: Add a nil_image_level_extent_px() helper
2693- nvk: Use the new NIL helper for image level extents for copies
2694- nvk: Improve image format properties and limits
2695- nvk: Rework multi-plane format features a bit
2696- nvk: Use nvk_root_descriptor_offset for drawInfoBase
2697- nvk: Add a root_desc_addr to the root descriptor table
2698- nvk: Add support for variable pointers
2699- nvk: Enable the SPIR-V DeviceGroup capability
2700- nvk: Separate the MME query copy code out a bit
2701- nvk: Implement CopyQueryPoolResults with a compute shader
2702- nvk: Misc. style nits
2703- nvk: Rework memory requirements to handle aspects correctly
2704- nvk: Implement the maintenance5 image layout queries
2705- nvk: Use VkBufferUsageFlags2
2706- nvk: Implement CmdBindIndexBuffer2KHR
2707- nvk: Implement GetRenderingAreaGranularityKHR
2708- nvk: Decorate CmdBegin/EndRendering entrypoints
2709- nouveau: Move shader topology info to nv_device_info
2710- drm-uapi: Import nouveau_drm.h
2711- nouveau/winsys: Use the imported nouveau_drm.h headers
2712- nvk: Use the imported nouveau_drm.h headers
2713- nouveau/shim: Use the imported nouveau_drm.h headers
2714- nouveau/mme: Support the new UAPI
2715- nvk: Use an empty EXEC for the empty submit case
2716- nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND
2717- nvk: Print an error message if VM_BIND support is missing
2718- nvk: Enable the new UAPI
2719- nvk: Use more consistent device variable names
2720- nvk: Call nir_lower_int64
2721- nir/gl: Move glsl_type::sampler_target() into a helper in its one caller
2722- nvk: Remove plane sources from tex instructions
2723- nvk: Use common physical device properties
2724- nv50/ir: Rework conversions for texture array indices
2725- clang-format: Add nir_foreach_reg_*
2726- clang-format: nir_foreach_src is not a foreach macro
2727- clang-format: Set the default ColumnLimit to 0
2728- nir: Re-align a couple enums and add clang-format comments
2729- nir: Don't clang-format const_value helpers
2730- nir: Don't clang-format a couple typedefs
2731- nir: Don't clang-format debug print setup
2732- nir: More manual formatting
2733- nir: Pretty format type mapping helpers
2734- nir: Wrap pass macros in braces
2735- nir: Add a do to the do/while in nir_const_value_t_array()
2736- nir: Add a .clang-format file
2737- nir: clang-format src/compiler/nir/\*.[ch]
2738- nvk: Don't use nir_ssa_for_src()
2739- nir: Drop most instances of nir_ssa_dest_init()
2740- nir: Drop more instances of nir_ssa_dest_init()
2741- nir/clone: Clone nir_def nor nir_dest
2742- nir/serialize: [De]serialize nir_def nor nir_dest
2743- nir: Drop nir_ssa_dest_init()
2744- nir: Drop nir_ssa_dest_init_for_type()
2745- nir: nir_foreach_ssa_def() -> nir_foreach_def()
2746- st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest
2747- dxil: Use nir_foreach_def() instead of nir_foreach_dest()
2748- nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest()
2749- nir: Drop nir_foreach_dest()
2750- intel/vec4: Stop passing around nir_dest
2751- intel/fs: Stop passing around nir_dest and nir_alu_dest
2752- broadcom: Stop using nir_dest directly
2753- vc4: Stop passing around nir_dest
2754- nir,ntt,a2xx,lima: Stop using nir_dest directly
2755- lima: Stop using nir_dest directly
2756- etnaviv: Stop passing around nir_dest
2757- r600/sfn: Stop passing around nir_dest and nir_alu_dest
2758- nv50/ir: Stop passing around nir_dest and nir_alu_dest
2759- nir/gather_types: Stop passing around nir_dest
2760- nir/dce: Stop passing around nir_dest
2761- nir/propagate_invariant: Stop passing around nir_dest
2762- nir/validate: Replace all dest validation with validate_def
2763- nir/print: Replace all dest printing with print_def
2764- nir: Get rid of nir_dest_bit_size()
2765- nir: Get rid of nir_dest_num_components()
2766- nir: Get rid of nir_dest_is_divergent()
2767- nir: Drop nir_alu_dest
2768- nir: Drop nir_dest
2769- util/format: 8-bit interleaved YUV formats are UNORM
2770- gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM
2771- blorp: Use R8G8_UINT for YCRCB_* formats with CCS
2772- anv: Disable CCS_E for ISL_FORMAT_YCRCB_*
2773- vulkan/format: Use correct swizzle for 1-plane YCbCr formats
2774- gallivm: Drop the Vulkan YUV format hacks
2775- nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef
2776- nir s/nir_get_ssa_scalar/nir_get_scalar/
2777- nir: s/live_ssa_def/live_def/
2778- nir: s/nir_instr_ssa_def/nir_instr_def/
2779- nir: Rework nir_scalar_chase_movs a bit
2780- nir: Fix nir_op_mov handling in nir_collect_src_uniforms
2781- nir: Handle nir_op_mov properly in opt_shrink_vectors
2782- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef
2783- nir: Clean up nir_op_is_vec() and its callers
2784- nir/large_constants: Use nir_component_mask_t
2785- nir/large_constants: Add read/write_const_values helpers
2786- nir/opt_large_constants: Add Small constant handling
2787- spirv: Re-emit constants at their uses
2788- nir: Take a nir_def * in nir_tex_instr_add_src()
2789- nir: Take a nir_def * in nir_phi_instr_add_src()
2790- nir/opt_undef: Don't rewrite a bcsel to mov
2791- nir: Add a nir_instr_clear_src() helper and use it
2792- nir: Add and use a nir_instr_init_src() helper
2793- nir: Drop nir_if_rewrite_condition()
2794- nir: Drop most uses of nir_instr_rewrite_src_ssa()
2795- nir: Drop nir_instr_rewrite_src_ssa()
2796- nir: Drop most uses if nir_instr_rewrite_src()
2797- nir: Drop nir_instr_rewrite_src()
2798- nir: Drop nir_push_if_src()
2799- nir: Fix metadata in nir_lower_is_helper_invocation
2800- nir: Use nir_shader_intrinsic_pass() a few places
2801- drm-uapi: Sync nouveau_drm.h
2802- nvk: Plumb no_prefetch through to the DRM back-end
2803- nouveau/mme: Fix a compile warning
2804- intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s
2805- intel/isl: Add ICL variants of Yf and Ys tiling
2806- intel/isl: Implement correct tile size calculations for Ys/Yf
2807- intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces
2808- intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64
2809- intel/isl: Indent uncompressed surface code
2810- intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf
2811- intel/isl: Support Yf/Ys tiling in surf_fill_state
2812- intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz
2813- intel/isl: Add initial data-structure support for miptails
2814- intel/isl: Add support for computing offsets with miptails
2815- intel/isl: Support miptails in isl_surf_get_uncompressed_surf
2816- intel/isl: Start using miptails
2817- intel/isl: Disallow CCS on 3D surfaces with miptails
2818- intel/isl: Allow Ys tiling
2819- anv: Align memory VA to support for Ys, Tile64 tiled images
2820- nvk: Clean up includes
2821- nvk: Add include guards to nvk_bo_sync.h
2822- nvk: SPDX everything
2823- nouveau/nil: SPDX everything
2824- nouveau/mme: SPDX everything
2825- nvk: Don't add a dummy attachment when gl_SampleMask is written
2826- nvk: Set the discard bit for Z/S self-deps
2827- nvk: Invalidate the texture cache in PipelineBarrier
2828- nvk: Lower interp_at_sample to interp_at_offset
2829- nvk: Disable statistics around meta ops
2830- nvk: Clean up viewport math
2831- nvk: Fix depth clipping parameters
2832- nvk: Enable dynamic clip/clamp enable
2833- nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping
2834- r600: Use more auto-generated nir_builder helpers
2835- r600: Use nir_builder helpers for load/store_shared_r600
2836- nvk: Re-order physical device limits
2837- nvk: Advertise maxMemoryAllocationCount = 4096
2838- nvk: Advertise discreteQueuePriorities = 2
2839- nvk: Rip out old UAPI support
2840- nvk/drm: Drop the push_add_push_bo() helper
2841- nvk/drm: Drop the push_add_bo() helper
2842- nvk: Drop command buffer BO tracking
2843- nvk: Drop memory object tracking
2844- nvk: Drop the device-level mutex
2845- nvk: Get rid of the tiled memory allocation helpers
2846- nvk/drm: Restructure nvk_queue_submit_drm_nouveau()
2847- nvk/drm: Split exec as needed for large command buffers
2848- nvk: Don't store the descriptor pool BO in the set
2849- nvk: Store a 20-bit driver_build_sha in nvk_instance
2850- nvk: Hook up the disk cache
2851- nvk: Re-structure early shader compilation a bit
2852- nvk: Add a default pipeline cache
2853- nvk: Cache NIR shaders
2854- nvk: Init pipelineCacheUUID
2855- drm-uapi: Sync nouveau_drm.h
2856- nvk: Take GETPARAM_EXEC_PUSH_MAX into account
2857- nvk: Handle zero-sized sparse buffers
2858- nvk: Use align() and align64() instead of ALIGN_POT
2859- nouveau: Generate headers for Maxwell B compute
2860- nvk: Add a nvk_cmd_buffer_compute_cls() helper
2861- nvk: Invalidate sampler/texture header caches in BeginCommandBuffer()
2862- nvk: Invalidate SKED caches at the top of command buffers
2863- nvk: Advertise more inline uniform block limits
2864- nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch
2865- nvk: Set max descriptors to 2^20 for most descriptor types
2866- nvk: Reset descriptor pool allocator when all sets are destroyed
2867- nil/format: Use A for alpha blend
2868- nil/format: Advertise R10G10B10A2_UINT texture buffer support
2869- nvk: Disable depth or stencil tests when unbound
2870- nvk: Always emit at least one color attachment
2871- nvk: Improve address space and buffer size limits
2872- nvk: Always set pixel_min/max_Z to CLAMP
2873- nvk: Use nouveau_ws_bo_unmap() instead of munmap()
2874- nvk: Free the disk cache
2875- nvk: Add an nvk_shader_finish() helper
2876- nvk: Handle unbinding images and buffers
2877- nvk: Clean up the disk cache on physical device create fail path
2878- vulkan/wsi: Allow for larger linear images
2879- nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper
2880- nvk: Re-sort device features
2881- nvk: Implement VK_EXT_depth_bias_control
2882- nvk: Advertise VK_KHR_workgroup_memory_explicit_layout
2883- nvk: Implement VK_EXT_image_sliced_view_of_3d
2884- nvk: Advertise VK_EXT_primitive_topology_list_restart
2885- nvk: Advertise VK_EXT_attachment_feedback_loop_layout
2886- features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK
2887- nvk: Re-arrange Vulkan 1.2 features to match the header
2888- nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex
2889- nvk: Enable descriptorIndexing
2890- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
2891- nir: Add a nir_ssa_def_all_uses_are_fsat() helper
2892- nir: Add convert_alu_types to divergence analysis
2893- nir/lower_tex: Add a lower_txd_clamp option
2894- nir: Add a load_sysval_nv intrinsic
2895- nir: Add NV-specific texture opcodes
2896- nir: Add an load_barycentric_at_offset_nv intrinsic
2897- nir: Add a range to most I/O intrinsics
2898- nir: Add NVIDIA-specific I/O intrinsics
2899- nir/lower_bit_size: Fix subgroup lowering for floats
2900- nir: add deref follower builder for casts.
2901- nir: Handle wildcards with casts in copy_prop_vars
2902
2903Felix DeGrood (12):
2904
2905- anv: save a shader source uint32_t hash in gfx/compute pipelines
2906- anv: Add Source hash field to VkPipelineExecutableStatisticKHR
2907- iris: save shader source sha1 in ish
2908- mesa: propagate shader source sha1 from gl_shader to nir_shader
2909- intel: use shader source hash in INTEL_MEASURE
2910- intel/compiler: use shader source hash in shader dump code
2911- anv: add fake sparse support
2912- anv: enable fake sparse for Elden Ring
2913- anv: debug messaging for sparse texture usage
2914- anv: fix frame count reporting in INTEL_MEASURE
2915- anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4
2916- anv: remove CS_FLUSH from query regression
2917
2918Feng Jiang (9):
2919
2920- virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation
2921- meson: Export winsys function symbols for target va
2922- frontends/va: Add slice_count to AV1 slice_parameter
2923- virgl/video: Add definition of virgl_av1_picture_desc
2924- virgl/video: Add support for AV1 decoding
2925- virgl/video: Enable AV1 decoding
2926- meson: Rename dri-vdpau.dyn to dri.dyn
2927- CODEOWNERS: Add \@flynnjiang for VirGL video
2928- meson: Move video to separate section in meson configuration summary
2929
2930Filip Gawin (1):
2931
2932- crocus: Avoid fast-clear with incompatible view
2933
2934Flora Cui (1):
2935
2936- radeonsi: limit CP DMA to skip holes in sparse bo
2937
2938Francisco Jerez (29):
2939
2940- intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
2941- intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
2942- intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2.
2943- intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms.
2944- intel/eu/xe2+: Fix encoding of various message descriptors for change in register size.
2945- intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges().
2946- intel/fs/xe2+: Fix payload node live range calculations for change in register size.
2947- intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size.
2948- intel/fs/xe2+: Fixes for increased accumulator register width.
2949- intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size.
2950- intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size
2951- intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions.
2952- intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units.
2953- intel/fs/xe2+: Update encoding of FB write message payload.
2954- intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit.
2955- intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size.
2956- intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size
2957- intel/fs/xe2+: Update GS payload setup for Xe2 reg size.
2958- intel/fs/xe2+: Update TCS payload setup for Xe2 reg size.
2959- intel/fs/xe2+: Update TES payload setup for Xe2 reg size.
2960- intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs.
2961- intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs.
2962- intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU.
2963- intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs.
2964- intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg().
2965- intel/fs/xe2+: Fix URB writes with 0 data components.
2966- intel/fs: Specify number of data components of logical URB writes via control immediate.
2967- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes.
2968- intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads.
2969
2970Frank Binns (10):
2971
2972- pvr: clang-format fixes
2973- pvr: skip setting up SPM consts buffer when no const shared regs are used
2974- pvr: cleanup SPM EOT dynarray after upload
2975- pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported
2976- pvr: remove pvr_pbe_get_src_pos()
2977- pvr: fix attachments segfault in pvr_is_stencil_store_load_needed()
2978- pvr: fix allocation size of clear colour consts shared regs buffer
2979- pvr: change a few places to use PVR_DW_TO_BYTES()
2980- pvr: fix setup of load op unresolved msaa mask
2981- pvr: emit PPP state when vis_test dirty bit is set
2982
2983Friedrich Vock (19):
2984
2985- radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs
2986- radv: Handle VK_SUBOPTIMAL_KHR in trace layers
2987- ac/msgpack: make fixstrs a const char
2988- ac/sqtt,radv: Split internal and API hash in PSO correlations
2989- ac/rgp: Write lds_size metadata
2990- ac/rgp: Add metadata for separate-compiled RT stages
2991- radv/sqtt: Move record filling to helper function
2992- radv/sqtt: Unregister records based on hash
2993- radv/sqtt: Write LDS size metadata in code objects
2994- radv/sqtt: Handle separately-compiled RT pipelines
2995- ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation
2996- nir/load_store_vectorize: Handle intrinsics with constant base
2997- radv/rt: Pre-initialize instance address
2998- radv: Initialize shader freelist on allocation
2999- radv: Fix check in insert_block
3000- radv/rt: Reject hits within 10ULP of previous hits in emulated RT
3001- radv/rra: Recognize LPDDR memory
3002- radv/rmv: Recognize LPDDR memory
3003- vulkan: Don't use set_foreach_remove when destroying pipeline caches
3004
3005Ganesh Belgur Ramachandra (5):
3006
3007- radeonsi: stores bottom_edge_rule option in the rasterizer state
3008- radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule
3009- radeonsi: "clear_render_target" shader in nir
3010- radeonsi: "clear_render_target_1d_array" shader in nir
3011- radeonsi: "clear_12bytes_buffer" shader in nir
3012
3013Georg Lehmann (39):
3014
3015- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
3016- aco: improve get_gfx11_true16_mask description
3017- aco: combine a & ~b to bfi(b, 0, a)
3018- aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz
3019- aco: fix non constant 16bit bitnz/bitz
3020- ac/nir: handle more special cases in ac_nir_unpack_arg
3021- aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000
3022- nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan
3023- aco: always use rtne for fquantize2f16
3024- nir/opt_if: also rewrite uniform uses for read_invocation
3025- nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select}
3026- nir: unify lower_bitfield_extract with has_bfe
3027- nir: unify lower_find_msb with has_{find_msb_rev,uclz}
3028- aco: fix u2f16 with 32bit input
3029- aco: combine a | ~b to bfi(b, a, -1)
3030- aco: use v_cvt_f32_ubyte for signed casts too
3031- nir: add nir_scalar intrinsic helpers
3032- nir: add nir_scalar_equal
3033- aco: implement some exclusive scans with inclusive scans
3034- aco/gfx11: don't use bfe for local_invocation_id if the others are always 0
3035- nir/opt_algebraic: remove broken fddx/fddy patterns
3036- aco: simplify masked swizzle dpp selection by removing or_mask first
3037- aco: fix p_extract with v1 dst and s1 operand
3038- aco: implement 64bit div find_lsb
3039- nir: scalarize masked_swizzle_amd created from shuffle_xor
3040- aco/optimizer: check if we can use omod before labeling it
3041- aco/optimizer: copy propagate to output modifier instructions
3042- aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed
3043- nir: make quad intrinsic dst bit size match src0
3044- nir/lower_subgroups: use intrinsic builder more
3045- aco: assume new generations are unsupported by clrx
3046- aco: assume newer generation will use GFX11 wait_imm packing
3047- aco: print final ir instead if printing asm is unsupported
3048- aco/gfx11: optimize dual source export
3049- aco/gfx11: apply clamp/omod to vinterp
3050- aco: support v_fma_f32_dpp as fma_mix
3051- aco/gfx11: support vinterp as fma_mix
3052- aco: add missing scc def for SALU quad broadcast
3053- aco/sched: treat p_dual_src_export_gfx11 like export
3054
3055George Ouzounoudis (38):
3056
3057- nouveau/codegen: Support compact clip distances with arrayed_io
3058- nouveau/codegen: Handle nir op amul
3059- nouveau/codegen: Fix compact patch varyings in case of NIR
3060- nouveau/codegen: Add capability to pre-specify tessellation domain
3061- nvk: Do not increment instance id across draws
3062- nvk: Add a macro for root descriptor table byte offsets
3063- nvk: Set base vertex state in sequential mme draw
3064- nvk: Support base instance in instanced draw calls
3065- nvk: Switch point rasterization to point sprites
3066- nvk: Support large points
3067- nvk: Compile geometry shaders
3068- nouveau/mme: Keep device info in mme_builder
3069- nvk: Simplify mme build function argument
3070- nvk: Support VK_KHR_shader_draw_parameters
3071- nvk: Support for vertex shader transform feedback
3072- nvk: Support transform feedback indirect draws
3073- nvk: Support transform feedback geometry streams
3074- nvk: Support transform feedback queries
3075- nvk: Support vertex shader transform feedback on Fermi
3076- nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default
3077- nvk: Fix geometry shader active stream mask
3078- nvk: Support geometry shaders
3079- nvk: Basic tessellation shader support
3080- nvk: Assign locations correctly for arrayed IO
3081- nvk: Enable multiview with tessellation shader
3082- nvk: Fix cases where execution mode is specified in the tesc shader.
3083- nvk: Respect tessellation domain origin state
3084- nvk: Lower io to temporaries for tessellation evaluation nir
3085- nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport
3086- nvk: Handle cases of descriptor bindings with variable counts
3087- nvk: Add nir non-uniform optimization pass
3088- nvk: Enable descriptor indexing
3089- nvk: Do not keep redundant info for tessellation domain
3090- nouveau/codegen: Do not keep redundant info for tessellation domain
3091- nvk: Enable dynamic line rasterization mode state
3092- nvk: Fix support for VK_EXT_sample_locations
3093- nvk: Support dynamic state for enabling sample locations
3094- nouveau/codegen: Add a 4th optimization level for MemoryOpts
3095
3096Gert Wollny (63):
3097
3098- r600/sfn: Switch to register intrinsics
3099- r600/sfn/tests: add simple copy-prop test with register source
3100- r600/sfn: Allow for larger ALU CF's
3101- r600/sfn: Handle indirect array load/store dependencies better
3102- r600/sfn: Increase LDS fetch schedule priority
3103- r600/sfn: Add peephole optimization to move a dest to the previous op
3104- r600/sfn: reorder the value factory class member declaration a bit
3105- r600/sfn: Add some tests for proper register access
3106- r600/sfn: Print more info if scheduling fails
3107- r600/sfn: remove debug output leftovers
3108- r600/sfn: Fix use of multiple IDX with kcache
3109- r600/sfn: Always check arrays writes before allowing copy propagation
3110- r600/sfn: set block sizes based on chip class
3111- r600/sfn: Fix typo with block type
3112- r600/sfn: override slot count for IfInstr
3113- r600/sfn: Add method to convert to AluGroup directly
3114- r600/sfn: Add flags to check whether a group starts CF and can do that
3115- r600/sfn: make remaining slots a signed value
3116- r600/sfn: on Cayman loading an index register needs only one slot
3117- r600/sfn: Splizt ALU blocks in scheduler to fit into 128 slots
3118- r600/sfn: rework checks for ALU CF emission
3119- r600/sfn: Schedule AR uses befor possible groups
3120- r600: Explicitly force new CF in gs copy shader
3121- r600: Assert when backend wants to create a new ALU CF
3122- r600: don't check possible size of ALU CF
3123- r600: don't use sb disasm to disassamble copy shader
3124- r600: Force CF when emitting a NOP on R600 in gs copy shader
3125- r600/sfn: Don't try to propagate to vec4 with more than one use
3126- r600/sfn: Only switch to other CF if no AR uses are pending
3127- r600/sfn: AR loads should depend on all previous non ALU instructions
3128- r600/sfn: Renumber shader blocks in scheduler
3129- r600/sfn: Track whether a register is ALU clause local
3130- r600/sfn: Use clause local registers in RA
3131- r600/sfn: Take source uses into account when switching channels
3132- r600/sfn: take number of dest values into account
3133- r600: retire SB optimizer
3134- r600/sfn: work around injecting extra CF's to handle hardware bugs
3135- r600: use correct cso pointer for fetch shader
3136- r600/sfn: Make use of four clause local registers
3137- r600/sfn: drop unused ControlFlowInstr type enum
3138- r600/sfn: factor out resource as extra class
3139- r600/sfn: Simplify dependency chain for index loads on EG
3140- r600: print texture resource index mode separately
3141- r600/sfn: Make address split pass obligatory
3142- r600/sfn: rename method resource_base to resource_id
3143- r600/sfn: Add old address to update_indirect_addr
3144- r600/sfn: Sepeate resource and sampler in texture instructions
3145- r600/sfn: get rid of the method to get the index mode
3146- r600/sfn: sort the uniforms of the right shader
3147- r600/sfn: Fix use of scheduled_shader vs shader
3148- virgl: report MIRROR_CLAMP features better
3149- ci: Upref virglrenderer
3150- copyimage: check requested slice early when cube maps are involved
3151- mesa: check numlevels and numlayers when creating a texture view
3152- virgl: Use common clear_texture if host doesn't support the feature
3153- r600/sfn: don't remove texture sources by using the enum value
3154- r600: drop egcm_load_index_reg
3155- r600/sfn: Don't override a chgr pinning during copy propagation
3156- r600/sfn: When simplifying src vec4 pinnings, also check all uses
3157- virgl: Fix logic for reporting PIPE_MIRROR_CLAMP
3158- r600: Add callbacks for get_driver_uuid and get_device_uuid
3159- r600: Link with libgalliumvl, when enabling rusticl this is needed
3160- r600/sfn: Fixup component count only if intrinsic has it
3161
3162Guilherme Gallo (5):
3163
3164- bin/ci: Ensure that all jobs have nodes in DAG
3165- ci/radeonsi: Update flake list
3166- ci/freedreno: Add a new flake
3167- ci/zink: Found some flakes
3168- ci/anv: Catch some flakes
3169
3170Hannes Mann (1):
3171
3172- vulkan/wsi/wayland: Fix detection of tearing control protocol
3173
3174Hans-Kristian Arntzen (2):
3175
3176- wsi/x11: Fix potential deadlock in present ID.
3177- wsi/x11: Don't allow signal_present_id to rewind.
3178
3179Helen Koike (21):
3180
3181- ci: re-add EXTRA_LOCAL_PACKAGES to rootfs
3182- ci: add EXTRA_LOCAL_PACKAGES to apt-get install
3183- docs/ci: Add docs for EXTRA_LOCAL_PACKAGES
3184- ci: disable duplicated pipelines triggered by marge
3185- ci: add --project option to ci_run_n_monitor.py
3186- ci/android: remove strace output from cuttlefish-runner.sh
3187- ci: add locked flag to bindgen-cli on x86_64_build.sh
3188- ci: separate hiden jobs to -inc.yml files
3189- ci/ci_run_n_monitor: add docs for multiple targets
3190- ci/ci_run_n_monitor: print stress test results per job
3191- ci/ci_run_n_monitor: simplify with defaultdict
3192- ci/ci_run_n_monitor: merge print_job_status_change with print_job_status
3193- ci/ci_run_n_monitor: make --target mandatory
3194- ci/ci_run_n_monitor: merge enable_job with retry_job
3195- ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline()
3196- ci/ci_run_n_monitor: allow <user>/<project> in --project
3197- ci/ci_run_n_monitor: limit repetitions on --stress
3198- ci/marge_queue: add missing python-dateutils to requirements.txt
3199- ci/ci_run_n_monitor: keep monitoring if a job is still running
3200- ci/marge_queue: add pretty_dutation()
3201- ci/ci_run_n_monitor: print job duration time
3202
3203Honglei Huang (7):
3204
3205- virgl/video: Add support for mpeg12 decoding
3206- virgl/video: Add support for vc1 decoding
3207- virgl/video: Add support for jpeg decoding
3208- virgl/video: Add support for hevc10bit decoding.
3209- virgl/video: Add more pipe type in virgl formats convert table
3210- virgl/video: Add jpeg buf start code check
3211- virgl: Enable vp9 hardware decode
3212
3213Hyunjun Ko (3):
3214
3215- anv: use ycbcr_info for P010 format
3216- anv: don't use cmd_buffer after destroyed.
3217- anv: don't flush_llc on gen9
3218
3219Iago Toral Quiroga (100):
3220
3221- nir/trivialize: Move decl_reg to the start of the block
3222- v3dv: stop incrementing UBO indices by one
3223- nir/lower_robustness: drop skip_ubo_0 option
3224- v3dv: fix incorrect key setup
3225- broadcom/compiler: stop asserting on Vulkan environment
3226- broadcom/compiler: use NIR's lowering for dispatch base
3227- broadcom/compiler: move uniform offset lowering from compiler to GL driver
3228- broadcom/compiler: move vulkan's point coord lowering to the driver
3229- v3dv: don't set lower_wpos_pntc for Vulkan
3230- broadcom/compiler: always clamp results from logic ops
3231- broadcom/compiler: drop execution environment from the shader key
3232- v3dv: drop cpu path for buffer to image copies
3233- v3dv: remove unused code
3234- nir/lower_tex: copy backend_flags field when copying a tex instruction
3235- nir/lower_tex: use a callback to check sampler return size packing
3236- squash! v3dv,broadcom/compiler: don't abuse sampler index
3237- v3dv: assert that only tex instructions with sampler state have a sampler src
3238- v3d: fix texture packing lowering
3239- v3d,v3dv: use fquantize2f16 lowering in NIR
3240- v3dv: be more precise in vkGetImageSubresourceLayout
3241- v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT
3242- v3dv: bump up MAX_UNIFORM_BUFFERS to 16
3243- v3dv: add support for sampling simple 2D linear textures
3244- v3dv: expand sampling from linear image hack to support multi-planar images
3245- v3dv: don't assume that bound descriptors have been written
3246- v3dv: only handle Android Hardware Buffer on Android
3247- v3dv: we can sample from 1D array too
3248- broadcom/compiler: add a couple of shader key helpers
3249- v3d: compute nir sha1 for uncompiled shader state
3250- v3d: use pre-computed shader sha1 for disk cache
3251- v3d: fix RAM shader cache
3252- v3d: get rid of shader_state pointer in v3d_key
3253- broadcom/simulator: reset CFG7 for compute dispatch in v71
3254- broadcom/common: retrieve V3D revision number
3255- broadcom/compiler: update node/temp translation for v71
3256- broadcom/compiler: implement "reads/writes too soon" checks for v71
3257- broadcom/compiler: implement read stall check for v71
3258- broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper
3259- broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71
3260- broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71
3261- broadcom/qpu: fix packing/unpacking of fmov variants for v71
3262- broadcom/compiler: make vir_write_rX return false on platforms without accums
3263- broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly
3264- broadcom/compiler: only handle accumulator classes if present
3265- broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes
3266- broadcom/compiler: CS payload registers have changed in v71
3267- broadcom/compiler: don't schedule rf0 writes right after ldvary
3268- broadcom/compiler: allow instruction merges in v71
3269- broadcom/qpu: add MOV integer packing/unpacking variants
3270- broadcom/qpu: fail packing on unhandled mul pack/unpack
3271- broadcom/compiler: generalize check for shaders using pixel center W
3272- broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords
3273- broadcom/compiler: update peripheral access restrictions for v71
3274- broadcom/qpu: add packing for fmov on ADD alu
3275- broadcom/compiler: handle rf0 flops storage restriction in v71
3276- broadcom/compiler: enable ldvary pipelining on v71
3277- broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71
3278- broadcom/compiler: don't assign rf0 to temps that conflict with ldvary
3279- broadcom/compiler: convert mul to add when needed to allow merge
3280- broadcom/compiler: implement small immediates for v71
3281- broadcom/compiler: update thread end restrictions for v7.x
3282- broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x
3283- broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x
3284- broadcom/compiler: start allocating from RF 4 in V7.x
3285- broadcom/compiler: validate restrictions after TLB Z write
3286- broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x
3287- broadcom/compiler: fix up copy propagation for v71
3288- broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x
3289- broadcom/compiler: improve allocation for final program instructions
3290- broadcom/compiler: don't assign registers to unused nodes/temps
3291- broadcom/compiler: only assign rf0 as last resort in V3D 7.x
3292- v3dv: expose V3D revision number in device name
3293- v3dv/device: handle new rpi5 device (bcm2712)
3294- v3dv: setup render pass color clears for any format bpp in v71
3295- v3dv: setup TLB clear color for meta operations in v71
3296- v3dv: fix up texture shader state for v71
3297- v3dv: handle new texture state transfer functions in v71
3298- v3dv: implement noop job for v71
3299- v3dv: handle render pass global clear for v71
3300- v3dv: GFX-1461 does not affect V3D 7.x
3301- broadcom/compiler: update thread end restrictions validation for v71
3302- v3dv: handle early Z/S clears for v71
3303- v3dv: handle RTs with no color targets in v71
3304- v3dv: don't convert floating point border colors in v71
3305- v3dv: handle Z clipping in v71
3306- v3dv: make v3dv_viewport_compute_xform depend on the V3D version
3307- v3dv: fix depth clipping then Z scale is too small in V3D 7.x
3308- v3d/v3dv: fix texture state array stride packing for V3D 7.1.5
3309- v3d,v3dv: support up to 8 render targets in v7.1+
3310- v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x
3311- v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x
3312- v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x
3313- v3dv: fix color write mask for v3d 7.x
3314- v3d,v3dv: fix depth bias for v3d 7.x
3315- v3d,v3dv: fix compute for V3D 7.1.6+
3316- v3dv: expose fullDrawIndexUint32 in V3D 7.x
3317- v3dv: expose depthClamp in V3D 7.x
3318- v3dv: expose scalarBlockLayout on V3D 7.x
3319- v3dv: fix confusing nomenclature about DRM nodes
3320- v3d,v3dv: fix MMU error from hardware prefetch after ldunifa
3321
3322Ian Douglas Scott (1):
3323
3324- egl/wayland: Don't segfault if \`create_wl_buffer` returns \`NULL`
3325
3326Ian Romanick (38):
3327
3328- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
3329- intel/fs: Constant fold SHL
3330- intel/fs: Constant fold OR and AND
3331- util/rb-tree: Return the actual first node from rb_tree_search
3332- util/rb-tree: Fix typo in comment
3333- nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers
3334- nir/algebraic: Remove redundant pack / unpack lowering patterns
3335- intel/fs: Completely re-write the combine constants pass
3336- intel/fs: Combine constants for SEL instructions too
3337- intel/fs: Combine constants for integer instructions too
3338- intel/fs: New VGRF packing scheme for constant combining
3339- intel/compiler: Combine control barriers with identical memory semantics
3340- intel/compiler: Don't evict for workgroup-scope fences
3341- glsl/list: Clean up an inappropriate comment
3342- util/rb-tree: Work around C++'s dislike of offsetof
3343- util/rb-tree: Inline rb_tree_init
3344- intel/fs: Don't continue fixed point iteration just because liveout changes
3345- intel/fs: Don't try to copy propagate into a source again after progress is made
3346- intel/fs: Make try_constant_propagate and try_copy_propagate file private
3347- intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate
3348- intel/fs: Don't loop in try_constant_propagate
3349- intel/fs: Simplify check in can_propagate_from
3350- intel/fs: Make opt_copy_propagation_local file private
3351- intel/fs: Encapsulate per-block ACP in a structure
3352- intel/fs: Use rb_tree to store ACP entries by source
3353- intel/fs: Use rb_tree to store ACP entries by destination
3354- intel/fs: Use rb_tree for copy prop dataflow
3355- intel/fs: Merge copy prop dataflow loops
3356- intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size
3357- intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size
3358- intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode
3359- nir/rematerialize: Rematerialize ALUs used only by compares with zero
3360- intel/compiler/xe2: Handle new URB read messages
3361- intel/compiler/xe2: Handle new URB write messages
3362- intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8
3363- spirv: Track when a shader has a cooperative matrix
3364- intel/fs: Add DP4A to get_lowered_simd_width
3365- nir/split_vars: Don't split arrays of cooperative matrix types
3366
3367Igor Torrente (4):
3368
3369- zink: Fix enumerate devices when running compositor
3370- zink: Removes \`disable_xcb_surface`
3371- zink: Fix one addicional case when running a compositor
3372- zink: fix for startup crash of weston running on top of zink + venus
3373
3374Illia Abernikhin (2):
3375
3376- state_tracker: moving initialisation of whandle out from if statement whandle initialization inside if statement but used also outside
3377- i915: change format in dbg string Actually, uintptr_t is of type unsigned long, but the debug line uses the %d format specifier, which expects an int.
3378
3379Illia Polishchuk (7):
3380
3381- iris: remove NULL check for already dereferenced pointer earlier
3382- s/Intel: fix/anv: fix: potentially overflowing expression in genX
3383- glx: fix dead code when gc var cannot be null due to earlier check
3384- state_tracker: fix dereference before null check
3385- anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation
3386- zink: move find_sampler_var from zink to nir core
3387- nir: fix invalid sampler search by texture id
3388
3389Italo Nicola (24):
3390
3391- mesa/main: account for RTT samples when updating framebuffer
3392- mesa/main: allow readpix/teximage to read from implicitly multisampled fbos
3393- panfrost/genxml: fix Surface With Stride descriptor alignment
3394- panfrost/genxml: add Multiplanar Surface descriptor
3395- panfrost: refactor (un)packing of surface descriptors
3396- pan/decode: decode Multiplanar Surface descriptors
3397- panfrost: prepare pan_image_view for multiplanar formats
3398- panfrost: prepare the driver to support YUYV and variants
3399- panfrost: advertise support for YUYV and variants
3400- panfrost: mandate proper alignment requirement depending format and arch
3401- panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler
3402- gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420
3403- gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420
3404- pan/bi: add support for I420 and YV12 sampling
3405- gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8
3406- panfrost: fix invalid memory access in get_equation_str()
3407- pan/decode: handle more than one panfrost_device
3408- panfrost/ci: updated CI expectations
3409- egl: reenable partial redraw with a warning when using gallium hud
3410- pan/genxml: add Width/Height fields to v9+ Plane descriptor
3411- panfrost: rename _needs_multiplanar_descriptor to _is_yuv
3412- panfrost: prepare v9+ to support YUV sampling
3413- panfrost: use centered YUV chroma siting
3414- panfrost: advertise YUV formats for valhall
3415
3416Iván Briano (23):
3417
3418- anv: ensure CFE_STATE is emitted for ray tracing pipelines
3419- iris: ensure mesh is disabled on context init
3420- anv: ensure mesh is disabled on context init
3421- anv: implement Wa_14019750404
3422- intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir
3423- anv,hasvk: respect provoking vertex setting on geometry shaders
3424- anv: fix missing 3DSTATE_SBE_CLIP emission
3425- anv: ensure pipelines have all state
3426- anv: tell blorp to do mesh stuff only if it's enabled
3427- blorp: fix hangs with mesh enabled
3428- anv: use a simpler MUE layout for fast linked libraries
3429- anv: track what kind of pipeline a fragment shader may be used with
3430- intel/fs: read viewport and layer from the FS payload
3431- intel/fs: handle URB setup for fast linked mesh pipelines
3432- anv: enable VK_EXT_mesh_shader where supported
3433- intel/fs: use ffsll so we don't explode on 32 bits
3434- vulkan/runtime: add internal parameter to vk_spirv_to_nir
3435- nir/lower_int64: respect rounding mode when casting to float
3436- intel/compiler: round f2f16 correctly for RTNE case
3437- util: add double_to_float16 helpers
3438- nir: round f2f16{_rtne/_rtz} correctly for constant expressions
3439- anv: advertise VK_KHR_global_priority_queue
3440- anv: use the right vertexOffset on CmdDrawMultiIndexed
3441
3442Jani Nikula (1):
3443
3444- docs/vulkan: fixup some typos
3445
3446Janne Grunau (4):
3447
3448- asahi: toggle more barrier bits after transform feedback
3449- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
3450- asahi,agx: Upload constant buffers immediately
3451- asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful()
3452
3453Jesse Natalie (2):
3454
3455- nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics
3456- d3d12: Fix multidimensional array ordering
3457
3458Jianxun Zhang (1):
3459
3460- intel/common: Only set op mask on instructions in decoder
3461
3462Jonathan Marek (2):
3463
3464- freedreno: move redump.h to common code + cleanup
3465- tu: add a TU_DEBUG=rd option for cmdstream dumping
3466
3467Jordan Justen (73):
3468
3469- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
3470- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
3471- genxml/hsw: Add additional MOCS field enumerations
3472- genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW
3473- isl/dev: Add uncached MOCS value
3474- isl: Set MOCS to uncached for MTL stream-out
3475- intel/isl: Use intel_needs_workaround() for MTL CCS WA
3476- intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID
3477- intel/dev: Add LNL platform enum
3478- intel/dev: Support xe2 device init (for intel_device_info_test)
3479- intel/tools: Use 'env bash' to find bash executable
3480- intel/decoder: Fix xml filename when verx10 % 10 is not 0
3481- intel/decoder: Add intel_spec_load_common()
3482- intel/decoder: Make intel_spec_load_filename() have separate dir and name strings
3483- intel/genxml: Align "Texture Coordinate Mode" naming
3484- intel/genxml: Split some genxml sorting code into a intel_genxml module
3485- intel/genxml: Convert gen_bits_header to use ElementTree
3486- intel/genxml: Convert gen_pack_header to use ElementTree
3487- intel/genxml: Add GenXml class into intel_genxml module
3488- intel/genxml: Add filter_engines() to GenXml class
3489- intel/genxml: Move sorting & writing into GenXml class
3490- intel/genxml: Don't rewrite sorted xml if the contents didn't change
3491- intel/genxml: Add final newline to output when saving xml
3492- intel/genxml: Update xml with gen_sort_tags.py output
3493- intel/dev: Use RPL-U name on RPL-U devices
3494- intel/dev: Add more RPL PCI IDs
3495- anvil,hasvk: Rename need_clflush to need_flush
3496- intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c
3497- anvil,hasvk: Replace intel_clflush_range with intel_flush_range
3498- intel/common: Add intel_flush_range_no_fence
3499- anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers
3500- util/u_cpu_detect: Drop unused has_tsc
3501- util/u_cpu_detect: Detect clflushopt support
3502- meson: Check for the __builtin_ia32_clflushopt function
3503- intel/clflush: Add support for clflushopt instruction
3504- intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES
3505- intel/genxml: Ignore tail leading/trailing whitespace in node_validator()
3506- intel/genxml: Fix comparing xml when node counts differ
3507- intel/dev: Update device string for MTL PCI ID 0x7d55
3508- intel/genxml: Support importing from another genxml file
3509- intel/genxml: Add support for excluding items when importing
3510- intel/genxml: Add all xml files as pack dependencies
3511- intel/genxml: Add GenXml.optimize_xml_import()
3512- intel/genxml: Drop assertion to allow for importing
3513- intel/genxml: Add GenXml.add_xml_imports method
3514- intel/genxml: Add GenXml.flatten_xml() method
3515- intel/genxml: Add genxml_import.py script
3516- intel/decoder: ralloc_steal() values from spec context for fields and enums
3517- intel/decoder: Implement support for importing genxml
3518- intel/genxml: Start Xe2 support
3519- intel/genxml: Auto-import genxml files using genxml_import.py
3520- intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set
3521- intel/compiler/fs: Support Xe2 reg size in assign_curb_setup
3522- intel/compiler: Update opt_split_sends() for Xe2 reg size
3523- intel/compiler: Update emit_rt_lsc_fence() for Xe2
3524- intel/compiler: Update lower_trace_ray_logical_send() for Xe2
3525- intel/compiler: Update ray-tracing intrinsic lowering for Xe2
3526- intel/compiler: Update RT stack_id access for Xe2
3527- intel/fs: Update SSBO & shared uniform block loads for Xe2
3528- intel/genxml: Build with gen20.xml
3529- intel/isl: Build for Xe2
3530- iris: Build for Xe2
3531- anv/blorp: Use anv_genX to set device->blorp.exec
3532- anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT
3533- anv: Build for Xe2
3534- anv: Print warning that Xe2 is not supported rather than failing
3535- intel/compiler: Add enum xe2_lsc_cache_store
3536- intel/compiler: Use enum xe2_lsc_cache_store on xe2
3537- intel/compiler: Add enum xe2_lsc_cache_load
3538- intel/compiler: Use enum xe2_lsc_cache_load on xe2
3539- anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch()
3540- anv/batch: Assert that extend_cb is non-NULL if the batch is out of space
3541- intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs
3542
3543Jose Maria Casanova Crespo (2):
3544
3545- vc4: mark buffers as initialized at vc4_texture_subdata
3546- vc4: Fix mask RGBA validation at YUV blit
3547
3548José Expósito (3):
3549
3550- zink: Fix crash on zink_create_screen error path
3551- zink: fix dereference before NULL check
3552- zink: allow software rendering only if selected
3553
3554José Roberto de Souza (51):
3555
3556- anv: Use workaround framework to Wa_14016118574
3557- intel/aux_map: Nuke format_enum
3558- intel/aux_map: Use get_aux_entry() in remove_mapping()
3559- intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT
3560- intel/aux_map: Rename some variables to improve readability
3561- intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask()
3562- intel/aux_map: Convert l1_entry_addr_out to canonical
3563- intel/aux_map: Drop magic sub table size number
3564- intel/aux_map: Add function and macro to return l2 and l1 table masks
3565- anv: Add gem_create_userptr() to KMD backend
3566- anv: Replace handle by anv_bo in the gem_close()
3567- anv: Add support for userptr in Xe KMD
3568- intel: Sync xe_drm.h
3569- intel/dev/xe: Add support for small-bar setups
3570- anv: Request Xe KMD to place BOs to CPU visible VRAM when required
3571- iris: Request Xe KMD to place BOs to CPU visible VRAM when required
3572- iris/xe: Call iris_lost_context_state() when batch engine is replaced
3573- intel/dev: Port intel_dev_info tool to Xe KMD
3574- iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code
3575- intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h
3576- intel/common: Move functions inside of C++ ifdef
3577- intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext()
3578- iris: Move i915_gem_set_domain() call to i915 backend
3579- iris: Move iris_bufmgr_bo_close() to kmd backend
3580- iris: Add gem_create_userptr() to KMD backend
3581- iris: Add support for userptr in Xe KMD
3582- intel/genxml/gen125: Add missing fields in MI_MATH
3583- iris: Set MI_MATH MOCS field
3584- anv: Set MI_MATH MOCS field
3585- intel/tests/mi_builder: Set MI_MATH MOCS field
3586- intel/genxml/gen125: Set MI_MATH MOCS field as non-zero
3587- anv: Nuke unused READ_ONCE() from anv_batch_chain.c
3588- anv: Remove VkAllocationCallbacks parameter from reloc functions
3589- anv: Return earlier in anv_reloc_list functions
3590- intel: Sync xe_drm.h and rename engine to exec_queue
3591- anv: Override vendorID for Hogwarts Legacy
3592- intel/isl: Remove unknown workaround
3593- intel/isl: Remove Wa_22011186057
3594- anv: Update Wa_16014390852 for MTL
3595- intel: Sync xe_drm.h
3596- anv: Move i915 specific gem_set_caching to backend
3597- anv: Move i915 specific code from common anv_gem.c
3598- anv: Move bo_alloc_flags_to_bo_flags() to backend
3599- anv: Move i915 handling of imported bos bo_flags
3600- anv: Remove i915_drm.h include from common code
3601- iris: Lock bufmgr->lock before call vma_free() in error path
3602- iris: Nuke useless flags from iris_fine_fence_new()
3603- intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms
3604- intel: Sync xe_drm.h
3605- anv: Switch Xe KMD vm bind to sync
3606- anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo()
3607
3608Juan A. Suarez Romero (7):
3609
3610- broadcom/ci: update expected results
3611- vc4/ci: update expected results
3612- v3d/shim: include new ioctl parameters
3613- v3dv/ci: update expected list
3614- broadcom: add performance counters for V3D 7.x
3615- broadcom/simulator: add per-hw version calls
3616- v3d/vc4/ci: add new fails/timeout
3617
3618Julia Tatz (10):
3619
3620- gallium/dri: fix dri2_from_names
3621- aux/trace: skip multi-line comments in enums2names
3622- aux/trace: deduplicate enum dump macro work
3623- aux/trace: move trace_sample_view logic
3624- aux/trace: fix set_hw_atomic_buffers method name
3625- aux/trace: add screen video methods
3626- aux/trace: add context video methods
3627- aux/trace: wrap video_codec & video_buffer
3628- aux/trace: unwrap refrence frames in picture_desc
3629- aux/trace: trace video_buffer method return vals
3630
3631Julia Zhang (1):
3632
3633- radeonsi: modify algorithm of skipping holes of sparse bo
3634
3635Julian Hagemeister (1):
3636
3637- Gallium: Fix shared memory segment leak
3638
3639Juston Li (10):
3640
3641- zink: remove venus from renderpass optimizations
3642- venus: sync protocol for VK_EXT_vertex_input_dynamic_state
3643- venus: implement VK_EXT_vertex_input_dynamic_state
3644- venus: set lvp queries as saturate on overflow
3645- venus: add helper function to get cmd handle
3646- venus: refactor out common cmd feedback functions
3647- venus: support deferred query feedback recording
3648- venus: track/recycle appended query feedback cmds
3649- venus: append query feedback at submission time
3650- venus: switch to unconditionally deferred query feedback
3651
3652Kai Wasserbäch (3):
3653
3654- fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer
3655- fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/
3656- fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result]
3657
3658Karmjit Mahil (29):
3659
3660- pvr: Remove mrt setup from SPM EOT
3661- pvr: Compile SPM EOT shader
3662- pvr: Use the SPM EOT on barrier stores
3663- pvr: Remove some magic numbers and increments from km stream
3664- pvr: Restructure \`rogue_kmd_stream.xml`
3665- pvr: Submit PR commands
3666- pvr: Use the correct size for the unified store allocation
3667- pvr: Allow query stage for barrier sub cmds
3668- pvr: Fix occlusion query unaccounted for user fences
3669- pvr: Fix writing query availability write out
3670- pvr: Fix packing issue with max_{x,y}_clip
3671- pvr: Fix csb relocation status assert on \`pvr_csb_finish()`
3672- pvr: Fix \`for` loop itarator usage
3673- pvr: Fix dynamic desc offset storage
3674- pvr: Fix cubemap layer stride
3675- pvr: Use the render passes' attachments array to setup ISP state
3676- pvr: Adjust EOT PBE state to account for the iview's base array layer
3677- pvr: Fix MRT index in PBE state
3678- pvr: Fix pbe_emit assert
3679- pvr: Fix OOB access of pbe_{cs,reg}_words
3680- pvr: Order tile buffer EOT emits to be last
3681- pvr: Fix subpass sample count on ds attachment only
3682- pvr: Refactor subpass ds and sample count setup
3683- pvr: Fix SPM load shader sample rate
3684- pvr: Fix PPP_SCREEN sizes
3685- vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper
3686- tu: Use common \`vk_subpass_dependency_is_fb_local()`
3687- pvr: Don't merge subpasses on framebuffer-global dependancy
3688- pvr: Only setup the bgobj to load if we have a load_op
3689
3690Karol Herbst (213):
3691
3692- nvc0: initial Ada enablement
3693- rusticl/mesa: make svm_migrate optional
3694- llvmpipe: enable system SVM
3695- nvc0: fix num_gprs for Volta+
3696- rusticl: fix warnings with newer rustc
3697- gm107/ir: fix SULDP for loads without a known format
3698- nv50/ir/nir: fix txq emission on MS textures
3699- nv50/ir/nir: Fix zero source handling of tex instructions.
3700- rusticl/kernel: only handle function_temp memory before lowering printf
3701- meson,ci: bump meson req for rusticl to 1.2
3702- rusticl/nir: add helper functions we need for a NIR_PASS macro
3703- rusticl/nir: add a nir_pass macro
3704- rusticl/nir: use the new nir_pass macro
3705- rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late
3706- rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late
3707- rusticl/kernel: move things around in lower_and_optimize_nir
3708- rusticl/kernel: get rid of initial function_temp type lowering
3709- rusticl/kernel: mark can_remove_var as unsafe and document it
3710- n50/compute: submit initial compute state in nv50_screen_create
3711- nvk: add vulkan skeleton
3712- nouveau/winsys: add the new winsys implementation
3713- nvk: use winsys lib
3714- nvk: fix nvk_buffer include guards
3715- nouveau/headers: add script to sync in-tree headers with open-gpu-doc
3716- nouveau/headers: initial sync of headers
3717- nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy
3718- nvk: advertize memory heaps and types
3719- nouveau/ws: reorganize a little
3720- nouveau/ws: dup the fd
3721- nouveau/ws: add a field for the SM version
3722- nvk: set nonCoherentAtomSize as the CTS divides with this value
3723- nouveau/ws: add bo API
3724- nvk: add basic device memory support
3725- nouveau/headers: add nvtypes.h
3726- nouveau/headers: typedef Nv void types
3727- nouveau/headers: add host classes
3728- nouveau/ws: add context support
3729- nouveau/ws: add a cmd buffer
3730- novueau/bo: refcount it
3731- novueau/bo: add nouveau_ws_bo_wait
3732- nvk: allocate a GPU context for each VkDevice
3733- nvk: add nvk_bo_sync
3734- nvk: add nvk_CmdPipelineBarrier2 stub
3735- nvk: impl nvk_CmdCopyBuffer
3736- nouveau/ws: fix setting push bo domains
3737- nouveau/ws: PUSH_IMMD only works with 16 bit values
3738- nouveau/ws: set GPU object class
3739- nouveau/ws: bind 2D class
3740- nvk: use fermi class definitions
3741- nvk: add basic support for images
3742- nvk: simple format table
3743- nvk: add support for blits
3744- nvk: report maxMipLevels as 1
3745- nvk: optimize blit command buffer gen
3746- nvk: implement CmdFillBuffer
3747- nvk: implement CmdUpdateBuffer
3748- nvk: implement CmdCopyBuffer2
3749- nvk: advertise VK_KHR_copy_commands2
3750- nvk: implicitly reset the command buffer
3751- nouveau/ws: handle 0inc inside nvk_push_val as well
3752- nvk: reduce pitch even further in CmdFillBuffer
3753- nvk: support multiple miplevels
3754- nvk: support array blits over multiple layers
3755- nvk: tiling prep work for VK_EXT_image_2d_view_of_3d
3756- nouveau/ws: make sure we don't submit nonsense
3757- nouveau/ws: assert on broken channel
3758- nvk/blit: assert that formats are supported
3759- nouveau/headers: Generate parser functions
3760- nouveau/ws: initial debugging options for command submissions
3761- nouveau/ws: depend on generated class header files
3762- nouveau/ws: get rid of libdrm
3763- nouveau/ws: use new NVIF interface to query oclasses
3764- nvk: set deviceName
3765- nouveau/headers: add path for 3D headers
3766- nouveau/headers: initial 3D headers import
3767- nouveau/ws: allocate 3D subchan
3768- nouveau/ws: allocate copy subchan as well
3769- nouveau/ws: add API to query if the context was killed
3770- nouveau/ws: add a bo unmap helper function
3771- nvk: clean up bo mappings
3772- nouveau/ws: bound check nouveau_ws_push_append
3773- nouveau/ws: rework refing push buffer bos
3774- nouveau/ws: push chaining
3775- nvk: fix OOB read inside nvk_get_va_format
3776- nvk: alloc a zero page and use it for vertex runouts
3777- nvk: fix zero page refing
3778- nvk: support exporting buffers
3779- nvk: fix some class version checks
3780- nvk: properly align shaders pre Turing
3781- nvk: rework QMD handling to support pre Turing
3782- nvk: align desc root table
3783- nvk: Use SET_PIPELINE_PROGRAM pre-Volta
3784- nvk: properly align slm size
3785- nvk: use remaps for image copies
3786- nvk: reduce pitch for FillBuffer
3787- nvk: bind more subchans in init_context_state
3788- nvk: support pre Maxwell Texture Headers
3789- nvk/device: fix order of error handling
3790- nvk: allocate VAB memory area
3791- nvk: wire up M2MF for Fermi
3792- nouveau/mme: add test for BEQ with magic exit offset
3793- nouveau/mme: add a macro exit helper
3794- nvk: Add a macro to set MMIO registers via falcons
3795- nouveau/winsys: fix SM value for Ada
3796- nvk: fix num_gprs for Volta+
3797- nvk: replace mp with tpc
3798- nvk: properly calculate SLM region by taking per arch limits into account
3799- nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled
3800- nvk: enable fp helper invocations loads on more gens
3801- nv50/ir: use own info struct for sys vals
3802- nv50/ir: convert system values to gl_system_value
3803- nouveau/mme: fix OOB access inside while_ine builder test
3804- nouveau/mme: fix OOB inside tu104 simulator
3805- clc: use CLANG_RESOURCE_DIR for clang's resource path
3806- nv50: fix code uploads bigger than 0x10000 bytes
3807- nouveau: take glsl_type ref unconditionally
3808- rusticl/kernel: optimize nir between lowering io and explicit types
3809- nv50: limit max code uploads to 0x8000
3810- zink: fix source type in load/store scratch
3811- zink: fix global stores
3812- zink: update some compute caps
3813- rusticl: add debug option to sync every event
3814- rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned
3815- ci: disable a660 jobs
3816- nir: make workgroup_id 32 bit only
3817- nir: make num_workgroups 32 bit only
3818- ac: drop 64 bit handling for cl workgroup intrinsics
3819- gallivm/nir: drop 64 bit handling for cl workgroup intrinsics
3820- intel/compiler: drop 64 bit handling for cl workgroup intrinsics
3821- panfrost: drop 64 bit handling for cl workgroup intrinsics
3822- rusticl: reduce global_invocation_id_zero_base to 32 bit
3823- panfrost: drop pan_nir_lower_64bit_intrin
3824- rusticl/disk_cache: fix stack corruption
3825- rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe
3826- rusticl/event: disable profiling for devices without timestamps
3827- rusticl/queue: properly implement clCreateCommandQueueWithProperties
3828- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
3829- rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported
3830- asahi: fetch available system memory
3831- asahi: lower hadd
3832- asahi: handle kernels
3833- asahi: handle load_workgroup_size
3834- asahi: handle load_global_invocation_id_zero_base
3835- asahi: implement get_compute_state_info
3836- asahi: implement set_global_binding
3837- asahi: implement clear_buffer
3838- asahi: gracefully handle allocating linear images
3839- asahi: handle images in is_format_supported
3840- rusticl/memory: fallback if allocating linear images fails
3841- rusticl: enable asahi
3842- rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS
3843- docs/features: cl_khr_3d_image_writes needs driver support
3844- rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer
3845- rusticl/kernel: skip adding global id offsets if not used
3846- meson/rusticl: add sha1_h
3847- rusticl/mesa/context: fix clear_sampler_views
3848- nir: add nir_lower_alu_vec8_16_srcs pass
3849- zink: lower vec8/16
3850- rusticl/mesa: create COMPUTE_ONLY contexts
3851- rusticl: fix clippys bool_to_int_with_if
3852- rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill
3853- nir/load_libclc: fix libclc memory leak
3854- rusticl/kernel: Fix creation from programs not built for every device
3855- ci: add half-life 2 freedreno flake
3856- zink: implement get_compute_state_info
3857- zink: copy has_variable_shared_mem cs property
3858- zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state
3859- zink: refactor spec constant handling
3860- zink: variable shared mem support
3861- zink: support more nir opcodes
3862- zink: make spirv_builder_emit_*op compatible with spec constants
3863- zink: support samplers with unnormalized_coords
3864- zink: implement remaining pack ops via bitcast
3865- zink: fix RA textures
3866- zink: fix load/store scratch offsets
3867- rusticl/mesa/screen,device: add driver_name
3868- rusticl: enable zink
3869- pipe-loader: allow to load multiple zink devices
3870- rusticl: bump rustc version to 1.66
3871- rusticl/mesa/nir: mark more methods as mut
3872- rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync
3873- rusticl/mesa: mark PipeResource as Send and Sync
3874- rusticl/mesa: mark PipeTransfer as Send
3875- rusticl/cl: mark _cl_image_desc as Send and Sync
3876- rusticl/queue: get rid of pointless Option around our worker thread handle
3877- rusticl/queue: make it Sync
3878- rusticl/kernel: get rid of Arcs in KernelDevStateVariant
3879- rusticl/memory: use get_mut instead of lock in drop
3880- zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS
3881- rusticl/api: remove cl_closure macro
3882- zink: implement load_global_constant
3883- zink: properly emit PhysicalStorageBufferAddresses cap
3884- nir/lower_mem_access_bit_sizes: fix invalid shift bit_size
3885- rusticl/device: restrict 1Dbuffer images for RGB and RGBx
3886- rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images
3887- rusticl/format: disable all sRGB formats
3888- asahi: flush denorms on exact fmin/fmax
3889- zink: wrap shared memory blocks in a struct
3890- zink: properly alias shared memory
3891- zink: fix zink_destroy_screen for early screen creation fails
3892- docs/features: remove empty lines confusing mesamatrix
3893- rusticl/device: restrict image_buffer_size
3894- rusticl/device: restrict param_max_size further
3895- rusticl/mem: properly set pipe_image_view::access
3896- zink: lower fisnormal as it requires the Kernel Cap
3897- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4
3898- rusticl/queue: Only take a weak ref to the last Event
3899- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user
3900- zink: deallocate global_bindings array
3901- rusticl/mesa/screen: do not derefence the entire pipe_screen struct
3902- nvc0: implement PIPE_CAP_TIMER_RESOLUTION
3903- rusticl/queue: do not send empty lists of event to worker queue
3904- rusticl/queue: fix implicit flushing of queue dependencies
3905
3906Kenneth Graunke (21):
3907
3908- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
3909- intel/compiler: Fix sparse cube map array coordinate lowering
3910- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER
3911- intel/fs: Account for payload GRFs when calculating register pressure
3912- intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions()
3913- intel/fs: Index scheduler mode string table by mode enum
3914- intel/fs: Make helpers for saving/restoring instruction order
3915- intel/fs: Pick the lowest register pressure schedule when spilling
3916- intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER
3917- iris: Check prog[] instead of uncompiled[] for BLORP state skipping
3918- nir: Fix function parameter indentation in nir_opt_barriers.c
3919- nir: Add an optimization pass to reduce barrier modes
3920- nir: Reduce the scope of shared memory barriers
3921- lavapipe: Don't delete control barriers
3922- virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers
3923- dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required
3924- glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers
3925- anv: Use nir_opt_barrier_modes() to drop unnecessary barriers
3926- mesa: Fix zeroing of new ParameterValues array entries when growing
3927- intel/fs: Fix Xe2 URB read/lowering with per-slot offsets
3928- anv: Add support for a transfer queue on Alchemist
3929
3930Kevron Rees (1):
3931
3932- Force vk vendor for spider-man remastered
3933
3934Konrad Dybcio (5):
3935
3936- freedreno: Set magic writes per-GPU, using existing data
3937- freedreno: Include speedbin fallback in 740 chipid to fix probing
3938- freedreno: Include speedbin fallback in 730 chipid to fix probing
3939- freedreno: Include speedbin fallback in 690 chipid to fix probing
3940- freedreno: Add Adreno 643
3941
3942Konstantin Seurer (95):
3943
3944- radv: Stop using the misleading round_up_u* functions
3945- radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS
3946- radv/meta_buffer: Rename size_minus16 to max_offset
3947- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
3948- nir/tests: Refactor boilerplate into a common header
3949- nir/tests: Use a single binary
3950- draw: Do not restart the primitive_id at 0
3951- gallivm: Fix subsampled format sampling under Vulkan
3952- gallivm: Ignore nir_tex_src_plane
3953- lavapipe: Remove dummy sampler ycbcr conversion
3954- lavapipe: Store immutable_samplers as lvp_sampler array
3955- lavapipe: Fix binding immutable samplers with desc buffers
3956- lavapipe: Implement samplerYcbcrConversion
3957- lavapipe: Advertise samplerYcbcrConversion
3958- llvmpipe: Zero extend vectors in widen_to_simd_width
3959- vulkan: Add a generated vk_properties struct
3960- radv: Use common physical device properties
3961- clang-format: Disable formatting by default
3962- lavapipe: Use common physical device properties
3963- nir/from_ssa: Don't insert store_reg instructions before phis
3964- gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa
3965- lavapipe/ci: Remove descriptor_indexing fails
3966- radv/rt: Rename shader_pc and next_shader
3967- radv/rt: Rename traversal_shader to traversal_shader_addr
3968- nir/opt_large_constants: Handle small float arrays
3969- bin: Update spirv sources
3970- vulkan: Allow beta extensions for physical device features
3971- vulkan: Allow beta extensions for physical device properties
3972- vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX
3973- nir: Add shader enqueue data structures and handling
3974- spirv: Update headers and grammer JSON
3975- spirv: Implement SPV_AMDX_shader_enqueue
3976- lavapipe: Add lvp_pipeline_type
3977- lavapipe: Implement exec graph pipelines
3978- lavapipe: Implement AMDX_shader_enqueue commands
3979- lavapipe: Advertise AMDX_shader_enqueue
3980- radv: Add internal_nodes_offset to scratch_layout
3981- radv: Remove leaf_args::dst_offset
3982- radv/rt: Remove some dead code
3983- radv/rt: Do not apply stack_ptr for non-recursive stages
3984- radv/rt: Add and use radv_build_traversal
3985- radv/rt: Insert rt_return_amd before lowering shader calls
3986- radv/rt: Split stage initialization and hashing
3987- aco: Do not fixup registers if there are no shader calls
3988- radv: Stop updating the stack_size in insert_rt_case
3989- lavapipe: Lock around CSO destroys
3990- vulkan/wsi/x11: Implement capture hotkey using the keymap
3991- venus: Use the common GetPhysicalDeviceFeatures2 implementation
3992- nir/lower_shader_calls: Limit the remat chain length
3993- lavapipe: Avoid lowering shaders twice
3994- lavapipe: Fix the locking around cso destruction
3995- aco/validate: Handle p_wqm like p_parallelcopy
3996- aco: Use bytes() instead of size() in emit_wqm
3997- aco: Unify demote and demote_if selection
3998- radv: Only generate debug info if required
3999- aco/lower_to_cssa: Fix typo
4000- radv: Don't use the depth image view for depth bias emission
4001- radv/rt: Store NIR shaders separately
4002- radv/rt: Add monolithic raygen lowering
4003- radv/rt: Enable monolithic pipelines
4004- radv/ci: Document new flake
4005- vulkan/properties: Handle unsized arrays properly
4006- radv: Remove dead radix_sort_vk_get_memory_requirements call
4007- radv/radix_sort: Vendor the radix sort dispatch code
4008- radv: Perform multiple sorts in parallel
4009- radv/ci: Improve ray tracing skips
4010- ac/llvm: Fix typed loads with 16bit formats
4011- ac/llvm: Use the correct return type for uadd_carry and usub_borrow
4012- ac/llvm: Use float types for float atomics
4013- radv: Don't advertise features requiring PS epilogs with LLVM
4014- radv: Update navi21 llvm fails
4015- radv/rt: Handle stages without nir properly
4016- radv: Remove ray tracing shader module identifier skips
4017- radv/bvh: Treat instances with mask == 0 as inactive
4018- radv/ray_queries: Skip cull_mask handling if it is FF
4019- radv/rt: Skip cull_mask handling if it is FF
4020- aco/spill: Make sure that offset stays in bounds
4021- nir: Add nir_cf_node_cf_tree_prev
4022- nir: Add nir_foreach_block_in_cf_node_reverse
4023- nir: Add nir_rematerialize_deref_in_use_blocks
4024- nir/lcssa: Fix rematerializing derefs
4025- nir/deref: Layer rematerialization helpers
4026- lavapipe/ci: Fix asan expectations
4027- hasvk: Use the common GetPhysicalDeviceFeatures2 implementation
4028- vulkan: Remove vk_get_physical_device_core_1_*_feature_ext
4029- radv/bvh/ploc: Load child bounds from LDS
4030- radv: Merge the sync_data and header initialization
4031- radv: Do not sync after radv_update_buffer_cp
4032- zink: Initialize primitive types to an invalid value
4033- nir/passthrough_gs: Support edge flags with points
4034- zink: Enable edge flags with points
4035- mesa: Fix glBegin/End when LINE_LOOP is not supported
4036- llvmpipe: Compile a nop texture function for unsupported configurations
4037- radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions
4038- radv/sqtt: Fix tracing acceleration structure commands
4039
4040Lang Yu (5):
4041
4042- amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property
4043- radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size
4044- radeonsi: use wave size to determine index stride
4045- amd/common: add missing stuff for gfx11.5
4046- amd/radeonsi: add missing stuff for gfx11.5
4047
4048Leandro Ribeiro (13):
4049
4050- egl: rewrite outdated comment in _eglFindDevice()
4051- egl: remove unused parameter from _eglAddDRMDevice()
4052- egl: simplify _eglAddDRMDevice()
4053- egl: make explicit that we don't support render nodes for software EGLDevice
4054- egl: move is_render_node flag to platform_wayland
4055- loader: rename loader_open_render_node() to loader_open_render_node_platform_device()
4056- loader: add driver list as parameter in loader_open_render_node_platform_device()
4057- pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd()
4058- dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension
4059- kmsro: try to use only compatible render-capable devices
4060- loader: add loader_is_device_render_capable()
4061- egl/drm: get compatible render-only device fd for kms-only device
4062- egl: error out if we can't find an EGLDevice in _eglFindDevice()
4063
4064Leo Liu (4):
4065
4066- radeonsi: add AV1 profile to supported profile
4067- radeonsi/vcn: fix the incorrect dt_size
4068- Revert "frontends/va: Also map VAImageBufferType for reading"
4069- ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG
4070
4071Lina Versace (14):
4072
4073- docs: Add row for VK_KHR_maintenance5
4074- intel/pci_ids: Consistently use lowercase
4075- venus: Sync protocol for VK_EXT_graphics_pipeline_library
4076- venus: Erase pViewports and pScissors in fewer cases
4077- venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing
4078- venus: Fix subpass attachments
4079- venus: Drop incorrectly-used always-true pipeline vars
4080- venus: Use VkImageAspectFlags in vn_subpass
4081- venus: Add enum vn_pipeline_type
4082- venus: Renames for VkGraphicsPipelineCreateInfo fixes
4083- venus: Refactor pipeline fixup into two stages
4084- venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library
4085- venus: Enable VK_EXT_graphics_pipeline_library behind debug flag
4086- venus: Fix -Wmaybe-uninitialized
4087
4088LingMan (22):
4089
4090- rusticl/memory: fix potential use-after-free in clEnqueueSVMFree
4091- rusticl: Rename XyzCB aliases to FuncXyzCB
4092- rusticl: add structs to hold the C callbacks
4093- rusticl: use CreateContextCB
4094- rusticl: use DeleteContextCB
4095- rusticl: use EventCB
4096- rusticl: use MemCB
4097- rusticl: use ProgramCB
4098- rusticl: use SVMFreeCb
4099- rusticl: Make EventSig take ownership of its environment
4100- rusticl: add a safe abstraction to execute a DeleteContextCB
4101- rusticl: add a safe abstraction to execute an EventCB
4102- rusticl: add a safe abstraction to execute a MemCB
4103- rusticl: add a safe abstraction to execute an SVMFreeCb
4104- rusticl: add a safe abstraction to execute a CreateContextCB
4105- rusticl: add a safe abstraction to execute a ProgramCB
4106- rusticl/api: drop a few include paths
4107- rusticl: mark the fields of callback structs private
4108- rusticl: drop an \`#[allow(dead_code)]` marker
4109- rusticl/core: don't take a lock while dropping \`Context`
4110- rusticl: Show an error message if the build is attempted with an outdated bindgen version
4111- rusticl: Show an error message if the version of bindgen can't be detected
4112
4113Lionel Landwerlin (169):
4114
4115- anv: hide exec_flags selection inside the i915 backend
4116- isl: add a tool to query surface parameters
4117- intel/fs: fix missing predicate on SEL instruction
4118- intel/compiler: rework input parameters
4119- ci/a530: switch a few tests to flakes to unblock CI
4120- vulkan: bump header register to 1.3.258
4121- intel/fs: don't try to rebuild sequences of non ssa values
4122- intel/vec4: fix log_data pointer
4123- intel/fs: consider UNDEF as non-partial write
4124- intel/fs: add more UNDEFs around SEND messages
4125- isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields
4126- anv: simplify buffer address+size loads from descriptor buffer
4127- intel/fs: add support for sparse accesses
4128- intel/nir: handle image_sparse_load in storage format lowering
4129- intel/nir: add lower for sparse images & textures
4130- anv: wire image sparse loads
4131- blorp: switch blorp_update_clear_color to early return
4132- blorp: update and move fast clear PIPE_CONTROLs to drivers
4133- anv: fix 3DSTATE_RASTER::APIMode field setting
4134- anv: enable EDS3 ConservativeRasterizationMode
4135- vulkan: skip non required extension structures
4136- vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer
4137- anv: enable INTEL_DEBUG=nofc
4138- anv: fake non intel vendorID for Death Stranding
4139- hasvk: fix null descriptor handling with A64 messages
4140- anv: remove descriptor array bounds checking
4141- hasvk: remove descriptor array bounds checking
4142- anv/hasvk: track robustness per pipeline stage
4143- anv: implement VK_EXT_pipeline_robustness
4144- intel/fs: track more steps with INTEL_DEBUG=optimizer
4145- intel/fs: add variable for output of debug backend optimizer
4146- intel/decoder: constify some input parameters
4147- blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER
4148- anv: emit 3DSTATE_GS only once per pipeline
4149- intel/decoder: add options to decode surfaces/samplers
4150- anv: get rid of genX(emit_multisample)
4151- anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c
4152- anv: don't try to access dynamic buffers from surface states
4153- iris: ensure stalling pipe control before fast clear
4154- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
4155- intel/compiler: fix dynamic alpha-to-coverage handling
4156- intel/fs: implement dynamic interpolation mode for dynamic persample shaders
4157- intel/fs: move lower of non-uniform at_sample barycentric to NIR
4158- zink+anv: add regression testing with pipeline libraries
4159- anv: implement vkCmdBindIndexBuffer2KHR
4160- anv: handle new VkBufferViewUsageCreateInfoKHR
4161- anv: add vkGetRenderingAreaGranularityKHR()
4162- anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR
4163- anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support
4164- anv: deal with new pipeline flags
4165- anv: enable KHR_maintenance5
4166- anv: add missing ISL storage usage
4167- genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER
4168- genxml/gfx12: rename Tiled Resource Mode
4169- isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented
4170- intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces
4171- isl: disable Yf/Ys/Tile64 tilings for 1D images
4172- isl: add a usage flag to request 2D/3D compatible views
4173- isl: disallow TileYs/Yf on 3D storage images on Gfx9/11
4174- intel/isl: Add a max_miptail_levels field to isl_tile_info
4175- isl: make isl_surf_get_uncompressed_surf robust to argument accesses
4176- isl: add Gfx12/12.5 restriction on 3D surfaces & compression
4177- isl: disallow miptails on planar formats
4178- isl: disable miptails on gfx12 with yuv formats
4179- isl: disable CCS on Ys/Yf
4180- blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling
4181- intel/aux_map: correctly program tiling mode for Ys
4182- isl: reorder tiling selection
4183- anv: enable standard Y tiles
4184- isl/tilememcpy_test: add multiple tile testing
4185- anv: rename total_batch_size
4186- anv: reuse cmd_buffer::total_batch_size
4187- intel/measure: track batch buffer sizes
4188- intel/nir: rerun lower_tex if it lowers something
4189- intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL
4190- hasvk: add state cache invalidation back before fast clears
4191- blorp: remove unused variable
4192- anv: remove ReorderMode from pipeline 3DSTATE_GS emission
4193- anv: change anv_batch_emit_merge to also do packing
4194- intel/anv: batch stats util
4195- intel/decoder: implement accumulated prints
4196- anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state
4197- anv: rename files to represent their usage
4198- anv: categorize partial/final pipeline instruction
4199- anv: split 3DSTATE_TE packing between static & dynamic parts
4200- anv: split 3DSTATE_VFG emission
4201- anv: add a flag tracking occlusion query count change
4202- anv: split pipeline programming into instructions
4203- vulkan/runtime: add helper to name dirty states
4204- anv: add new low level emission & dirty state tracking
4205- anv: remove unused state emission
4206- anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit
4207- docs: update Anv documentation about dynamic state emission
4208- anv: create individual logical engines on i915 when possible
4209- anv: Copy/Clear MSAA images over companion RCS while we are on compute
4210- pps-producer: add ability to select device with DRI_PRIME
4211- anv: remove aux checking asserts
4212- anv: bound image usages to the associated queue family
4213- anv: fix 3DSTATE_VFG emission
4214- anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled
4215- anv: ensure mesh pipeline have all pre-rasterization stages disabled
4216- anv: ensure partially packed instructions are emitted in the pipeline
4217- anv: fix missing 3DSTATE_SBE_MESH emission
4218- anv: fix utrace timestamp buffer copies
4219- anv: add a memcpy compute internal kernel
4220- anv: add simple shader support without a command buffer
4221- anv: move simple shaders code to its own object
4222- anv: move utrace flush out of backends
4223- anv: enable utrace timestamp buffer copies on compute engine
4224- intel: don't assume Linux minor dev node
4225- intel/ds: lock submissions to u_trace_context
4226- util/u_trace: count number of tracepoints
4227- intel/ds: track number of tracepoint timestamp copies
4228- anv/utrace: trace CPU on timestamp buffer readiness
4229- intel/ds: avoid dropping traces when running out of shared memory
4230- anv/iris: widen Wa_14015946265 to Gfx11+
4231- anv: add missing workaround for 3DSTATE_LINE_STIPPLE
4232- iris: add missing workaround for 3DSTATE_LINE_STIPPLE
4233- intel/fs: handle ishl in surface/sampler rematerialization
4234- intel/fs: handle add3 in surface/sampler rematerialization
4235- intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization
4236- anv: fix internal compute copy shader build
4237- anv: reduce working temporary memory for BVH builds
4238- anv: move bo_pool allocation flags to init caller
4239- anv: use buffer pools for BVH build buffers
4240- intel/ds: track acceleration RT commands
4241- anv: fix index buffer size programming
4242- anv: implement INTEL_DEBUG=reemit
4243- anv: add missing workaround handling in simple shader
4244- anv: fix a couple of missing input for 3DSTATE_RASTER programming
4245- anv: flag 3DSTATE_RASTER as dirty after simple shader primitive
4246- vulkan: bump headers/registry to 1.3.267
4247- anv: rename primary in container in ExecuteCommands()
4248- anv: add support for VK_EXT_nested_command_buffer
4249- anv: simplify push descriptors
4250- anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5
4251- Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL"
4252- anv: update batch chaining to Gfx9 commands
4253- anv: workaround Gfx11 with optimized state emission
4254- u_trace: generate tracepoint index parameter in perfetto callbacks
4255- u_trace: generate tracepoint name array in perfetto header
4256- intel/ds: provide names for different events of a timeline's row
4257- anv: reuse local variable for gfx state
4258- anv: track render targets & render area changes separately
4259- anv: don't uninitialize bvh_bo_pool is not initialized
4260- anv: uninitialize queues before utrace
4261- anv: move generation shader return instruction to last draw lane
4262- anv: fix generated draws gl_DrawID with more than 8192 indirect draws
4263- anv: extract out draw call generation
4264- anv: identify internal shader in NIR
4265- anv: avoid MI commands to copy draw indirect count
4266- anv: move generation batch fields to a sub-struct
4267- util/glsl2spirv: add ability to pass defines
4268- anv: factor out host/gpu internal shaders interfaces
4269- anv: index indirect data buffer with absolute offset
4270- anv: add ring buffer mode to generated draw optimization
4271- anv: merge gfx9/11 indirect draw generation shaders
4272- anv: document the draw indirect optimization ring mode
4273- anv: fixup 32bit build of internal shaders
4274- anv: fix uninitialized use of compute initialization batch
4275- intel/fs: fix dynamic interpolation mode selection
4276- anv/meson: add missing dependency on the interface header
4277- anv: fix corner case of mutable descriptor pool creation
4278- isl: disable MCS compression on R9G9B9E5
4279- intel/fs: rerun divergence analysis prior to convert_from_ssa
4280- intel/nir/rt: fix reportIntersection() hitT handling
4281- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders
4282- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC
4283- intel/perf: fix querying of configurations
4284
4285Louis-Francis Ratté-Boulianne (15):
4286
4287- panfrost: Fix error in comment
4288- panfrost: Add methods to determine slice and body alignment
4289- panfrost: Add method to get size of AFBC subblocks
4290- panfrost: Precalculate stride and nr of blocks for AFBC layouts
4291- panfrost: Add panfrost_batch_write_bo
4292- panfrost: Make panfrost_resource_create_with_modifier public
4293- panfrost: Split out internal of \`panfrost_launch_grid`
4294- panfrost: Add infrastructure for internal AFBC compute shaders
4295- panfrost: Add method to get size of AFBC superblocks valid data
4296- panfrost: Add support for AFBC packing
4297- panfrost: Legalize resource when attaching to a batch
4298- panfrost: Don't force constant modifier after converting
4299- panfrost: Add debug flag to force packing of AFBC textures on upload
4300- panfrost: Add some debug utility methods for resources
4301- panfrost: Add env variable for max AFBC packing ratio
4302
4303Lucas Stach (33):
4304
4305- ci/etnaviv: update ci expectation
4306- etnaviv: move resource seqnos to level
4307- etnaviv: flush destination before executing blit
4308- etnaviv: optimize resource copies by skipping clean levels
4309- etnaviv: add helper to mark resource level as flushed
4310- etnaviv: add helper to mark resource level as changed
4311- etnaviv: add helper to transfer resource level age to another
4312- etnaviv: add helper to get TS validity
4313- etnaviv: add helper to set TS validity
4314- etnaviv: move TS meta into etna_resource_level
4315- etnaviv: add tile status buffer status into TS metadata
4316- etnaviv: optimize sampler source update
4317- etnaviv: allow sampler TS even if the resource is flushed
4318- etnaviv: keep blit destination tile status valid if possible
4319- etnaviv: optimize render resource update
4320- etnaviv: optimize transfers when whole resource level is discarded
4321- etnaviv: split etna_copy_resource_box levels parameter in src/dst
4322- etnaviv: don't allocate full resource as transfer staging
4323- etnaviv: check for valid TS as condition to create the staging resource
4324- etnaviv: reword comment about staging resource usage
4325- etnaviv: remove huge outdated comment
4326- etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause
4327- etnaviv: remove superfluous braces
4328- etnaviv: remove always true assert in etna_transfer_unmap
4329- etnaviv: remove bogus comment about replacing resource storage
4330- etnaviv: initialize VIVS_GL_BUG_FIXES
4331- etnaviv: fix read staging buffer leak
4332- Revert "ci/etnaviv: allow failure on failing test"
4333- mesa: enable NV_texture_barrier in GLES2+ (again)
4334- etnaviv: use correct blit box sizes when copying resource
4335- etnaviv: zero shared TS metadata block
4336- Revert "etnaviv: use correct blit box sizes when copying resource"
4337- mesa: add GL_APPLE_sync support
4338
4339Luigi Santivetti (1):
4340
4341- pvr: do not claim support for ASTC texture compression
4342
4343M Henning (31):
4344
4345- nv50/ir: Drop nir_jump_return handling
4346- nv50/ir: Remove ArgumentMovesPass
4347- nv50/ir: Remove Function.stackPtr
4348- nv50/ir: Remove dead loop from assignSlot
4349- nv50/ir: Remove SpillSlot
4350- nvc0: Keep nir directly in nvc0_program
4351- nv50: Keep nir directly in nv50_program
4352- nouveau: Delete nv50_ir_from_tgsi.cpp
4353- nouveau: Drop tgsi support from nv50_ir_prog_info
4354- nouveau: Drop ConverterCommon::Subroutine
4355- nouveau: Drop BuildUtil::DataArray
4356- nouveau: Drop BuildUtil::Location
4357- nouveau: Delete the nouveau_compiler tool
4358- nv/codegen: Call nir_shader_gather_info
4359- nv/codegen: Implement nir_op_fquantize2f16
4360- nvk: Remove reference to genUserClip
4361- nv/codegen: Use nir_lower_clip
4362- nv50_ir_from_nir: Use nir's lower_fpow
4363- nv/codegen: Delete OP_POW
4364- nv/codegen: Fix an uninitialized variable warning
4365- nv/codegen: Delete OP_WRSV
4366- nv/codegen: Delete OP_EXP, OP_LOG
4367- nv/codegen: Remove fragCoord variable.
4368- nv/codegen: Merge from_common into from_nir
4369- nv/codegen: Remove unused clipVertexOutput var
4370- nv50_ir_ra: Delete unused functions
4371- nv/codegen: Delete unused OP_CONSTRAINT
4372- nv/codegen: Delete periodicMask32
4373- nv/codegen: Remove Function::buildDefSets
4374- nv/codegen: Change copy-constructor call to assign
4375- nv/codegen: Delete copy and assign
4376
4377Maaz Mombasawala (2):
4378
4379- svga: Make surfaces shareable at creation.
4380- svga: Unify gmr and mob surface pool managers
4381
4382Marcin Ślusarz (16):
4383
4384- iris: avoid duplicating validation entries
4385- hasvk: remove dead code & comments related to mesh shading
4386- anv: drop support for VK_NV_mesh_shader
4387- intel/compiler: remove NV_mesh_shader support
4388- intel/compiler: remove redundant code
4389- anv: drop unused function
4390- anv: merge cases leading to the same code
4391- intel/compiler/mesh: compactify MUE layout
4392- intel/compiler,anv: put some vertex and primitive data in headers
4393- intel/compiler: load debug mesh compaction options once
4394- intel/compiler/test: fix crashes when TEST_DEBUG is set
4395- intel/compiler: add lsc_msg_desc_wcmask
4396- intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2
4397- intel/compiler/mesh: fix position of output URB handle for xe2
4398- intel/compiler/mesh: implement IO for xe2
4399- intel/compiler: mask GS URB handles at thread payload construction
4400
4401Marek Olšák (125):
4402
4403- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling."
4404- glthread: determine global locking once every 64 batches to fix get_time perf
4405- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
4406- freedreno,lima,zink: update CI fixes and flakes
4407- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
4408- util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag
4409- radeonsi: fix a CDNA regression breaking compute
4410- glthread: sync for VDPAU sync functions
4411- radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets
4412- radeonsi: restructure the loop for non-indexed multi draws
4413- radeonsi: cosmetic changes to radeon_opt_* macros
4414- radeonsi: handle draw user SGPRs as tracked registers
4415- radeonsi: update obsolete comments about compiler queues
4416- radeonsi: remove si_compute.h, move the contents into si_pipe.h
4417- radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp
4418- radeonsi: move si_emit_spi_map into si_state_shaders.cpp
4419- radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states
4420- radeonsi: remove splitting IBs that use too much memory
4421- radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf
4422- radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer
4423- radeonsi: remove the draw counter with primitive restart from the HUD
4424- radeonsi: always inline si_prefetch_shaders
4425- radeonsi: specialize si_draw_rectangle using a C++ template
4426- radeonsi: add index parameter into si_atom::emit
4427- radeonsi: split direct pm4 emission from si_pm4_emit
4428- radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state
4429- radeonsi: merge pm4 state and atom emit loops into one
4430- radeonsi: add a simple version of si_pm4_emit_state for non-shader states
4431- radeonsi: handle deferred cache flushes as a state (si_atom)
4432- radeonsi: remove render condition logic from si_draw by reordering atoms
4433- radeonsi: abort when failing to upload descriptors instead of skipping draws
4434- radeonsi: rename shader_pointers state -> gfx_shader_pointers
4435- radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers
4436- radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom
4437- radeonsi/ci: update gfx11 failures
4438- radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state
4439- radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param
4440- radeonsi: enable shader culling by default because it helps Viewperf
4441- radeonsi: rewrite how occlusion query precision is determined for performance
4442- radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly
4443- radeon_winsys: move allow_context_lost from cs_create to ctx_create
4444- winsys/amdgpu: rework how SW reset status is generated and reported
4445- radeon_winsys: add a ctx_set_sw_reset_status callback
4446- radeonsi: don't abort for descriptor failures, let the winsys handle it
4447- radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0
4448- radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0
4449- nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant
4450- nir/algebraic: collapse ALU opcodes sourcing NaN
4451- ac/gpu_info: add the /dev/dri/ filename into radeon_info
4452- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info"
4453- ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY
4454- ac: document ac_shader_args::gs_vtx_offset
4455- ac: minor updates to packet documentation and definitions
4456- ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts
4457- ac: improve the IB parser
4458- ac: update gfx11 shadowed register tables
4459- ac: add a standalone IB parser program
4460- ac/surface: trivial non-functional changes
4461- ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch
4462- radeonsi: allow setting any index in radeon_set_sh_reg_idx
4463- radeonsi: rename uses_subgroup_info to uses_tg_size
4464- radeonsi: improve the heuristic when to use Wave32 for compute shaders
4465- radeonsi: simplify/merge emit_shader_ngg functions
4466- radeonsi: don't pass gl_Layer to PS for blit shaders
4467- radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits
4468- radeonsi: fix templated si_draw_rectangle callback for Navi14
4469- nir: replace undef only used by ALU opcodes with 0 or NaN
4470- nir: remove nir_op_unpack_64 handling from nir_opt_undef
4471- ac/llvm: don't convert undef to 0 because nir_opt_undef does it now
4472- meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1
4473- gallivm: fix build with LLVM 18
4474- amd/llvm: fix build with LLVM 18
4475- radeonsi: fix compute-only contexts
4476- ac/llvm: replace removed amdgcn.ldexp for LLVM 18
4477- ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11
4478- ac/llvm: set !fpmath 3.0 for llvm.sqrt
4479- ac/gpu_info: don't align IBs to the GL2 cache line size
4480- ac/llvm: fix flat PS input corruption
4481- amd: rename GFX110x to NAVI31-33
4482- ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments
4483- ac/gpu_info: pad IBs according to ib_size_alignment
4484- winsys/amdgpu: pad gfx and compute IBs with a single NOP packet
4485- Revert "radeonsi: specialize si_draw_rectangle using a C++ template"
4486- radeonsi/ci: update navi10 results
4487- gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements
4488- gallium/util: add more tests for compute-only contexts
4489- radeonsi: add another aux context for uploading shaders
4490- radeonsi: upload shaders via a staging buffer so as not to map VRAM directly
4491- ac/surface: don't require exact pitch for gfx6-8 tiled imports
4492- Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG"
4493- Revert "radv/amdgpu: fix alignment of command buffers"
4494- Revert "radv: fix alignment of DGC command buffers"
4495- Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet"
4496- Revert "ac/gpu_info: pad IBs according to ib_size_alignment"
4497- Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments"
4498- nir: sort variables by location in nir_lower_io_passes to work around a bug
4499- nir: recompute IO bases after DCE in nir_lower_io_passes
4500- nir: add dual-slot input information into load_input intrinsics
4501- nir: take dual slot input info into account when computing IO driver locations
4502- nir: gather dual slot input information
4503- nir: expose reusable linking helpers for cloning uniform loads
4504- nir: handle nir_var_mem_ubo in nir_clone_uniform_variable
4505- ac/gpu_info: split ib_alignment as ip[type].ib_alignment
4506- ac/gpu_info: move ib_pad_dw_mask into ip[]
4507- ac/gpu_info: drop the hack unifying all IB alignments
4508- ac/gpu_info: conservatively decrease IB alignment and padding to 256B
4509- ac/gpu_info: set gfx and compute IB padding to only 8 dwords
4510- winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop
4511- winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1
4512- winsys/amdgpu: pad gfx and compute IBs with only 1 NOP
4513- ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance
4514- radeonsi: disable register shadowing without SR-IOV to fix bad performance
4515- winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set
4516- radeonsi/ci: update gfx1100 results
4517- nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags
4518- nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc.
4519- mesa: don't pass Infs to the shader via gl_Fog.scale
4520- radeonsi/ci: update the runner for new build scripts
4521- radeonsi/ci: enable GTF tests in the runner
4522- radeonsi/ci: enable GLES CTS in the runner
4523- radeonsi/ci: update failures and flakes
4524- amd/common: update DCC for gfx11.5
4525- radeonsi: initialize perfetto in the right place
4526- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption
4527- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO
4528
4529Marek Vasut (1):
4530
4531- etnaviv: Fully replicate back stencil config
4532
4533Mark Collins (10):
4534
4535- tu/a7xx: Adapt r3d blits for A7xx
4536- freedreno/rnn: Remove %n usage in fprintf
4537- freedreno: Only add drm/computerator when system_has_kms_drm
4538- freedreno/decode: Support building replay for multiple KMDs
4539- freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB
4540- meson: Warn about side-effects from DRM for FD KMDs
4541- meson: Update libarchive to v3.7.2-2
4542- freedreno/common: Add max_sets property to A6xxGPUInfo
4543- tu: Support higher descriptor set count for A7XX
4544- tu,util/driconf: Add option to not reserve descriptor set
4545
4546Mark Janes (1):
4547
4548- intel: allow reduced memory usage for INTEL_MEASURE
4549
4550Martin Roukala (né Peres) (22):
4551
4552- radv/ci: drop the auto-reboot-on-hang for vkcts-navi10
4553- radv/ci: use the default kernel on vkcts-navi10
4554- zink/ci: automatically reboot when hitting a kernel BUG on vangogh
4555- zink/ci: document more flakes seen on vangogh
4556- radv/ci: move vkcts-navi10 testing to KWS
4557- radv/ci: add more tests to the navi10 vkcts flake list
4558- radv/ci: increase the parallelism of the vkcts-navi21 job
4559- radv/ci: add more tests to the navi21 vkcts flake list
4560- radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes
4561- radv/ci/vkcts-navi21: document more flakes
4562- radv/ci/vkcts-navi10: catch all the line-related flakes
4563- radv/ci: update the vkcts gfx1100 flake/fail lists
4564- radv/ci: add a manual job to run vkcts on navi31
4565- radv/ci: add a manual job for vkd3d-proton on navi31
4566- ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake
4567- ci/vkcts-navi21: mark more of the RT handles checks as flakes
4568- ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs
4569- zink/ci: remove 19 tests from the zink-radv-polaris10-fails list
4570- ci/b2c: switch containers to a back-up ahead of valve-infra renaming
4571- zink/ci: remove 42 tests from the zink-radv-polaris10-fails list
4572- radv/ci: tighten the vkcts-navi21 timeouts
4573- zink/ci: tighten the zink-radv-vangogh timeouts
4574
4575Martin Stransky (1):
4576
4577- llvmpipe: fix UAF in lp_scene_is_resource_referenced.
4578
4579Mary (6):
4580
4581- nouveau/mme: Add initial Fermi definition
4582- nouveau/mme: Add Fermi builder
4583- nouveau/mme: Add Fermi simulator
4584- nouveau/mme: Add Fermi hardware tests
4585- agx: Move nir_lower_fragcolor out of agx_preprocess_nir
4586- agx: Ensure to lower 1D image load/store to 2D
4587
4588Mary Guillemard (4):
4589
4590- nir: Add NVIDIA-specific geometry shader opcodes
4591- venus: skip bind sparse info when checking for feedback query
4592- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C
4593- venus: Do not submit batch manually when no feedback is required
4594
4595Matt Coster (21):
4596
4597- pvr: Pad rogue_regarray_cache_key union members to avoid UB
4598- pvr: Clean up extension tables
4599- pvr: Refactor pvr_GetPhysicalDeviceProperties2()
4600- docs: Fixup imagination/pvr extension support
4601- pvr: Add VK_KHR_get_display_properties2
4602- pvr: Add VK_KHR_get_memory_requirements2
4603- pvr: Add VK_KHR_get_surface_capabilities2
4604- pvr: Print VkStructureType name on pvr_debug_ignored_stype()
4605- pvr: Add VK_KHR_copy_commands2
4606- pvr: Don't override commands copied to new buffer when extending cs
4607- pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable
4608- pvr: Zero tail of cs buffers after linking when dumping cs
4609- pvr: Cleanup comments in pvr_physical_device_get_supported_*()
4610- pvr: Don't rely on GNU void pointer arithmetic
4611- pvr: Force compile error on GNU void pointer arithmetic
4612- pvr: Switch to common pipeline cache implementation
4613- pvr: Use vk_sampler base
4614- pvr: Clean up & fix sampler border color support
4615- pvr: Don't pass pvr_physical_device when only device info is needed
4616- pvr: Minor refactor of pvr_device.c
4617- pvr: Use common physical device properties
4618
4619Matt Turner (10):
4620
4621- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput"
4622- intel: Rearrange for next commit
4623- intel: Consider with_intel_clc in with_any_intel
4624- intel: Only build blorp if drivers are enabled
4625- intel: Only build ds if drivers are enabled
4626- intel: Only build perf if drivers or tools are enabled
4627- intel: Allow using intel_clc from the system
4628- intel: Limit Intel Vulkan RT to x86_64
4629- r600: Add missing dep on git_sha1.h
4630- util: Include stdint.h in libdrm.h
4631
4632Mauro Rossi (7):
4633
4634- nouveau/ws: fix building error in nouveau_ws_push_dump()
4635- vulkan/meta: fix gnu-empty-initializer build error
4636- nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE
4637- anv/android: remove numFds check
4638- hasvk/android: remove numFds check
4639- Android.mk: filter out cflags to build with Android 14 bundled clang
4640- Android.mk: disable android-libbacktrace to build with Android 14
4641
4642Mike Blumenkrantz (293):
4643
4644- ci: bump VVL to 1.3.257
4645- zink: set pipeline dynamic state count after all dynamic states are set
4646- zink: set feedback attachments on batch init
4647- zink: be even dumber about buffer refs when replacing storage
4648- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
4649- zink: fix the fix for separate shader program refcounting
4650- kopper: handle pixmap creation failure more gracefully
4651- glxsw: check geometry of drawables on creation
4652- kopper: move pixmap param for drawable creation to info struct
4653- glx/dri3: split out modifier check
4654- glx/sw: check for modifier support in the kopper path
4655- kopper: pass modifier availability to drawable creation
4656- kopper: determine modifier support per-drawable
4657- zink: don't clobber descriptor mode on multiple screen creation
4658- nir: fix slot calculations for compact variables with location_frac
4659- lavapipe: use the component offset directly for xfb
4660- nir: add a helper for calculating variable slots
4661- radv: bump max xfb output to 128
4662- ir3: bump max xfb output to 128
4663- gallium: bump PIPE_MAX_SO_OUTPUTS to 128
4664- zink: add feedback loop exts to optimal profile
4665- glsl: only explicitly check GS components in PSIZ injection with output variables
4666- lavapipe: statically allocate fb attachment array
4667- lavapipe: zero fb attachment array at rp start
4668- lavapipe: don't check geometry for fb attachments
4669- lavapipe: be slightly more permissive for bad apps (and cts) with dynrender
4670- lavapipe: VK_EXT_host_image_copy
4671- zink: better handle separate shader dsl creation when no bindings exist
4672- zink: force image barriers after dmabuf import
4673- ci: bump VVL to 1.3.261
4674- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
4675- zink: unset line stipple ds3 state flags when stipple not available
4676- nir/lower_io_to_scalar: fix 64bit io splitting
4677- nir/linking_helpers: force type matching in does_varying_match
4678- nir/print: print location names for (some) tess slots
4679- nir/print: always group variables by type when printing
4680- zink: add batch refs for transient images
4681- zink: fix zs resolve attachment indexing
4682- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
4683- zink: don't append msrtss to dynamic render if not supported
4684- zink: set msrtss depth resolve mode when enabled
4685- zink: hook up VK_KHR_workgroup_memory_explicit_layout
4686- zink: propagate have_workgroup_memory_explicit_layout to ntv
4687- zink: use SPV_KHR_workgroup_memory_explicit_layout when available
4688- zink: add more locking for pipeline cache
4689- zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT
4690- aux/trace: fix winsys handle dumping
4691- zink: generated tcs is on the tes, not the vs
4692- zink: apply ZINK_DEBUG=noopt to linked separate shaders
4693- gallivm: handle A8_UNORM image stores
4694- llvmpipe: enable A8_UNORM for shader images
4695- llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED
4696- lavapipe: GetRenderingAreaGranularityKHR
4697- llvmpipe: block weird uses of subsampled formats in buffers
4698- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
4699- lavapipe: fix BindVertexBuffers2 buffer size handling
4700- lavapipe: fix resolves where src image has a layer offset
4701- lavapipe: block yuv formats from getting blit feature flags
4702- lavapipe: BindIndexBuffer2
4703- lavapipe: GetDeviceImageSubresourceLayoutKHR
4704- lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops
4705- lavapipe: maintenance5
4706- zink: fix xfb buffer array sizing to use buffer limit, not output
4707- zink: move ZINK_DEBUG=nir printing to just before compile
4708- draw: fix so debug offset printing
4709- zink: reindex ssa defs before dumping debug shaders
4710- lavapipe: zero-init pipe_sampler_state
4711- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
4712- zink: fix big tcs output io
4713- zink: don't try to replace separate shader prog in noopt mode
4714- zink: pre-convert mode in fixup_io_locations
4715- zink: add a special separate shader i/o mode for legacy variables
4716- nir: minor fixes for io_to_scalar
4717- nir/lower_io: add a new doubles-only 64bit lowering option
4718- nir: add a filter cb to lower_io_to_scalar
4719- d3d10umd: use cso_context to set vertex buffers and elements
4720- virgl: move virgl_vertex_elements_state to header
4721- virgl: fix some indentation
4722- nouveau: calloc vertex csos
4723- gallium: move vertex stride to CSO
4724- zink: fix null config screen creation
4725- zink: fix crash in lower_pv_mode_gs_store
4726- u/draw: skip zero-sized indirect draws
4727- lavapipe: handle VkPipelineCreateFlagBits2KHR
4728- lavapipe: handle VkBufferUsageFlags2KHR
4729- zink: ci updates
4730- zink: track start/stop of a couple query types
4731- zink: require EDS1 for CWE usage
4732- zink: unset primgen suspended flag when ending a primgen query
4733- zink: rework rast-discard for primgen queries
4734- zink: rip out some awkward parts of the old non-cwe path
4735- zink: drop CWE requirement for renderpass tracking with primgen queries
4736- nir/zink: fix gs emulation xfb_info sizing
4737- zink: move fragcolor lowering further along the compile process
4738- zink: add a mode param to find_var_with_location_frac
4739- zink: use lowered io (kinda) for i/o vars
4740- zink: stop lowering indirect derefs
4741- ntt: handle interp intrinsics as derefs
4742- zink: delete split_blocks pass
4743- zink: delete lower_64bit_vertex_attribs pass
4744- zink: fix clip/cull dist xfb inlining
4745- zink: delete all the extra gross xfb handling
4746- zink: stop using pipe_stream_output
4747- zink: remove pipe_stream_output from function params
4748- zink: ci updates
4749- aux/trace: print bindless handles as pointers
4750- zink: remove unused param from create_ici
4751- zink: split create_ici to init and eval
4752- zink: add maintenance extensions to profile
4753- zink: use maintenance5
4754- zink: use real A8_UNORM when possible
4755- vk/graphics: fix CWE handling with DS3
4756- Revert "vk/wsi/x11: handle geometry updating more asynchronously"
4757- r600: store the mask of buffers used by a vertex state
4758- r600: better tracking for vertex buffer emission
4759- zink: wait on async fence during ctx program removal
4760- zink: handle patch variable locations for separate shaders better
4761- zink: don't start multiple cache jobs for the same program
4762- zink: use the "set" optimal key for prog last_variant_hash for consistency
4763- zink: sanitize optimal keys
4764- zink: copy some cs shader properties to the program struct
4765- zink: handle global atomic intrinsics
4766- zink: use Aligned with global load/store ops
4767- zink: fix rewrite_read_as_0 filtering
4768- rusticl: fixes for zink shader images
4769- zink: pass KERNEL shaders through successfully
4770- zink: add a618 flake
4771- zink: break out ds3 state resetting
4772- zink: be consistent with ds3 state resetting for blits
4773- zink: fix optimal_keys warning message
4774- zink: force-reset unordered flags for buffer barriers on non-matching batch access
4775- zink: reset unordered flags for image barriers on non-matching batch access
4776- zink: make image barrier init functions void return
4777- zink: simplify some image barrier conditionals
4778- zink: remove sync TODO
4779- zink: add lavapipe flake
4780- ci: disable nouveau shaderdb
4781- egl/dri3: only set driver_name if not already set
4782- egl: call dri3_x11_connect() for zink
4783- egl: bind dri2_set_WL_bind_wayland_display for zink when necessary
4784- zink: be more precise about flagging rp changes around unordered u_blitter
4785- zink: don't block reordering during ref updates in unordered blits
4786- lavapipe: update vbo indices before propagating stride
4787- lavapipe: fix pipeline stride propagation
4788- zink: fix linear modifier dmabuf imports
4789- zink: polaris ci updates
4790- aux/tc: handle stride mismatch during rp-optimized subdata
4791- zink: always add a per-prog ref for gpl libs
4792- zink: use a pointer to simplify submit struct mechanics
4793- zink: make zink_resource_image_barrier2_init public
4794- zink: add a third submitinfo (unused for now)
4795- zink: make submitinfo handling easier to manage with enum
4796- zink: add another submitinfo for fd semaphore waits
4797- zink: add a screen cache for fd semaphores
4798- zink: add a util for getting cached fd semaphores
4799- zink: hook up cached fd semaphore usage for batch signal/waits
4800- zink: handle implicit sync for dmabufs
4801- zink: handle multi-plane implicit sync
4802- zink: ci updates
4803- zink: set is_xfb=false for all i/o variables
4804- zink: reorder bindless io lowering
4805- zink: fix typing on bindless io lowering
4806- zink: delete some bindless io lowering code
4807- zink: use nir_io_semantics::num_slots for indirect var creation
4808- zink: simplify an arrayed io check during variable creation
4809- zink: use explicit stride from types instead of copying old_var stride
4810- zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing
4811- zink: use explicit sizing for builtins when creating variables
4812- zink: create new vars without copying existing ones
4813- zink: add a new linker pass to handle mismatched i/o components
4814- zink: use right function to get src_type in eliminate_io_wrmasks
4815- zink: re-rework i/o variable handling to make having variables entirely optional
4816- ci: bump VVL to 1.3.263
4817- zink: simplify redundant is_buffer check
4818- zink: use VkFormatProperties3
4819- lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT
4820- lavapipe: don't advertise UNDEFINED layout for HIC
4821- zink: hook up VK_EXT_host_image_copy
4822- zink: move mem type detection up in file
4823- zink: disable HIC without resizable BAR
4824- zink: add a fixup method for extra driver props
4825- zink: fix some off-by-one indentation
4826- zink: use some return codes for check_ici errors
4827- zink: check/use suboptimal HIC during ici init
4828- zink: use HIC for image subdata when possible
4829- zink: slightly refactor psiz deletion during linking
4830- zink: delete all psiz=1.0 stores if maintenance5 is present
4831- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
4832- zink: add ZINK_DEBUG=quiet
4833- zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip
4834- zink: set optimal_keys for turnip jobs
4835- aux/tc: fix staging buffer sizing for texture_subdata
4836- aux/tc: fix address calc for segmented texture subdata
4837- zink: ci updates
4838- lavapipe: KHR_map_memory2
4839- zink: slightly refactor pipeline compile selection
4840- zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED
4841- zink: remove an intermediate variable in pipeline compile selection
4842- zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path
4843- zink: pass a stage mask to pipeline create functions
4844- glsl: check for xfb setting xfb info
4845- zink: don't warn about missing scalarBlockLayout on v3dv
4846- aux/tc: fix renderpass tracking fb state clobber scenario
4847- vk/enum2str: add more max enum vendors
4848- aux/tc: fix rp info handling around tc_sync calls
4849- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata
4850- zink: flag db maps as unsynchronized
4851- lavapipe: clamp cache uuid size
4852- lavapipe: EXT_load_store_op_none
4853- tu: handle unused color attachments without crashing
4854- zink: use much bigger dummy surfaces
4855- zink: propagate rp_tc_info_updated across unordered blits
4856- zink: use null attachments for null attachments with dynamic render
4857- egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque
4858- egl/wayland: split out wl drm extension init
4859- egl/wayland: use more registry listeners to better handle device init
4860- egl/wayland: enable WL_bind_wayland_display for zink
4861- zink: delete injected pointsize during shader creation
4862- zink: require maintenance5 for shobj
4863- zink: delete a non-maintenance5 workaround for shobj use
4864- lavapipe: set separate_shaders for shader objects
4865- zink: set workgroup_memory_explicit_layout for shader validation
4866- zink: add a ZINK_DEBUG=validation alias
4867- zink: fix semaphore signal ordering
4868- zink: move swapchain fence to swapchain object
4869- zink: avoid UAF on wayland async present with to-be-retired swapchain
4870- zink: always trace_screen_unwrap in acquire
4871- lavapipe: fix variable descriptor count support handling
4872- lavapipe: always set independent blend
4873- lavapipe: more vertex stride fixups
4874- lavapipe: set default viewport and scissor count for cmdbufs
4875- lavapipe: set default min sample shading to 1
4876- glx: XFree visual info
4877- radv: fix external handle type queries for dmabuf/fd
4878- zink: fix crashing in image rebinds
4879- zink: move push descriptor disable to driver workarounds
4880- zink: move v3dv scalarBlockLayout workaround
4881- zink: fix end-of-batch barrier pipeline stages
4882- zink: guarantee egl syncobj lifetime
4883- aux/trace: dump enum names for map usage
4884- gallium: add PIPE_MAP_NONE
4885- Revert "egl/wayland: Add image loader extension for swrast"
4886- egl/wayland: don't block in swrast when updating buffers for zink
4887- egl/wayland: return sooner from swrast_update_buffers() if zink
4888- zink: don't check submit count for unflushed usage
4889- egl: don't set ForceSoftware for all zink loading
4890- zink: error at handle export on missing EXT_image_drm_format_modifier
4891- gbm: delete some zink handling
4892- zink: apply ZINK_DEBUG=quiet to all missing feature warnings
4893- zink: set ZINK_DEBUG=quiet for polaris jobs
4894- lavapipe: don't block begin/end cmdbuf pipeline barriers
4895- ci: add a630 trace flakes
4896- zink: shrink vectors during optimization
4897- zink: always clamp shader stage in descriptor handling
4898- zink: add set_global_binding
4899- zink: eliminate samplers from no-sampler CL texops
4900- zink: add some checks to determine whether queue is init on screen destroy
4901- zink: don't destroy any simple_mtx_t objects during screen destroy
4902- zink: don't destroy uninitialized disk cache thread
4903- zink: reorder glsl_type_singleton_init_or_ref call
4904- zink: use screen destructor for creation fails
4905- zink: fix readback_present locking
4906- zink: add automatic swapchain readback using heuristics
4907- lavapipe: VK_EXT_nested_command_buffer
4908- zink: ignore unacquired swapchain images during end-of-frame flush
4909- nir/lower_fragcolor: preserve location_frac
4910- zink: update pointer for GPL pipeline cache entry formats
4911- zink: fix legacy depth texture rewriting for single component reads
4912- egl: unify dri2_egl_display creation
4913- egl: init dri3 version info during screen creation
4914- egl/glx: don't load non-sw zink without dri3 support
4915- egl: add automatic zink fallback loading between hw and sw drivers
4916- glx: add automatic zink fallback loading between hw and sw drivers
4917- ci: don't set GALLIUM_DRIVER for zink
4918- egl/wayland: only add more registry listeners for hardware devices
4919- zink: only increment image_rebind_counter on image export if binds exist
4920- zink: check for sampler view existence during zink_rebind_all_images()
4921- zink: use weston for anv ci
4922- zink: blow up broken xservers more reliably
4923- zink: delete some dead modifier handling
4924- ci: skip implicit modifier piglits for zink
4925- zink: don't block large vram allocations
4926- zink: add copy box locking
4927- zink: emit SpvCapabilitySampleRateShading with SampleId
4928- zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem
4929- zink: clamp resolve extents to src/dst geometry
4930- zink: only emit xfb execution mode for last vertex stage
4931- aux/u_transfer_helper: set rendertarget bind for msaa staging resource
4932- zink: unset explicit_xfb_buffer for non-xfb shaders
4933- mesa/st/texture: match width+height for texture downloads of cube textures
4934- zink: add more locking for compute pipelines
4935- radv: correctly return oom from the device when failing to create a cs
4936- zink: check for cbuf0 writes before setting A2C
4937
4938Mohamed Ahmed (19):
4939
4940- vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c
4941- vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c
4942- vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c
4943- nvk: implement vkGetDeviceImageMemoryRequirementsKHR()
4944- nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR()
4945- nvk: implement vkGetDeviceBufferMemoryRequirementsKHR()
4946- nvk: advertise VK_KHR_maintenance4
4947- nvk: advertise DemoteToHelperInvocation
4948- nvk: Enable multiplane images and image views
4949- nouveau/nvk: Add YCbCr sampler NIR lowering pass
4950- nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c
4951- nouveau/nvk: Create helper function for sampler creation
4952- nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT
4953- nouveau/nvk: Enable VK_KHR_sampler_ycbcr
4954- util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats
4955- vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly
4956- nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only
4957- nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only
4958- nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM
4959
4960Nanley Chery (33):
4961
4962- iris: Remap DRM_FORMAT_MOD_INVALID more often during import
4963- anv: Don't support ASTC images with modifiers
4964- intel: Add and use isl_drm_modifier_get_plane_count
4965- anv: Handle explicit surface layout of DG2_RC_CCS
4966- anv: Reduce accesses of isl_mod_info->aux_usage
4967- iris: Reduce accesses of mod_info->aux_usage
4968- crocus: Delete modifier with aux code
4969- hasvk: Delete modifier with aux code
4970- iris: Swap stencil and modifier aux assignment order
4971- intel: Describe modifier compression with booleans
4972- intel/isl: Move the Tile4 modifier score case down
4973- intel/isl: Add a score for DG2_RC_CCS
4974- intel/blorp: Ambiguate after CCS resolves on gfx7-8
4975- iris: Reorder render_aux_usage parameters
4976- iris: Pass the render format to prepare_render
4977- iris: Create BLORP surfaces after resource preparation
4978- iris: Handle clear color compatibility in prepare_render
4979- iris: Sample more texture view fast-clears on gfx11+
4980- iris: Fix aux usage tracking in prepare_render
4981- iris: Fix iris_copy_region calls involving FCV_CCS_E
4982- iris: Drop get_copy_region_aux_settings
4983- iris: Inline iris_can_sample_mcs_with_clear
4984- anv: Initialize the clear color more often for FCV
4985- intel: Return a bool from intel_aux_map_add_mapping
4986- anv: Move scope of CCS binding determination
4987- anv: Allocate space for aux-map CCS in image bindings
4988- anv: Wrap aux surface image binding queries
4989- anv: Refactor CCS disabling at image bind time
4990- anv: Place images into the aux-map when safe to do so
4991- anv: Loosen anv_bo_allows_aux_map
4992- anv: Meet CCS alignment reqs with dedicated allocs
4993- anv: Delete implicit CCS code
4994- intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS
4995
4996Neal Gompa (1):
4997
4998- asahi: Fix 32-bit x86 build with correct data type for overflow error message
4999
5000Neha Bhende (1):
5001
5002- ntt: lower indirect tesslevels in ntt
5003
5004Paul Gofman (2):
5005
5006- driconf: add a workaround for Captain Lycop: Invasion of the Heters
5007- driconf: add a workaround for Rainbow Six Extraction
5008
5009Paulo Zanoni (15):
5010
5011- anv: rename the vm_bind vfuncs
5012- anv: add a new vm_bind vfunc
5013- anv/xe: make vm_binds async
5014- anv/xe: return failure in case waiting for the vm_bind syncobj fails
5015- anv: remove misleading comment about batch_len
5016- iris: assert bufmgr->bo_deps_lock is held
5017- iris: avoid stack overflow in iris_bo_wait_syncobj()
5018- iris: assert(bo->deps) after realloc()
5019- intel/isl: add ISL_SURF_USAGE_SPARSE_BIT
5020- intel/isl: simplify the check for maximum surface size
5021- anv/sparse: add the initial code for Sparse Resources
5022- anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind
5023- anv/sparse: add INTEL_DEBUG=sparse
5024- anv: enable sparse resources by default
5025- vulkan: fix potential memory leak in create_rect_list_pipeline()
5026
5027Pavel Ondračka (44):
5028
5029- r300: update RV370 failures
5030- r300: check for index overflow when translating from TGSI
5031- r300: source register index is always unsigned
5032- r300: bump the RC_MAX_INDEX_BITS
5033- r300: normal instruction can't have presubtract op
5034- r300: add a helper for checking number of temporary sources
5035- r300: cycles estimate for shader-db
5036- r300: fix cycles calculation
5037- r300: don't abort on flow control when using draw for vs
5038- r300: add dEQP baseline for RV370 with forced swtcl
5039- r300: copy ntt to r300 compiler
5040- r300: add lower_sqrt to nir option
5041- r300: remove unused intrinsics in ntr
5042- r300: remove irrelevant opcodes in ntr
5043- r300: remove unused integer support in ntr
5044- r300: remove ntr_tgsi_usage_mask
5045- r300: remove more unused 64-bit pieces from ntr
5046- r300: simplify vectorization rules
5047- r300: remove more ntr unused helpers
5048- r300: remove the unneeded ntr_lower_vec_to_reg callback
5049- r300: remove unneeded 64bit and atomic lowering passes
5050- r300: remove unused ntr default settings
5051- r300: remove ntr default options
5052- r300: simplify ntr_emit_load_ubo
5053- r300: simplify ntr_emit_load_input
5054- r300: remove some virglrenderer specifics from ntr
5055- r300: simplify ntr_setup_uniforms
5056- r300: simplify ntr_output_decl
5057- r300: simplify ntr_try_store_in_tgsi_output
5058- r300: remove some unsupported texture opcodes
5059- r300: remove unused barrier code from ntr
5060- r300: simplify ntr_get_gl_varying_semantic
5061- r300: remove the nrt main optimization loop
5062- r300: reorder for easier presubtract 1-x pattern recognition
5063- r300: exit early in presubtract is not supported
5064- r300: implement bias presubtract
5065- r300: convert x * 2 into x + x for presubtract
5066- r300: move power of two multipliers down
5067- r300: there is no limitation on presubtract source file
5068- r300: use w channel for scalar opcodes if possible
5069- r300: reduce number of iterations for vertex shader loops
5070- r300: enable nir_move_vec_src_uses_to_dest
5071- nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output
5072- nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources
5073
5074Philipp Zabel (1):
5075
5076- etnaviv: fix segfault after compile failure
5077
5078Pierre-Eric Pelloux-Prayer (18):
5079
5080- radeonsi/sdma: use multiple commands if required
5081- radv/sdma: use multiple commands if required
5082- radv/sdma: use correct limits for gfx10.3
5083- glx: drop the 'libGL' log prefix
5084- loader: refactor DRI_PRIME handling code
5085- loader: extend DRI_PRIME to support =N
5086- loader: add DRI_PRIME_DEBUG env var
5087- device_select_layer: support DRI_PRIME=n
5088- docs: update DRI_PRIME documentation
5089- device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE
5090- st/mesa: check renderbuffer before using it
5091- radeonsi: emit framebuffer state after allocating cmask
5092- amd/common: update addrlib for gfx11.5
5093- amd/common: add registers for gfx11.5
5094- ac/nir: extract must_wait_attr_ring helper
5095- amd, radeonsi: Add code to enable gfx11.5
5096- mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode
5097- radeonsi: check sctx->tess_rings is valid before using it
5098
5099Piotr Kocia (2):
5100
5101- nir: Remove dead nir_const_value variables
5102- glsl: ir_function_param_visitor::visit_enter always true condition
5103
5104Qiang Yu (77):
5105
5106- aco,radv: replace tess_input_vertices shader info param
5107- radeonsi: aco does not pass LS outputs to HS by arg
5108- radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco
5109- radeonsi: init aco shader info for merged LS/HS
5110- radeonsi: simplify si_build_wrapper_function
5111- radeonsi: move vertex shader vb desc input sgpr args to last
5112- radeonsi: remove param type check in wrapper function
5113- radeonsi: refine si_llvm_ls_build_end
5114- radeonsi: refine si_llvm_es_build_end
5115- radeonsi: aco compile support merged mono shader
5116- radeonsi: calculate lds size for merged shaders
5117- radeonsi: enable aco compile for mono merged LS/HS
5118- radeonsi: enable aco compile for mono merged ES/GS
5119- aco: extract aco_compile_shader_part from aco_compile_ps_epilog
5120- aco: add p_end_with_regs pseudo instruction
5121- aco: move jump to epilog out of ic_merged_wave_info
5122- aco: add tcs end regs for epilog usage
5123- aco: allow tcs with epilog to keep nir store output instruction
5124- aco: add pending_lds_access option for insert waitcnt
5125- aco: add tcs epilog generation for radeonsi
5126- aco: don't emit s_endpgm for tcs with epilog
5127- aco: skip scratch init when no scratch arg provide
5128- aco,radeonsi: save const addr to symbol
5129- ac/nir/tess: move tess factor output out of control flow
5130- aco: use semantic location as io temp index
5131- radeonsi: add exec_size to shader binary
5132- radeonsi: support upload multi part shader binary
5133- radeonsi: share si_get_tcs_out_patch_stride with aco
5134- radeonsi: fill part mode tcs aco shader info
5135- radeonsi: extract si_llvm_build_shader_part
5136- radeonsi: remove separate_prolog arg from prolog/epilog build
5137- radeonsi: add si_get_tcs_epilog_args
5138- radeonsi: change si_fill_aco_options args
5139- radeonsi: add si_aco_build_shader_part
5140- radeonsi: part mode standalone tcs support aco compile
5141- radeonsi: remove unused arg of get_tcs_tes_buffer_address
5142- aco: simplify setup_tcs_info
5143- aco: pass sw_stage when setup_isel_context
5144- aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog
5145- aco: add vs prolog instruction selection for radeonsi
5146- aco: add aco compile interface for radeonsi vs prolog
5147- aco: do not fix_exports when program is prolog
5148- radeonsi: fill aco_shader_info->is_monolithic
5149- radeonsi: remove is_monolithic from vs prolog key
5150- radeonsi: extract si_get_vs_prolog_args to be shared with aco
5151- radeonsi: fix aco options has_ls_vgpr_init_bug setup
5152- radeonsi: add vs prolog aco build
5153- radeonsi: set vs has prolog aco shader info
5154- radeonsi: enable aco compile for part mode standalone vs
5155- aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately
5156- ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args
5157- aco: do not eliminate final exec write when p_end_with_regs block
5158- aco: remove p_end_with_regs from needs_exact()
5159- aco: add ps prolog generation for radeonsi
5160- aco: handle ps outputs from radeonsi
5161- aco: add create_fs_end_for_epilog for radeonsi
5162- aco,radv: remove unused ps epilog info fields
5163- aco,radv: rename ps epilog info inputs to colors
5164- aco: simplify export_fs_mrt_color
5165- aco,radv: add radeonsi spec ps epilog code
5166- aco: compact ps expilog color export for radeonsi
5167- aco,radv,radeonsi: pass spi ps input ena and addr
5168- aco: do not fix_exports when program has epilog
5169- aco: fix assertion fail when program contains empty block
5170- aco: create exit block for p_end_with_regs to branch to
5171- aco: wait memory ops done before go to next shader part
5172- radeonsi: reduce sgpr count for scratch_offset when aco
5173- radeonsi: init spi_ps_input_addr for part mode ps
5174- radeonsi: extract si_prolog_get_internal_binding_slot
5175- radeonsi: extract si_get_ps_prolog_args to be shared with aco
5176- ac,radeonsi: remove unused ps prolog key fields
5177- radeonsi: add ps prolog shader part build
5178- radeonsi: extract si_get_ps_epilog_args to be shared with aco
5179- radeonsi: fill aco shader info for ps part
5180- radeonsi: add ps epilog shader part build
5181- radeonsi: enable aco compile for part mode ps
5182- radeonsi: disable disk cache when use aco
5183
5184Rebecca Mckeever (32):
5185
5186- vulkan/runtime: Add helper functions for VK_EXT_host_image_copy
5187- nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base
5188- nouveau/codegen: Set lower_device_index_to_zero
5189- nvk: Convert system values for gl_PointCoord and PointCoord into inputs
5190- nvk: Add base_group to root descriptor table
5191- nvk: Lower base_workgroup_id
5192- nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch
5193- nvk: Advertise KHR_device_group
5194- nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table
5195- nvk: Add A4B4G4R4 formats to nil_format_info table
5196- nvk: Advertise EXT_4444_formats
5197- nvk: Enable shadow sampling
5198- nvk: Implement VK_EXT_non_seamless_cube_map
5199- nouveau/nil: Add macros for ufixed
5200- nvk: Implement VK_EXT_image_view_min_lod
5201- nvk: Update mutable descriptor struct type
5202- nvk: Replace asserts with conditional that sets type_list = NULL
5203- nvk: Implement nvk_GetDescriptorSetLayoutSupport
5204- nvk: Enable VK_KHR_maintenance3
5205- nvk: Advertise VK_EXT_mutable_descriptor_type
5206- nvk: Set image index to zero for NULL nvk_buffer_view
5207- nvk: Advertise VK_EXT_image_robustness
5208- nvk: Advertise VK_EXT_robustness2
5209- nvk: Add view_index to root descriptor table
5210- nvk: Lower nir_intrinsic_load_view_index
5211- nvk: Add draw support for multiview
5212- nvk: Add query support for multiview
5213- nvk: Add input attachments support for multiview
5214- nvk: Advertise VK_KHR_multiview
5215- nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering
5216- nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros
5217- nvk: Move code inside view mask loops to a helper function
5218
5219Rhys Perry (89):
5220
5221- ac/llvm: fix AC_TM_CHECK_IR
5222- radv: fix radv_get_ballot_bit_size with CS
5223- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
5224- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
5225- radv: add conformant_trunc_coord to cache UUID
5226- radv: don't unset TRUNC_COORD if conformant_trunc_coord=true
5227- ac/nir: always round cube array layers
5228- nir/unsigned_upper_bound: fix phi(bcsel)
5229- nir/tests: add test for unsigned_upper_bound with loop header phis
5230- nir/opt_dead_cf: remove nodes after a jump earlier
5231- nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if
5232- aco: insert s_nop before VGPR deallocation
5233- nir/lower_shader_calls: vectorize stack access for all shaders
5234- radv: workaround WWZ exporting index=1 through location=1
5235- radv: correctly skip MRT output NaN fixup for meta shaders
5236- radv: don't set vertex_attribute_strides on GFX8+
5237- radv/ci: skip some mesh shader tests on GFX1100
5238- aco: summarize register demand after handling branches
5239- aco: don't create sendmsg(dealloc_vgprs) if scratch is used
5240- radv: disable 64-bit color attachments
5241- radv: fix 128bpp comp-to-single clears
5242- radv: support 128bpp comp-to-single with all colors
5243- radv/gfx11: re-enable 0001/1110 clear values
5244- nir/lower_shader_calls: fix align_offset
5245- nir/opt_load_store_vectorize: support scratch access
5246- radv: vectorize RT stack access
5247- radv: vectorize scratch access
5248- aco: fix p_bpermute_gfx6 with input at non-zero byte
5249- aco: fix p_bpermute_gfx6's exec save/restore with wave32
5250- aco: clarify bpermute pseudo opcode names
5251- aco: add adjust_bpermute_dst helper
5252- aco/spill: skip p_branch in process_block
5253- aco/spill: add all live-in to merge block spill candidates
5254- nir/lower_system_values change num_workgroups to uint32_t
5255- radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions
5256- radv: use shortcut_1d_workgroup_id
5257- aco: remove fast path in insert_exec_mask's process_instructions
5258- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
5259- aco: check logical_phi_info at p_logical_end when eliminating exec writes
5260- aco: remove unused p_logical_end check when optimizing branching sequence
5261- radv: disable mesh dispatch XYZ_DIM when possible
5262- nir/deref: remove rematerialize_deref_in_block cache
5263- aco: reset prefetch in the correct block after removing the exit
5264- aco/waitcnt: replace wait_cnt::\*_cnt with booleans
5265- aco/waitcnt: add print helpers
5266- nir/lower_int64: fix find_lsb(0)
5267- nir/algebraic: optimize u2u32(a >> 32)
5268- aco/optimizer_postRA: don't combine DPP across exec on GFX8/9
5269- aco: don't combine DPP into v_cmpx
5270- aco: disable zero offset optimization for strict WQM coords
5271- nir/constant_folding: remove zero texel offset
5272- aco: remove zero offset optimization
5273- aco: shrink DPP8_instruction
5274- aco: add fetch_inactive field to DPP instructions
5275- nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd
5276- aco: disable FI for quad/masked swizzle
5277- aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt
5278- aco: only mitigate VcmpxExecWARHazard when necessary
5279- aco: fix s_setreg hazards
5280- aco: consider exec_hi in reads_exec()
5281- aco: resolve all possible hazards at the end of shader parts
5282- aco/tests: test that hazards are resolved at the end of shader parts
5283- radv: skip zero-sized memcpy
5284- ac/nir: fix out-of-bounds access in ac_nir_export_position
5285- radv: fix signed integer overflow
5286- Revert "radv: pre-init surface info"
5287- nir: improve ms_cross_invocation_output_access with local_invocation_id
5288- aco,nir: add export_row_amd intrinsic
5289- ac/nir: add row parameter to helpers
5290- ac/nir: remove dead code
5291- ac/nir: refactor mesh vertex/primitive export
5292- ac/nir: implement mesh shader gs_fast_launch=2
5293- ac/nir: optimize mesh shader local_invocation_index
5294- radv: implement mesh shader gs_fast_launch=2
5295- ac/nir: add emit_ms_outputs helper
5296- ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms
5297- ac/nir: implement mesh shader multi-row export
5298- radv: implement mesh shader multi-row export
5299- radv: enable mesh shader gs_fast_launch=2 and multi-row export
5300- nir/serialize: fix signed integer overflow
5301- nir/lower_shader_calls: skip zero-sized qsort
5302- util: skip zero-sized SHA1Update
5303- radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements
5304- radv: skip radv_remove_varyings for mesh shaders
5305- radv: disable gs_fast_launch=2 by default
5306- docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default
5307- radv: add radv_disable_trunc_coord option
5308- radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK
5309- ac/nir: fix partial mesh shader output writes on GFX11
5310
5311Rob Clark (60):
5312
5313- freedreno: move virtgpu msm_proto.h to common
5314- freedreno/drm/virtio: Remove unused header
5315- tu/msm: staticify a couple things
5316- tu/knl: Remove some random const'ness
5317- drm-uapi: Update virtgpu header
5318- freedreno: Update virtgpu proto
5319- freedreno/drm/virtio: Use global_faults
5320- tu: close submitqueues before device_finish()
5321- tu/drm: Factor out shared helpers
5322- tu/drm: Add missing error path cleanup
5323- tu/drm: Split out helper for iova alloc
5324- tu: Add virtgpu support
5325- util: Decouple disk cache from EGL_ANDROID_blob_cache
5326- docs: Followup to !24636
5327- tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE
5328- ir3+tu: Simplify ir3_find_sysval_regid callers
5329- freedreno/a6xx: Drop unused screen args
5330- freedreno/a6xx: Re-work fd6_emit_shader
5331- freedreno/a6xx: Re-write the function-of-doom
5332- freedreno: Implement ATI_meminfo
5333- freedreno/a6xx: ARB_post_depth_coverage
5334- freedreno/a6xx: ARB_sample_locations
5335- freedreno/a6xx: ARB_texture_filter_minmax
5336- freedreno/a6xx: EXT_demote_to_helper_invocation
5337- freedreno/a6xx: EXT_shader_image_load_formatted
5338- freedreno/a6xx: EXT_depth_bounds_test
5339- freedreno/a6xx: Use pipe_blit_info::sample0_only
5340- freedreno/a6xx: Handle PIPE_BIND_BLENDABLE
5341- freedreno/a6xx: ARB_shader_viewport_layer_array
5342- tu: Fix heap size
5343- freedreno: Fix crash with debug msgs enabled
5344- freedreno/layout: Handle 565/etc MSAA special case
5345- freedreno/decode: Fix printing chip-id
5346- freedreno/a6xx: Add L8_SRGB
5347- freedreno: Add reformatting commits to .git-blame-ignore-revs
5348- freedreno/fence: Hold a strong ref to batch
5349- freedreno/decode: Lookup device info
5350- freedreno/decode: Use info->chip to decode
5351- freedreno/decode: Remove gpu_id
5352- freedreno: Indentation fix
5353- freedreno: Use explicit QCOM_TILED3 modifier
5354- freedreno/a6xx: Remove dummy packet for globals
5355- freedreno: Fix streamout offset_buf dirtiness
5356- freedreno: Fix user const buffer dirtiness
5357- freedreno/batch: Move query_buf allocation
5358- freedreno: Add private-BO tracking
5359- freedreno: Add missing indirect_draw_count tracking
5360- freedreno: Move/add some attach_bo()
5361- freedreno: Add attach-bo debugging
5362- freedreno: Rework supported-modifiers handling
5363- mesa: Introduce MESA_texture_const_bandwidth
5364- mesa: Implement MESA_texture_const_bandwidth
5365- freedreno: Add PIPE_CAP_HAS_CONST_BW support
5366- panfrost: Add PIPE_CAP_HAS_CONST_BW support
5367- iris: Add PIPE_CAP_HAS_CONST_BW support
5368- radeonsi: Add PIPE_CAP_HAS_CONST_BW support
5369- tu/msm: Fix timeline semaphore support
5370- tu/virtio: Fix timeline semaphore support
5371- freedreno/drm: Fix race in zombie import
5372- freedreno: Always attach bo to submit
5373
5374Robert Foss (9):
5375
5376- egl: Expose access to DeviceList
5377- egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList()
5378- egl: Refresh DeviceList during eglInitialize()
5379- egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2()
5380- egl/android: Use EGL DeviceList instead drmGetDevices2()
5381- egl: Rename _eglAddDevice() to _eglFindDevice()
5382- egl: Rename _eglAddDevice() to _eglFindDevice()
5383- egl: Fix attrib_list[0] == EGL_NONE check
5384- egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay()
5385
5386Robert Mader (6):
5387
5388- egl/wayland: wait for compositor to release shm buffers
5389- iris: Support parameter queries for main planes
5390- util: Add new helpers for pipe resources
5391- panfrost: Support parameter queries for main planes
5392- vc4/resource: Support offset query for multi-planar planes
5393- v3d/resource: Support offset query for multi-planar planes
5394
5395Rohan Garg (33):
5396
5397- iris: migrate WA 14013910100 to use the WA framework
5398- iris: migrate WA 14016118574 to use the WA framework
5399- iris: fix iris for WA 16013000631
5400- intel/perf: add perf query support for Intel Raptorlake
5401- intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml
5402- anv: use the WA infrastructure where possible when generating state
5403- anv: use the correct GFX_VERx10 macro for WA
5404- anv,iris: program the maximum number of threads on compute queue init
5405- anv: drop CFE state validation checks
5406- iris: track reset signalling instead of replacing the context
5407- iris: allow for a unsynchronized device reset query
5408- anv: partially revert 2e8b1f6d
5409- anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+
5410- anv: use the pre defined _3DPRIMITIVE_DIRECT macro
5411- anv: drop dead ifdef
5412- iris: use the correct WA macros and lineage numbers
5413- anv: use the lineage number for WA
5414- crocus: add a __gen_get_batch_address declaration
5415- crocus: fix GFX_VERx10 macro
5416- blorp: drop undefined macro
5417- iris: migrate preemption streamwout wa to WA infra
5418- intel/genxml: update PIPE_CONTROL instruction for dg2
5419- anv: define clear color localy within can_fast_clear_color_att
5420- intel/compiler: Adjust CS payload registers for new register width on Xe2+
5421- intel/compiler: Adjust fence message lengths for new register width on Xe2+
5422- intel/compiler: Adjust barrier emission for Xe2+
5423- intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec
5424- anv: ensure that FCV_CCS_E fast clears are properly tracked
5425- anv: enable FCV for Gen12.5
5426- anv: fix debug string for PC flush
5427- anv: cleanup includes
5428- anv: turn off non zero fast clears for CCS_E
5429- anv: selectively enable FCV optimization for DG2
5430
5431Roland Scheidegger (1):
5432
5433- lavapipe: further limit accurate_a0 hack
5434
5435Roman Stratiienko (22):
5436
5437- egl: android: Remove legacy name-based shared buffers support
5438- util: Add NONNULL macro
5439- android: Introduce the Android buffer info abstraction
5440- android: Fix num_planes assignment in u_gralloc_fallback
5441- v3dv/android: Use u_gralloc code
5442- v3dv/android: Enable shared presentable image support
5443- v3dv: Migrate to vk_device_memory
5444- v3dv/android: Skip swapchain binding
5445- v3dv: Rely on the internal tiled flag instead of the common vk structure
5446- v3dv/android: Add a helper function to support explicit layouts
5447- v3dv/android: Rework Android native buffer importing logic
5448- v3dv: Use format stored in vk_image and vk_image_view after init
5449- v3dv: Split v3dv_image_init to use layout setting logic separately
5450- v3dv/android: Add AHardwareBuffer support
5451- v3dv: Enable VK API v1.2 for Android
5452- panvk: Add Android ICD loader entry point
5453- u_gralloc: Remove inline modifiers from the functions
5454- u_gralloc: Remove usage of NONNULL macro
5455- Revert "util: Add NONNULL macro"
5456- u_gralloc: Add a function that returns gralloc type
5457- dri: Remove __driDriverExtensions leftovers
5458- v3d: Don't implicitly clear the content of the imported buffer
5459
5460Ruijing Dong (2):
5461
5462- frontends/va: checking va version for av1enc support
5463- radeonsi/vcn: change max_poc to fixed value for hevc encoder.
5464
5465Ryan Neph (1):
5466
5467- vulkan/android: add missed STACK_ARRAY_FINISH()
5468
5469Sagar Ghuge (34):
5470
5471- intel/compiler: Look at 2 register worth of data instead of 4
5472- isl: Disable MCS compression just on ACM platform
5473- intel: Add env variable to add break point on/before draw
5474- anv: Add GPU breakpoint before/after specific draw call
5475- iris: Add GPU breakpoint before/after draw call
5476- blorp: Implement blorp hooks to emit breakpoint
5477- docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT
5478- intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers
5479- anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
5480- iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
5481- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
5482- anv: Add helper to create companion RCS command buffer
5483- anv: Split out End/Destroy/Reset cmd buffer code into helper
5484- anv: Handle companion RCS in end/destory/reset code path
5485- intel: Add helper to create/destroy i915 VM
5486- intel: Pass virtual memory address space ID while creating context
5487- anv: Create companion RCS engine
5488- anv: Move compute specfic bits under compute queue init
5489- anv: Execute RCS init batch on companion RCS context/engine
5490- anv: Setup companion RCS command buffer submission
5491- anv: Execute an empty batch to sync main and companion RCS batch
5492- anv: Add secondary companion RCS cmd buffer to primary
5493- anv: Skip layout transition on the compute queue
5494- anv: Extract batch print code to anv_print_batch helper
5495- iris: Enable always flush cache with DEBUG_STALL option
5496- intel/genxml: Add STATE_COMPUTE_MODE instruction
5497- anv: Program and emit STATE_COMPUTE_MODE
5498- anv: Enable barrier handling on video engines
5499- isl: Use 16-bit instead of 8-bits for surface format info fields
5500- anv: Handle end of pipe with MI_FLUSH_DW on transfer queue
5501- anv: Enable transfer queue only on ACM+ platforms
5502- blorp: Use the correct miptail start LOD for surfaces
5503- anv: Write timestamp using MI_FLUSH_DW on blitter
5504- anv: Flush data cache while clearing depth using HIZ_CCS_WT
5505
5506Saleemkhan Jamadar (1):
5507
5508- radeonsi/vcn: set jpeg reg version for gfx 1150
5509
5510Samuel Holland (3):
5511
5512- Android.mk: Allow building only Vulkan drivers
5513- Android.mk: Explicitly enable/disable LLVM support
5514- Android.mk: Only link LLVM for radeonsi, not amd_vk
5515
5516Samuel Pitoiset (299):
5517
5518- radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV
5519- radv: make radv_get_pa_su_sc_mode_cntl() static
5520- zink/ci: update list of expected failures for NAVI10
5521- radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL
5522- radv: remove unused param in radv_pipeline_emit_vgt_gs_out()
5523- radv: pass a shaders array for computing ia_multi_vgt_param
5524- radv: bind the pre-compiled PS epilog to the cmdbuf state
5525- radv: stop using an array of binaries when compiling a compute shader
5526- radv: add radv_compile_cs() to compile a compute shader
5527- radv: remove the pipeline dependency for creating a GS copy shader
5528- radv: add a helper to compute the ESGS itemsize
5529- radv: use the number of GS linked inputs to compute the ESGS itemsize
5530- radv: determine ES info for VS/TES with GS earlier
5531- radv: determine as_ls earlier by using the next stage
5532- radv: simplify getting next VS stage for VS prologs
5533- radv: use next_stage for determining the stage to lower NGG
5534- radv/amdgpu: fix dumping CS with the chained IBs path
5535- radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump()
5536- radv: pass submit info to radv_check_gpu_hangs()
5537- radv: initialize stage/next_stage earlier
5538- radv: set next_stage to MESA_SHADER_NONE if there is no FS
5539- radv: rework considering force VRS without relying on graphics pipeline
5540- radv: stop passing radv_graphics_pipeline to radv_fill_shader_info()
5541- radv: move removing all varyings when the FS is a noop
5542- radv: rename graphics pipeline linking helpers
5543- radv: simplify lowering NGG GS intrinsics
5544- radv: rework determining the NGG stage without a graphics pipeline
5545- radv: cleanup pipeline compute emit helpers
5546- radv: rename radv_pipeline_stage to radv_shader_stage
5547- radv: rename NGG query state to be more generic
5548- radv: declare the shader query user SGPR for emulating GS counters
5549- radv: enable pipelinestat query emulation for legacy GS
5550- radv: simplify the NGG vs legacy pipelinestat query path
5551- radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET
5552- radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd
5553- radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2
5554- radv: track whether inputs/outputs are linked per shader stage
5555- radv: add support for VS/TES as ES without shaders IO linking
5556- radv: use next_stage to determine if the layer should be exported
5557- radv: use next stage to determine if primID/clip dist should be exported
5558- radv: compute the legacy GS info earlier
5559- radv: stop copying some NIR info fields from TES to TCS
5560- radv: stop lowering patch vertices for TES
5561- radv: do not always copy the number of tess patches to TES
5562- radv: initialize tcs.tes_{patch}_inputs_read to a default value
5563- radv: prevent linking TCS<->TES when TES is NULL
5564- radv: use a packed user SGPR for the TES state
5565- radv: stop checking if patch control points is dynamic everywhere
5566- radv: copy the number of TCS vertices out to TES shader info
5567- radv: add support for dynamic TCS vertices out for TES
5568- radv: remove radv_shader_info::tes::num_linked_patch_inputs
5569- amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code
5570- radv: allow to use fixed IO locations for VS<->TCS<->TES without linking
5571- aco: add aco_shader_info::tcs::has_epilog
5572- aco: add infra for compiling TCS epilogs
5573- radv,aco: move has_epilog to radv_shader_info
5574- radv: assume a TCS needs an epilog unless it's linked with a TES
5575- radv: do not write tess factors in main TCS when it has an epilog
5576- radv: track if TES reads tess factors differently
5577- radv: declare new argument for the TCS epilog PC
5578- radv: add radv_tcs_epilog_key
5579- radv: add infra for creating TCS epilogs
5580- radv: add support for a TCS epilogs cache in the device
5581- radv: add support for emitting TCS epilogs in cmdbuf
5582- radv: remove unnecessary check in radv_pipeline_nir_to_asm()
5583- radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm()
5584- radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile()
5585- radv: add a struct for the retained shaders and GPL
5586- radv: add radv_graphics_shaders_compile() to compile graphics shaders
5587- radv: remove redundant check in radv_cmd_buffer_after_draw()
5588- radv: track if patch control points is dynamic from the cmdbuf state
5589- radv: re-emit binning state if the framebuffer is dirty
5590- radv: track if vertex binding stride is dynamic from the cmdbuf state
5591- vulkan: bump header register to 1.3.261
5592- vulkan/runtime: add common implementation for GetImageSubresourceLayout()
5593- vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR
5594- radv: use the RT prolog scratch size directly for tracing rays
5595- radv: add a helper to get the maximum number of scratch waves per shader
5596- radv: update the number of scratch waves for RT prolog at bind time
5597- radv: update cmdbuf scratch size info when shaders are bound
5598- vulkan: add init/finish helpers for vk_buffer_view
5599- radv: use vk_buffer_view
5600- radv: use vk_sampler
5601- radv: use common vkCmdBegin/EndQuery wrappers
5602- radv: use vk_query
5603- zink: fix setting VkShaderCreateInfoEXT::nextStage
5604- radv/rt: fix capture/replay support
5605- vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR()
5606- radv: implement vkCmdBindIndexBuffer2KHR()
5607- radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2()
5608- radv/rmv: remove unused pipeline create flags when logging pipelines
5609- radv: store pipeline create flags to radv_pipeline::create_flags
5610- radv: add support for VkPipelineCreateFlags2CreateInfoKHR
5611- radv: add support for VkBufferUsageFlags2CreateInfoKHR
5612- radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers
5613- radv: implement radv_Get{Device}ImageSubresourceLayout2KHR()
5614- radv: advertise VK_KHR_maintenance5
5615- radv: remove useless NULL for pipeline layout during shader info pass
5616- radv: introduce radv_shader_layout for per-stage descriptor layout
5617- radv: stop passing redundant stage to radv_shader_stage_init()
5618- radv: re-introduce radv_pipeline_stage_init()
5619- radv: add support for loading the LSHS vertex stride from a SGPR
5620- radv: use the number of VS outputs for computing the tessellation info
5621- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
5622- radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits
5623- radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE
5624- radv: fix emitting TCS epilogs for GFX6-9
5625- radv: remove radv_cmd_buffer::cached_vertex_formats
5626- radv: remove unused param from radv_pipeline_init_multisample_state()
5627- radv: simplify declaring VS specific input SGPRs
5628- radv: stop copying if VS or TES uses the InvocationID built-in
5629- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings"
5630- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build"
5631- radv/amdgpu: allow to execute external IBs on the compute queue
5632- radv/amdgpu: add support for submitting external IBs with the chained path
5633- zink/ci: update list of expected failures for NAVI10
5634- radv: use the maximum possible workgroup size for TCS epilogs
5635- radv: stop declaring the scratch offset argument for TCS epilogs
5636- radv: declare shader arguments for TCS epilogs
5637- radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key
5638- aco: fix jumping from main TCS to epilog on GFX9+
5639- aco: adjust TCS epilogs for RADV
5640- aco: allow SGPRs operands with p_jump_to_epilog
5641- aco: implement create_tcs_jump_to_epilog()
5642- radv: track the pipeline bind point for indirect commands layout
5643- radv: prepare radv_get_sequence_size() for DGC compute
5644- radv: prepare radv_prepare_dgc() for DGC compute
5645- radv: implement NV_device_generated_commands_compute
5646- radv: allow DGC on the compute queue
5647- radv: advertise NV_device_generated_commands_compute
5648- aco: rework printing shader stages
5649- radv: fix the per-patch data offset when TES isn't linked with TCS
5650- radv: stop declaring unused SGPR arguments for PS epilogs
5651- radv: add radv_shader_info::is_monolithic
5652- radv: use info->uses_view_index directly when declaring shader arguments
5653- radv: do not inline push constants for non-monolithic shaders
5654- radv: force indirect descriptor sets for non-monolithic shaders
5655- radv: always declare some arguments for non-monolithic VS/TCS shaders
5656- radv: add a new shader argument for non-monolithic shaders PC
5657- ac: allow to mark shader arguments as preserved
5658- radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+
5659- aco: disable shared VGPRs for non-monolithic shaders on GFX9+
5660- aco: ensure to initialize exec manually for VS as LS on GFX9+
5661- aco: add support for compiling VS+TCS separately on GFX9+
5662- radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders
5663- radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+
5664- aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+
5665- aco: add support for compiling {VS,TES}+GS separately on GFX9+
5666- radv,aco: remove unused clip/cull distances variables
5667- radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state()
5668- radv: small cleanups in radv_emit_patch_control_points()
5669- radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+
5670- radv: remove the pipeline dependency for emitting VGT_GS_MODE
5671- aco: fix emitting TCS epilogs end on GFX9+
5672- radv: re-order IO slot layout for stages that aren't linked
5673- amd/ci: update list of failures/flakes for glcts-vangogh-valve
5674- ci: uprev vkd3d-proton
5675- ci: uprev Fossilize
5676- ci: add comment explaining which image tags to update for Fossilize
5677- radv: preserve shader argument for separate compilation of NGG shaders
5678- aco: flag blocks with long-jump as export_end for separate compilation
5679- aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders
5680- aco: allow separate compilation of NGG shaders
5681- zink/ci: add zink-radv-polaris10-valve
5682- radv/ci: re-enable vkcts-polaris10-valve
5683- radv: fix capturing indirect dispatches with SQTT
5684- radv/ci: re-enable vkd3d-polaris10-valve
5685- ci: do not fail vkd3d-proton job when the expectations match
5686- radv/amdgpu: fix executing secondaries without IB2
5687- radv/amdgpu: do not copy the original chain link for IBs
5688- radv: avoid emitting SQTT markers for DGC calls
5689- radv: add support for DGC with SQTT
5690- zink/ci: merge GLCTS testing with GLESx for RADV
5691- zink/ci: merge piglit testing with deqp-runner for RADV
5692- radv: fix interactions with primitives generated queries and pipeline stats
5693- radv: skip DGC calls when the indirect sequence count is zero with a predicate
5694- radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches
5695- radv: adjust next stage for VS prologs and merged shaders compiled separately
5696- radv: adjust emitted prolog regs for merged shaders compiled separately
5697- radv: do not use pre-compiled prologs when VS is compiled separately
5698- radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders
5699- radv: fix enabling DGCC
5700- radv: fix emitting SQTT userdata when CAM is needed
5701- radv: fix capturing RGP on RDNA3 with more than one Shader Engine
5702- zink/ci: update list of expected failures for POLARIS10/NAVI10
5703- radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+
5704- radv: disable unsupported hw shader stages for RGP on GFX11+
5705- radv: fix instruction timing on GFX11
5706- ac/rgp: use correct API stage string for mesh/task shaders
5707- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
5708- radv: emit relocation for mesh/task shaders
5709- issue_templates/Bug Report: fix outdated URL for GFXReconstruct
5710- ac,radv,radeonsi: rework SPM counters configuration and share it
5711- ac/perfcounter: add new SQ_WGP block for GFX11+
5712- ac/spm: add SPM counters configuration for GFX11
5713- radv: enable the PKT3 CAM bit for some SPM register writes
5714- radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values
5715- ac/spm: remove useless SPM block setting for GFX9 and older GPUs
5716- ac/spm: add SPM block definition for GFX10-GFX10.3
5717- ac/gpu_info: init num_cu_per_sh from the kernel
5718- ac/perfcounter: set the number of instances of GL1C to 4
5719- ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C
5720- ac/spm: fix checking if the counter instance is valid
5721- ac/spm: rework how segment muxsel RAM are filled
5722- ac/spm: initialize and set instance mapping for counters
5723- radv: reserve more CS space in SQTT/SPM paths
5724- ac/spm: use block flags to initialize instance mapping
5725- ac/spm: select correct segment type for per-SE blocks
5726- radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers
5727- ac/spm: fix number of instances of GL2C
5728- ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters
5729- ac,radv,radeonsi: prepare support for multi-instance SPM generic counters
5730- ac/spm: move the counter instance to ac_spm_counter_create_info
5731- ac/spm: enable support for multi-instance counters
5732- radv: fix checking if RGP is enabled with others tracing tools
5733- radv: fix missing ISA with RGP and GPL
5734- ac/perfcounter: add SG_WQP group for GFX11
5735- ac/perfcounter: add GFX11 groups
5736- drirc: remove Path of Exile workarounds
5737- radv: remove drirc workarounds for Path Of Exile
5738- radv: remove absolute_depth_bias workaround
5739- ac/gpu_info: define AMD_MAX_WGP
5740- ac/spm: add new segment types for GFX11
5741- ac/spm: add support for GFX11
5742- radv: add SPM support for GFX11
5743- radv: enable cache counters for RGP on GFX11
5744- ci: update to vulkan-cts-1.3.6.3
5745- radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs
5746- nir: rename atomic_add_gs_invocation_count_amd to make it more generic
5747- ac/nir: add lowering for mesh shader queries
5748- ac/nir: add lowering for task shader queries
5749- radv: add GDS counters offset for mesh/task queries
5750- radv: adjust lowering of intrinsic queries for mesh/task shaders
5751- radv: enable lowering of mesh/task shader queries when enabled
5752- radv: declare shader_query_state for mesh/task shaders
5753- radv: stop skip emitting CB states when there is no color attachment
5754- radv: re-enable DCC with mipmaps on GFX11
5755- radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue
5756- radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble
5757- radv: fix alignment of DGC command buffers
5758- radv/ci: update list of expected failures on PITCAIRN
5759- radv/ci: update list of flakes for NAVI10/VEGA10
5760- radv/amdgpu: fix alignment of command buffers
5761- radv: enable DCC for MSAA images on GFX11
5762- zink/ci: update list of expectations for zink-anv-tgl
5763- zink/ci: bump zink-anv-tgl-full timeout to 1h45m
5764- radv/ci: rename GFX1100 lists to NAVI31
5765- radv: fix emulated geometry shader primitives/invocations queries
5766- radv/ci: remove duplicate skipped tests for RAVEN/STONEY
5767- radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs
5768- radv: fix synchronization with emulated GS primitives/invocations queries
5769- radv/ci: remove no longer existing test for VANGOGH
5770- radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10
5771- radv: always write the sample positions when a new descriptor BO is created
5772- radv: fill the scratch BO in radv_fill_shader_rings()
5773- radv: fix gang submissions with chaining
5774- radv: fix re-emitting streamout descriptors for NGG streamout
5775- radv: fix IB alignment
5776- zink: use warn_missing_feature for missing modifier support
5777- radv: fix destroying GDS/OA BOs
5778- radv: allocate only 1 GDS OA counter for gfx10 NGG streamout
5779- ac/nir: only consider overflow for valid feedback buffers
5780- radv/ci: update list of expected failures on RAVEN
5781- radv/ci: update list of flakes for VANGOGH
5782- radv/ci: update list of flakes for STONEY
5783- radv: disable primitive restart for non-indexed draws on GFX11
5784- radv: enable radv_disable_aniso_single_level=true for Zink too
5785- amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11
5786- radv: mark GDS as needed for XFB queries with NGG streamout on GFX11
5787- radv: skip GDS allocation for NGG streamout on GFX11
5788- zink/ci: remove expected failures that are skipped for RADV
5789- ci: update CTS to vulkan-cts-1.3.7.0
5790- ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers
5791- ci: bump DEQP_FRACTION for some jobs
5792- radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5
5793- radv: initialize video decoder for GFX11.5
5794- ac/gpu_info: query the maximum number of IBs per submit from the kernel
5795- Revert "radv: fix finding shaders by PC"
5796- radv: fix missing predicate bit for WRITE_DATA helper
5797- ac/gpu_info: fix querying the maximum number of IBs per ring
5798- radv: remove outdated RADV_DEBUG=vmfaults support
5799- amd: update amdgpu_drm.h
5800- amd: add has_gpuvm_fault_query
5801- radv/amdgpu: add support quering the last GPUVM fault
5802- radv: query and report the last GPUVM fault with RADV_DEBUG=hang
5803- radv: report the last GPUVM fault when a device lost is detected
5804- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues
5805- radv: fix a synchronization issue with primitives generated query on RDNA1-2
5806- radv: bind the non-dynamic graphics state from the pipeline unconditionally
5807- radv: fix compute shader invocations query on compute queue on GFX6
5808- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE
5809- nir: fix inserting the break instruction for partial loop unrolling
5810- radv: fix registering queues for RGP with compute only
5811- radv: set radv_zero_vram=true for Unreal Engine 4/5
5812- radv: fix a descriptor leak with debug names and host base descriptor set
5813- radv: add a missing async compute workaround for Tonga/Iceland
5814- radv: disable TC-compatible HTILE on Tonga and Iceland
5815- radv: set radv_invariant_geom=true for War Thunder
5816- radv: do not set OREO_MODE to fix rare corruption on GFX11
5817
5818Saroj Kumar (4):
5819
5820- radeonsi: Add perfetto support in radeonsi
5821- radeonsi: Add u_trace init code in radeonsi
5822- radeonsi: Add tracepoints in radeonsi driver
5823- radeonsi: fixes compilaton error when perfetto is disabled
5824
5825Sathishkumar S (2):
5826
5827- radeonsi/vcn: support variable number of bs_bufs
5828- radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines
5829
5830Semjon Kravtsenko (1):
5831
5832- glx: Assign unique serial number to GLXBadFBConfig error
5833
5834Seppo Yli-Olli (1):
5835
5836- zink: Fix SyntaxWarning in zink_extensions script
5837
5838Sergi Blanch Torne (7):
5839
5840- Introduce ci-kdl builder and launcher.
5841- Integrate ci-kdl in the building process and launch process.
5842- ci: disable Collabora's LAVA lab for maintance
5843- Revert "ci: disable Collabora's LAVA lab for maintance"
5844- Revert "ci: disable Collabora's LAVA lab for maintance"
5845- ci: disable Collabora's LAVA lab for maintance
5846- Revert "ci: disable Collabora's LAVA lab for maintance"
5847
5848Sid Pranjale (1):
5849
5850- nvk: Enable VK_EXT_load_store_op_none
5851
5852Sil Vilerino (20):
5853
5854- util: Blake3 - Identify arm64ec as aarch64 instead of x64
5855- d3d12: Fix Map/Unmap of YUV resources
5856- d3d12: Fix H264 interlaced decode
5857- d3d12: Video Decode - Remove unnecessary copy for texture array case
5858- util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler
5859- d3d12: Extend video screen AV1 encode tile support checking
5860- aux/tc: Add ASSERTED to unreferenced release build variable
5861- d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process
5862- frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
5863- d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface
5864- d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding
5865- d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects
5866- d3d12: Do not destroy codec when destroying video buffer
5867- d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support
5868- d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter
5869- d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting
5870- frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols
5871- d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS
5872- d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility
5873- d3d12: d3d12_video_buffer_create_impl - Fix resource importing
5874
5875Simon Ser (7):
5876
5877- wayland: enable use of wayland-protocols as a subproject
5878- vulkan/wsi/wayland: add support for IMMEDIATE
5879- vulkan/wsi/wayland: fix unset present_mode
5880- radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL
5881- egl: extract EGLDevice setup in dedicated function
5882- egl: move dri2_setup_device() after dri2_setup_extensions()
5883- egl: ensure a render node is passed to _eglFindDevice()
5884
5885Simon Zeni (1):
5886
5887- nouveau/winsys: use mmap instead of mmap64 in nouveau_bo
5888
5889SoroushIMG (1):
5890
5891- pvr: fix mipmap size calculation for bc formats
5892
5893Sviatoslav Peleshko (9):
5894
5895- dri: Use RGB internal formats for RGBX formats
5896- intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line
5897- anv: Do fast clear color initialization more delicately
5898- zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize
5899- intel/fs: Check if the whole ubo load range is in the push const range
5900- zink: Store zink_vertex_elements_hw_state::b.strides by binding id
5901- intel/fs: Fix "packed word exception" condition for register regioning
5902- intel/eu/validate: Validate "packed word exception" stricter
5903- nir/loop_analyze: Fix inverted condition handling in iterations calculation
5904
5905Sylvain Munaut (9):
5906
5907- egl/dri2: Add a couple of missing mutex release in error path
5908- mesa: Enable ARB_texture_border_clamp in GL Core
5909- include: Fix the PFN declarations to be pointers as they should
5910- glx: Add missing MesaGLInteropGLXFlushObjects
5911- glx: Export the MESA GL Interop functions through glXGetProcAddress
5912- egl: Export the MESA GL Interop functions through eglGetProcAddress
5913- glx: Remove MESA_depth_float_bit from enum
5914- glx: Advertise GLX_MESA_gl_interop extension if support present
5915- egl: Advertise EGL_MESA_gl_interop extension if support present
5916
5917Tapani Pälli (34):
5918
5919- intel/blorp: add a new flag to communicate PSS sync need
5920- anv: implement required PSS sync for Wa_18019816803
5921- iris: implement required PSS sync for Wa_18019816803
5922- vulkan/runtime: change assert to match specification needs
5923- anv: remove assert, size is asserted in the runtime
5924- anv: refactor batch_set_preemption to use batch_emit_pipe_control
5925- anv: implement a dummy depth flush for Wa_14016712196
5926- iris: implement a dummy depth flush for Wa_14016712196
5927- mesa: fix some TexParameter and SamplerParameter cases
5928- mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads
5929- ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests
5930- anv: implement Wa_14018912822
5931- iris: implement Wa_14018912822
5932- driconf: use lower_depth_range_rate for The Spirit and The Mouse
5933- mesa: disable snorm readpix clamping with EXT_render_snorm
5934- iris: modify Wa_14014414195 to use intel_needs_workaround
5935- mesa: some cleanups for texparam extension checks
5936- iris: avoid issues with undefined clip distance
5937- crocus: avoid issues with undefined clip distance
5938- anv: refactor to fix pipe control debugging
5939- anv: fix a leak of fp64_nir shader
5940- iris: use intel_needs_workaround for Wa_14014414195 part 2
5941- iris: correct dst alpha blend factor in Wa_14018912822
5942- iris/anv: move Wa_14018912822 as a drirc workaround
5943- iris: flush data cache when flushing HDC on GFX < 12
5944- anv: HDC flush is available only for GFX_VER 12+
5945- iris: HDC flush is available only for GFX_VER 12+
5946- intel/genxml: remove HDC from gen11.xml, it is not available
5947- mesa/st: ignore StencilSampling if stencil not part of the format
5948- intel/dev: expand existing fix for all gfx12 with small EU count
5949- egl: fix leaking drmDevicePtr in _eglFindDevice
5950- iris: add data cache flush for pre hiz op
5951- anv/drirc: add option to disable FCV optimization
5952- drirc: Set limit_trig_input_range option for Valheim
5953
5954Tatsuyuki Ishi (8):
5955
5956- radv/amdgpu: Remove unused bo_list variable from cs_submit.
5957- radv/winsys: Remove unused struct radv_winsys_bo_list.
5958- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
5959- radv: Fix IB size for RADV_DEBUG=hang.
5960- radv: Fix dumping vertex descriptors with RADV_DEBUG=hang.
5961- radv/amdgpu: Use rwlock to protect access to virtual BOs.
5962- zink: Fix missing sparse buffer bind synchronization.
5963- zink: Fix waiting for texture commit semaphores.
5964
5965Thomas H.P. Andersen (65):
5966
5967- tgsi: remove unused tgsi_shader_info.num_tokens
5968- tgsi: remove unused tgsi_shader_info.array_max
5969- tgsi: remove unused tgsi_shader_info.num_memory_instructions
5970- tgsi: remove unused tgsi_shader_info.colors_read
5971- tgsi: remove unused tgsi_shader_info.colors_written
5972- tgsi: remove unused tgsi_shader_info.reads_position
5973- tgsi: remove unused tgsi_shader_info.reads_samplemask
5974- svga: remove unused struct field
5975- tgsi: remove unused tgsi_shader_info.reads_tess_factors
5976- tgsi: remove unused tgsi_shader_info fields
5977- tgsi: remove unused tgsi_shader_info fields
5978- tgsi: remove unused tgsi_shader_info.uses_drawid
5979- tgsi: remove unused tgsi_shader_info fields
5980- tgsi: remove unused tgsi_shader_info.uses_subgroup_info
5981- tgsi: remove unused tgsi_shader_info.writes_primid
5982- tgsi: remove unused tgsi_shader_info.uses_doubles
5983- tgsi: remove unused tgsi_shader_info.uses_derivatives
5984- tgsi: remove unused tgsi_shader_info.uses_bindless_samplers
5985- tgsi: remove unused tgsi_shader_info.uses_bindless_images
5986- tgsi: remove unused tgsi_shader_info.clipdist_writemask
5987- tgsi: remove unused tgsi_shader_info.culldist_writemask
5988- tgsi: remove unused tgsi_shader_info.images_load
5989- tgsi: remove unused tgsi_shader_info.images_store
5990- tgsi: remove unused tgsi_shader_info.images_atomic
5991- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load
5992- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store
5993- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic
5994- tgsi: remove unused tgsi_shader_info.uses_bindless_image_load
5995- tgsi: remove unused tgsi_shader_info.uses_bindless_image_store
5996- tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic
5997- tgsi: remove unused tgsi_shader_info.indirect_files_read
5998- tgsi: remove unused tgsi_shader_info.indirect_files_written
5999- tgsi: remove unused tgsi_shader_info.const_buffers_indirect
6000- tgsi: remove unused tgsi_shader_info.max_depth
6001- tgsi: drop two unused functions
6002- nvk: use common physical device enumeration
6003- nvk: fix implicit-fallthrough warnings with clang
6004- nvk: delete commented code
6005- nvk: fix mem leaks
6006- nvk: use common descriptor set layout code
6007- nvk: use common pipeline layout code
6008- nvk: advertise KHR_shader_non_semantic_info
6009- nvk: advertise KHR_image_format_list
6010- nvk: advertise EXT_private_data
6011- nvk: advertise KHR_sampler_mirror_clamp_to_edge
6012- nvk: KHR_descriptor_update_template
6013- nvk: CmdPushDescriptorSetWithTemplateKHR
6014- nvk: drop dead assignment
6015- nvk: drop dead assignment
6016- nvk: fix initialization override
6017- nvk: sort extensions
6018- nvk: advertize KHR_relaxed_block_layout
6019- nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
6020- nvk: advertise EXT_image_2d_view_of_3d
6021- nvk: fix maxPushDescriptors
6022- nvk: call correct macro to clear views
6023- nouveau/mme: use fermi enum in fermi builder
6024- nvk: add warning on non-nouveau drm driver
6025- nvk: Implement VK_KHR_draw_indirect_count on Turing+
6026- nvk: set device info before use in nvk_get_device_extensions
6027- nvk: simplify code by using new helpers
6028- nvk: remove duplicated device features
6029- nvk: EXT_conditional_rendering
6030- nvk: advertise VK_EXT_tooling_info
6031- nvk: set optimization level to 3
6032
6033Thong Thai (3):
6034
6035- radeonsi: enable vcn encoder rgb input support
6036- Update radeon_vcn_enc.c
6037- frontends/va/config: report max width and height for encoding/decoding
6038
6039Timothy Arceri (27):
6040
6041- glsl: fix validation of ES vertex attribs
6042- nir/opt_copy_prop_vars: don't clone copies if branch empty
6043- nir/opt_copy_prop_vars: speedup cloning of copy tables
6044- nir/opt_copy_prop_vars: remove var hash entry on kill alias
6045- nir/opt_copy_prop_vars: skip cloning of copies arrays until needed
6046- nir/opt_copy_prop_vars: drop reuse of dynamic arrays
6047- glsl: fix spirv sso validation
6048- glsl: mark structs containing images as bindless
6049- util: add radeonsi workaround for Nowhere Patrol
6050- glsl: fix out params in glsl to nir
6051- glsl_to_nir: add more unhandled function types
6052- nir: replace use of nir_src_copy()
6053- nir: remove unused nir_src_copy()
6054- nir: remove unused param from nir_alu_src_copy()
6055- glsl: remove field from gl_shader_program
6056- glsl: move get_varying_type() declaration earlier
6057- glsl: add nir version of validate_first_and_last_interface_explicit_locations()
6058- glsl: switch to nir validate_first_and_last_interface_explicit_locations()
6059- glsl: remove unused validate_first_and_last_interface_explicit_locations()
6060- nir: fix typo in comment
6061- nir: copy explicit_invariant flag to nir vars
6062- glsl: move interpolation_string() to linker_util
6063- glsl: move is_gl_identifier() to linker_util
6064- nir: add used field to nir variables
6065- glsl: implement cross_validate_outputs_to_inputs() in nir linker
6066- glsl: switch to nir linkers cross_validate_outputs_to_inputs()
6067- glsl: remove now unused varying linker code
6068
6069Timur Kristóf (39):
6070
6071- aco: Fix subgroup_id intrinsic on GFX10.3+.
6072- ac/nir: Simplify arg unpacking when shift is zero.
6073- ac/nir: Add new pass to lower intrinsics to shader args.
6074- radv: Move radv_select_hw_stage to radv_shader_info.
6075- radv: Use ac_nir_lower_intrinsics_to_args.
6076- radeonsi: Move si_select_hw_stage to si_shader_info.
6077- radeonsi: Use ac_nir_lower_intrinsics_to_args.
6078- aco: Remove subgroup_id and num_subgroups intrinsics.
6079- ac/llvm: Remove subgroup_id and num_subgroups intrinsics.
6080- aco: Refactor select_program to smaller functions.
6081- nir/opt_dead_cf: Remove if branches with undef condition.
6082- ac/nir: Add done arg to ac_nir_export_position.
6083- ac/nir: Slightly refactor how pos0 exports are added when missing.
6084- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
6085- ac/nir/ngg: Refactor mesh shader primitive export.
6086- ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
6087- ac/nir/ngg: Extract nogs_export_vertex_params function.
6088- ac/gpu_info: Add some SDMA related information.
6089- ac: Clarify SDMA opcode defines.
6090- ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk.
6091- ac: Rename ac_do_parse_ib to parse_pkt3_ib.
6092- ac: Print IP type for IBs.
6093- ac: Add rudimentary implementation of printing SDMA IBs.
6094- radv: Rename SDMA file to radv_sdma.c
6095- radv: Use const device argument in radv_sdma_copy_buffer.
6096- radv: Use const on vi_alpha_is_on_msb arguments.
6097- radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues.
6098- radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument.
6099- radv: Refactor WRITE_DATA helper function.
6100- radv: Use new WRITE_DATA helper in more places.
6101- radv: Add queue family argument to some functions.
6102- radv: Wait for bottom of pipe in ACE gang wait postamble.
6103- radv: Simplify gang CS and semaphore initialization.
6104- radv: Allow gang submit use cases other than task shaders.
6105- radv: Slightly refactor gang semaphore functions.
6106- radv: Add gang follower semaphore functions.
6107- radv: Support SDMA in radv_cs_write_data_head.
6108- radv: Support SDMA in radv_cp_wait_mem.
6109- radv: Support SDMA in si_cs_emit_write_event_eop.
6110
6111Vignesh Raman (4):
6112
6113- ci: add Vignesh Raman into restricted traces access list
6114- Do explicit cast to suppress clang warnings
6115- ci: enforce -Wimplicit-const-int-float-conversion for clang
6116- ci: Uprev crosvm
6117
6118Vinson Lee (8):
6119
6120- nvk: Fix assert
6121- lavapipe: Fix struct initialization
6122- intel/decoder: Fix memory leak on error path
6123- nv50: Remove unused value
6124- vk/wsi/x11: Remove dead code
6125- freedreno/replay: Fix implicit-function-declaration error
6126- anv: Fix transfer type assert
6127- broadcom/qpu: Remove duplicate variable opcode
6128
6129Vitaliy Triang3l Kuzmin (3):
6130
6131- r600/asm: Fix AR force_add_cf setting if a clause is not open
6132- r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause
6133- r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN
6134
6135Vlad Schiller (15):
6136
6137- pvr: Implement VK_EXT_tooling_info
6138- pvr: Add 'info' PVR_DEBUG flag
6139- pvr: Implement VK_KHR_format_feature_flags2
6140- pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag
6141- pvr: Add VK_KHR_driver_properties
6142- pvr: Use correct index when writing query availability data
6143- pvr: Enable VK_EXT_scalar_block_layout
6144- pvr: Enable KHR_image_format_list
6145- pvr: Enable VK_KHR_uniform_buffer_standard_layout
6146- pvr: Implement VK_KHR_external_fence
6147- pvr: Implement VK_KHR_external_semaphore
6148- pvr: Enable VK_KHR_bind_memory2 extension
6149- pvr: Implement VK_EXT_texel_buffer_alignment
6150- pvr: Implement VK_EXT_host_query_reset
6151- pvr: Fix VK_EXT_texel_buffer_alignment
6152
6153WinLinux1028 (1):
6154
6155- radeonsi: prefix function with si\_ to prevent name collision
6156
6157Xaver Hugl (1):
6158
6159- vulkan wsi: add support for PresentOptionAsyncMayTear
6160
6161Yiwei Zhang (46):
6162
6163- venus: handle query feedback creation failure
6164- venus: ensure consistency of query overflow behavior
6165- venus: add a missing barrier before copying query feedback
6166- venus: refactor query feedback cmd record
6167- venus: reduce to use 4K mem suballoc align on platforms known to fit
6168- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
6169- lvp: avoid reading immutable sampler from desc write info
6170- ci/venus: update venus-lavapipe expectations
6171- venus: fix a cmd builder render_pass state leak across reset
6172- venus: fix cmd state leak across implicit reset
6173- venus: log and doc the broken query feedback in suspended render pass
6174- venus: move transient storage from cmd to pool
6175- venus: remove redundant fb tracking from cmd builder
6176- venus: use tracked queue_family_index from the cmd pool
6177- venus: cleanup vn_cmd_begin_render_pass usage
6178- venus: add helpers to track subpass view mask
6179- venus: avoid redundant tracking of render pass
6180- venus: refactor more cmd states into cmd builder
6181- venus: use in_render_pass to skip present_src counting
6182- ci/venus: remove fixed tests that no longer run
6183- ci/venus: reenable pipeline cts
6184- venus: suppress a false logging
6185- venus: add no_sparse debug option to disable sparse resource support
6186- venus: set deviceMemoryReport feature
6187- venus: expose at least one cached memory type
6188- venus: expose KHR_external_fence/sempahore_fd extensions
6189- venus: fix a device memory report leak
6190- vulkan: remove a dup entry from vk_image_usage_to_ahb_usage
6191- vulkan/android: improve vkQueueSignalReleaseImageANDROID
6192- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
6193- vulkan/android: drop vk_buffer dependency from common AHB impl
6194- venus: use common vk_queue object
6195- venus: use common ANB implementation
6196- venus: use more common vk_queue related implementations
6197- venus: drop device, family, index, flags tracking from vn_queue
6198- venus: fix re-export of imported classic 3d resources
6199- venus: remove redundant bo roundtrip and add more docs
6200- venus: track VkPhysicalDeviceMemoryProperties instead
6201- venus: refactor vn_device_memory to prepare for async alloc
6202- venus: make device memory alloc async
6203- venus: enable Vulkan 1.3 for Android 13 and above
6204- zink: sync queue access for vkQueueWaitIdle
6205- venus: properly expose KHR_external_fence/sempahore_fd
6206- ci/venus: mark more flaky tests after recent cts uprev
6207- venus: fix query feedback batch leak and race upon submission
6208- zink: apply can_do_invalid_linear_modifier to Venus
6209
6210Yogesh Mohan Marimuthu (12):
6211
6212- gallium: remove start_slot parameter from pipe_context::set_vertex_buffers
6213- ac/surface: add astc block size to bpe_to_format() function
6214- util: move ASTCLutHolder from mesa/main to util
6215- vulkan/formats,zink: move vk_format_from_pipe_format() function
6216- vulkan/runtime: add compute astc decoder helper functions
6217- vulkan add 3D texture support for compute astc decoder
6218- radv: integrate meta astc compute decoder to radv
6219- radeonsi: add more documentation for dpbb debug env variable
6220- docs: remove document for unused variable dfsm from AMD_DEBUG
6221- radeonsi: correct old comment in si_emit_framebuffer_state()
6222- radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx
6223- radeonsi: add radeonsi to GL_RENDERER string
6224
6225Yonggang Luo (43):
6226
6227- lima: Convert to use nir_foreach_function_impl when possible
6228- freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc
6229- zink: Convert to use nir_foreach_function_impl when possible
6230- lavapipe: Convert to use nir_foreach_function_impl
6231- lavapipe: fixes indent of function lvp_inline_uniforms
6232- microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module
6233- microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible
6234- radeonsi: Convert to use nir_foreach_function_impl
6235- ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling
6236- util: Move pipe_swizzle from p_defines.h to u_formats.h
6237- util: Move PIPE_MASK_* from p_defines.h to u_formats.h
6238- util: Move pipe_color_union from p_defines.h into u_formats.h
6239- util: Move u_pack_color.h and dbughelp.h into src/util from/src/gallium/auxiliary/util/
6240- util: Remove include "pipe/\*.h" in src/util/* files
6241- util:Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/
6242- util/meson: Getting mesa util core to be self contained
6243- pvr: decouple vulkan driver and compiler from gallium
6244- freedreno: decouple compiler and vulkan driver from gallium
6245- glx: decouple from gallium
6246- meson: Remove arm_neon_workaround
6247- nouveau/drm-shim: Decouple from gallium
6248- ac/radv: decouple radv vulkan driver and compiler from gallium
6249- etnaviv: decouple drm from gallium
6250- asahi: decouple layout from gallium
6251- compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h
6252- intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h"
6253- intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus
6254- intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_*
6255- crocus: #include "program/prog_instruction.h" for SWIZZLE_*
6256- intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium
6257- util/treewide: Use alignas(x) instead __attribute__((aligned(x)))
6258- v3dv: Use alignas(8) over 64 bit atomic value
6259- svga: use alignas over struct MKSGuestStatInfoEntry
6260- radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT'
6261- v3d: Use DIV_ROUND_UP instead div_round_up
6262- freedreno: Use shared DIV_ROUND_UP instead div_round_up
6263- sfn: Use 4 instead of ATOMIC_COUNTER_SIZE
6264- intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h"
6265- d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS
6266- compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h"
6267- compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp
6268- util: Deduplicate macros between u_math.h and macros.h
6269- nvk: Should use alignment instead of align
6270
6271Yusuf Khan (4):
6272
6273- nouveau/ws: remove the drm.h header
6274- nvk: implement GetDeviceMemoryCommitment
6275- nvk: support GetImageSparseMemoryRequirements2
6276- nvk: expose KHR_driver_properties
6277
6278Zhang Ning (1):
6279
6280- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"
6281
6282antonino (14):
6283
6284- virgl: add ci flake
6285- freedreno: add ci flake
6286- zink: remove unused indices from \`nir_load_push_constant` calls
6287- zink/nir: add a zink specific intrinsic for push constants
6288- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf
6289- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle"
6290- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion"
6291- vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms
6292- vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface
6293- vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects
6294- zink: store bindless var when creating it to avoid creating it again
6295- nir: fix several crashes in \`nir_lower_tex`
6296- nir: don't take the derivative of the array index in \`nir_lower_tex`
6297- vulkan: use instance allocator for \`object_name` in some objects
6298
6299cheyang (1):
6300
6301- isaspec : fix isaspec build error in aosp
6302
6303georgeouzou (1):
6304
6305- nvk: Support VK_EXT_line_rasterization
6306
6307jazzfool (1):
6308
6309- zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3
6310
6311lorn10 (1):
6312
6313- docs: Update Clover's env variable documentation
6314
6315norablackcat (2):
6316
6317- spirv/nir_to_spirv: add expect assume op codes
6318- rusticl: add cl_khr_expect_assume
6319
6320timmac-qmc (1):
6321
6322- glsl: fix potential crash with DisableUniformArrayResize
6323
6324twisted89 (1):
6325
6326- util/driconf: add workarounds for the Chronicles of Riddick
6327
6328wangra (1):
6329
6330- tu/kgsl: Fix bitfield of DITHER_MODE_MRT6
6331
6332xurui (1):
6333
6334- glx: There is no need to psc++
注意: 瀏覽 TracBrowser 來幫助您使用儲存庫瀏覽器

© 2025 Oracle Support Privacy / Do Not Sell My Info Terms of Use Trademark Policy Automated Access Etiquette