Mesa 23.3.0 Release Notes / 2023-11-29
======================================

Mesa 23.3.0 is a new development release. People who are concerned
with stability and reliability should stick with a previous release or
wait for Mesa 23.3.1.

Mesa 23.3.0 implements the OpenGL 4.6 API, but the version reported by
glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
Some drivers don't support all the features required in OpenGL 4.6. OpenGL
4.6 is **only** available if requested at context creation.
Compatibility contexts may report a lower version depending on each driver.

Mesa 23.3.0 implements the Vulkan 1.3 API, but the version reported by
the apiVersion property of the VkPhysicalDeviceProperties struct
depends on the particular driver being used.
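
The apiVersion value is a single packed 32-bit integer, so comparing it
against Vulkan 1.3 means unpacking it first. The sketch below mirrors the
bit layout of the Vulkan ``VK_MAKE_API_VERSION`` / ``VK_API_VERSION_*``
macros in plain Python. The function names are illustrative only and are
not part of Mesa or of any Vulkan binding.

```python
def make_vk_api_version(variant, major, minor, patch):
    """Pack version fields the same way VK_MAKE_API_VERSION does."""
    return (variant << 29) | (major << 22) | (minor << 12) | patch


def decode_vk_api_version(api_version):
    """Unpack a packed Vulkan version into (variant, major, minor, patch)."""
    variant = (api_version >> 29) & 0x7   # top 3 bits
    major = (api_version >> 22) & 0x7F    # next 7 bits
    minor = (api_version >> 12) & 0x3FF   # next 10 bits
    patch = api_version & 0xFFF           # low 12 bits
    return variant, major, minor, patch


def supports_vulkan_1_3(api_version):
    """True if the reported apiVersion is at least Vulkan 1.3 (variant 0)."""
    variant, major, minor, _patch = decode_vk_api_version(api_version)
    return variant == 0 and (major, minor) >= (1, 3)
```

A driver that only exposes Vulkan 1.2 would make ``supports_vulkan_1_3``
return ``False`` even though this Mesa release implements 1.3 elsewhere.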

SHA256 checksum
---------------

::

    50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5  mesa-23.3.0.tar.xz
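
One way to check a downloaded tarball against the checksum above is to
hash it locally and compare. This is a minimal sketch using Python's
standard ``hashlib``. The helper names are made up for illustration, and
it assumes the file was saved as ``mesa-23.3.0.tar.xz``.

```python
import hashlib

# Checksum published in these release notes for mesa-23.3.0.tar.xz.
EXPECTED_SHA256 = "50f729dd60ed6335b989095baad81ef5edf7cfdd4b4b48b9b955917cb07d69c5"


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_release_checksum(path, expected=EXPECTED_SHA256):
    """True if the file's digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()
```

Calling ``matches_release_checksum("mesa-23.3.0.tar.xz")`` should return
``True`` for an intact download. Reading in chunks keeps memory use flat
regardless of the tarball size.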


New drivers
-----------
- NVK: A Vulkan driver for Nvidia hardware

New features
------------
- VK_EXT_pipeline_robustness on ANV
- VK_KHR_maintenance5 on RADV
- OpenGL ES 3.1 on Asahi
- GL_ARB_compute_shader on Asahi
- GL_ARB_shader_atomic_counters on Asahi
- GL_ARB_shader_image_load_store on Asahi
- GL_ARB_shader_image_size on Asahi
- GL_ARB_shader_storage_buffer_object on Asahi
- GL_ARB_sample_shading on Asahi
- GL_OES_sample_variables on Asahi
- GL_OES_shader_multisample_interpolation on Asahi
- GL_OES_gpu_shader5 on Asahi
- EGL_ANDROID_blob_cache works when disk caching is disabled
- VK_KHR_cooperative_matrix on RADV/GFX11+


Bug fixes
---------

- crash in si_update_tess_io_layout_state during _mesa_ReadPixels (radeonsi_dri, mesa 23.2.1)
- mesa: vertex attrib regression
- [RADV] War Thunder has some grass flickering.
- radv: satisfactory broken shader
- RADV problem with R7 M440 in some games
- gpu driver crashes when opening ingame map playing dead space 2023
- [anv] Valheim water misrendering
- EGL/v3d: EGL applications under a X compositor doesn't work
- RADV: trunc_coord breaks ambient occlusion in Dirt Rally and other games
- radv: Mass Effect Legendary Edition: a line going across the screen is visible in some areas with Ambient Occlusion enabled
- anv: DIRT5 gfx11_generated_draws_spv_source triggers "assert(!copy_value_is_divergent(src) || copy_value_is_divergent(dest));"
- panfrost: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12 buffers
- [RADV][TONGA] - BeamNG.drive (284160) - Artifacts are present when looking at the skybox.
- LEGO Star Wars: The Skywalker Saga graphical glitches (DXVK) on R9 380
- [radv] Crypt not rendering properly
- Leaks of DescriptorSet debug names
- [Tracing flake] Missing geometry in trace\@freedreno-a630\@freedoom\@freedoom-phase2-gl-high.trace
- Unreal Engine 5.2 virtual shadow maps have glitchy/lazy tile updates
- RADV: Visual glitches in Unreal Engine 5.2.1 when using material with anisotropy and light channel 2
- radv: Regression with UE5 test
- SIGSEGV with MESA_VK_TRACE=rgp and compute only queue
- [ANV] Corruptions in Battlefield 4
- anv regression w/ commit e488773b29d97 ("anv: Fast clear depth/stencil surface in vkCmdClearAttachments")
- ir3: dEQP-GLES31.functional.synchronization.inter_invocation.image_atomic_read_write crash on a6xx gen4
- Zink + Venus: driver can't handle INVALID<->LINEAR!
- Anv: Particles have black square artifacts on Counter Strike 2 on Skylake
- Lords of the Fallen 2023 Red Eye mode crashing game and desktop
- [radeonsi] [vulkan] [23.3-rc1 regression] Video output corrupted in QMplay2 with Vulkan renderer
- [BISECTED] ac/radeon commit somehow breaks nv12 surface from HEVC decode
- Parsec displays completely green screen with hardware decoder selected while using Mesa 23.3 and Mesa 24
- H264 to H264 transcode output corruption with gst-vaapi
- opencl-jpeg-encoder does not work with nouveau/rusticl, works with nouveau/clover
- [R600] X-plane 11 demo (Linux Native) crashes upon launch on HD5870 and HD6970
- Ubuntu 23.10 build error with rusticl_opencl_bindings.rs
- Rusticl fails to build
- ANV not handling VkMutableDescriptorTypeCreateInfoEXT::pMutableDescriptorTypeLists[i] being out of range
- tu: Wolfenstein: The New Order misrenders on a740
- DRI_PRIME fails with ACO only radeonsi
- nir_to_tgsi: Incorrect handling of indirect array access
- ANV gen9 32 bit vulkan asserts on many cts tests
- GPU hang observed while launching 3DMark Wildlife Unlimited on MTL
- ac/gpu_info: Query maximum submitted IBs from the kernel
- RADV: regression in 23.2.1 causing GPU hang with RDNA1 in various UE5 games
- GPU page faults reported while playing Talos Principle 2 (demo)
- No CCS_E scanout on tgl+ with ANV
- anv: Modifier tests assert-fail on TGL+
- ci: zink-tu jobs no longer included in manual pipelines
- [ANV][A770] GravityMark segfaults and buffer allocation errors
- etnaviv: gc2000 gles2 regression
- ci_run_n_monitor: pipeline finding unreliable
- nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
- anv: jsl timeline semaphores flaky
- anv: OOB access in vkDestroyDevice?
- nvk: Implement VK_EXT_primitive_topology_list_restart
- nvk: Implement VK_EXT_image_sliced_view_of_3d
- nvk: Implement VK_KHR_workgroup_memory_explicit_layout
- util/macros: BITFIELD64_RANGE raises an error with mesa-clang if we try to set last bit
- r300/r400 regression; can't compile \`if/then` in shaders
- iris: gbm_bo_get_offset() wrongly returns 0 for second plane of NV12/P010 buffers
- nvk: Implement VK_EXT_depth_bias_control
- ICL/zink: gpu hang on 'piglit.object namespace pollution.framebuffer with gldrawpixels'
- [R600] Wolfenstein: The New Order text glitch on menu
- need extension to request image/texture not use data dependent compression
- rusticl: segfault in clCreateKernel on AMD Instinct MI100
- !25587 broke xserver
- GPU Hang in Deep Rock Galactic on DG2
- intel: Wrong length for 3DSTATE_3D_MODE on gfx125
- [radeonsi] Wargame: Red Dragon /w OpenGL stopped working with ACO
- traces job reference images missing again sometimes
- Vulkan Texture/Polygon Glitches in Games
- freedreno: dmabuf modify query ignores format
- virgl: removing PIPE_CAP_CLEAR_TEXTURE completely breaks virglrenderer
- Turnip build error on termux
- failiure in amd llvm helper
- failiure in amd llvm helper
- radv_amdgpu_cs_submit: Assertion \`chunk_data[request->number_of_ibs - 1].ib_data.ip_type == request->ip_type' failed.
- hasvk: subgroups regression
- radeonsi: broken hardware decoding (vaapi/vulkan) on RDNA2 gpu (bisected)
- aco: SwizzleInvocationsMaskedAMD behavior is not correct for reads from inactive lanes
- anv: dEQP-VK.ssbo.phys.layout.random.16bit.scalar.13 slow
- [RDNA3] CS:GO - excessive power consumption and lower performance in Vulkan while MSAA is set to 4x or 8x
- [ICL] piglit.spec.arb_gl_spirv.execution.ssbo.unsized-array regression
- radv: Counter Strike 2 has multiple bugs while rendering smoke grenade effect
- Doom Eternal freezing on NAVI31 with current git
- iris CTS blend test fail with MSAA config on DG2
- anv: 32bit mesa asserts
- RADV: Randomly dissapearing objects in Starfield with RX 5xx and Vega graphics
- anv: missing barrier handling on video engines
- radv: Star Wars The Old Republic hang when DCC is enabled
- radv: Resident Evil 6 hangs 7900XTX GPU when DCC is enabled if in Options go to Display settings
- radv: Resident Evil 6 Benchmark Tool hangs 7900 XTX GPU when DCC is enabled immediately after splash screen
- ANV: fp64 shader leaked
- v3d: noop drm-shim raises some warnings
- freedreno: crashdec/etc chip_id support
- intel: compute dispatches with variable workgroup size have ralloc_asprintf CPU overhead
- ci build issues with builtin types
- freedreno: running angle perf traces with GALLIUM_THREAD=0 crashes
- RadeonSI: glClear() causes clear texture for some frames on RX580
- radeonsi: corruption when seeking video decoded with vaapi in mpv
- Zink/HasVK regression bisected to "gallium: move vertex stride to CSO"
- [radv] [Path Of Exile] - one setting in the workaround file breaks shadows/lighting rendering. Other workaround settings seems obsolete.
- radv: images don't always have extents in RGP
- shader_test causing a crash in compiler
- D3D12: Video decoding requirements are too restrictive. ID3D12VideoDevice3 should not be required.
- Crash in st_ReadPixels
- [regression] intel build issue on i386
- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
- zink on hasvk regression: Assertion \`(dyn)->vi_binding_strides[first_binding + i] == (strides[i])' failed.
- Penumbra: Overture hangs on new game loading screen
- [r300, RV516] Some deqp-gles2\@performance\@shader\@control_statement vertex tests cause hard lockup & reboot in mesa 22.3.1 (regression over 22.1.7) on a Radeon X1550
- v3dv: Add a feature that implicitly copies the linear image to the tiled image prior to sampling from it
- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
- h264 encoding picture showed randomly repeated frames.
- Mesa CI: NAVI10 hangs when running VKCTS on Linux 6.1
- zink: no uniform buffer objects support for v3dv?
- v3dv: Request for VkImageDrmFormatModifierExplicitCreateInfoEXT::pPlaneLayouts support
- [ANV] [DG2/A770] The Spirit and The Mouse, occasional flickering geometry
- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
- VAAPI on VCN: bad stream may crash whole gfx system
- Crash after GPU reset
- Bifrost PanVK should not be in CI
- [Intel][Vulkan][Gen12] vkCmdCopyImage() generates garbage data when the destination texture is bound to a piece of used device memory
- mesa: new glcts fails
- tu: GPL support is broken
- lavapipe: ycbcr regression
- aco: Assertion when compiling CP2077 shader
- anv: flakiness on tgl+ with samplemask handling
- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
- r300: optionally convert MULs into output modifier for the following MUL or DOT instructions
- r300: better 1-x presubtract pattern matching
- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
- wsi: deadlocks when DISPLAY is changed
- hasvk: Incompatible with minigbm/gralloc4 on Android
- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
- lavapipe/llvmpipe: shader unregister crash
- [ANV] [DG2/A380] Corruption in Borderlands 3
- blorp regression on dg2
- decouple -Dshader-cache= from EGL_ANDROID_blob_cache
- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
- ubsan + gtest build fails
- glCopyTexSubImage2D is very slow on Intel
- NVE4 (GeForce 710) fails to get vdpau in mesa git
- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
- nouveau prevents hardware acceleration with Chromium (Wayland)
- Corrupt text rendering in Blender
- DRI2 gallium frontend is using bad format type
- regression - MR 23089 - Hellblade RT crashing
- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
- Issue with clang-format
- Follow-up from "Draft: intel: Disable color fast-clears for blorp_copy"
- nightly VA-API build: new timeout
- r600: retire the SB optimizer
- ci: do not download perfetto on-fly in build jobs
- Shared Memory Leak With Qt OpenGL Applications
- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
- vaDeriveImage returns VA_STATUS_ERROR_OPERATION_FAILED
- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video.
- zink: reduce pipeline hash size
- Rusticl,radeonsi: ac_rtld error(2): too much LDS
- aco, radv Rage 2 menu corruption - bisected
- radv, aco: World War Z character texture regression on 7900xtx
- android: De-stage drm_gralloc support from mesa3d
- Cyberpunk screen goes black at game launch on integrated Gfx
- lavapipe/llvmpipe: regressions since descriptor rewrite
- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
- ci: HW job logs have spam at the end
- kernel crash seen on AMD Raven device
- crocus: regression crashing in doubles/ubo tests
- turnip: object management CTS crashes
- a618: multiple assertions with different kernel config on u_vector_add
- [anv] Death Stranding crashes
- Can no longer build Clover without llvmspirvlib
- [radeonsi][vaapi] segfault in vl_video_buffer_sampler_view_components() when using vaapisink receiving I420 format
- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
- [AMDGPU] Compiling large Blender Eevee shader node trees is unusably slow
- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
- r300: calculate some cycles estimate for shader-db
- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
- error message when encoding via VAAPI AMD
- gpu hangs on dg2 with mesh shading enabled on vkcts
- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
- robustness2 raygen tests intermittently fail in Intel Mesa CI
- ci/ci_run_n_monitor.py: KeyError: 'clang-format'
- glthread: huge performance regression
- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
- rusticl: fails to build for iris + radeonsi


Changes
-------

Adam Jackson (3):

- egl: Implement EGL_EXT_explicit_device
- mesa: Implement and advertise GL_MESA_sampler_objects
- docs: Mention 'meson devenv' in the pre-install test instructions

Aditya Swarup (6):

- isl: enable Tile64 for 3D images
- intel/isl: Unittest for linear to Ytile conversion
- intel/isl: Convert linear texture to Tile4 format
- intel/isl: Convert Tile4 texture to linear format
- intel/isl: Linear to Tile-4 conversion unittest
- Revert "iris: Disable tiled memcpy for Tile4"

Alba Mendez (1):

- meson: support installation tags

Alejandro Piñeiro (61):

- v3dv: re-enable sync_fd import/export on the simulator
- broadcom(cle,clif,common,simulator): add 7.1 version on the list of versions to build
- broadcom/cle: update the packet definitions for new generation v71
- broadcom/common: add some common v71 helpers
- broadcom/qpu: add comments on waddr not used on V3D 7.x
- broadcom/qpu: set V3D 7.x names for some waddr aliasing
- broadcom/compiler: rename small_imm to small_imm_b
- broadcom/compiler: add small_imm a/c/d on v3d_qpu_sig
- broadcom/qpu: add v71 signal map
- broadcom/qpu: define v3d_qpu_input, use on v3d_qpu_alu_instr
- broadcom/qpu: add raddr on v3d_qpu_input
- broadcom/qpu: defining shift/mask for raddr_c/d
- broadcom/commmon: add has_accumulators field on v3d_device_info
- broadcom/qpu: add qpu_writes_rf0_implicitly helper
- broadcom/qpu: add pack/unpack support for v71
- broadcom/compiler: phys index depends on hw version
- broadcom/compiler: don't favor/select accum registers for hw not supporting it
- broadcom/vir: implement is_no_op_mov for v71
- broadcom/compiler: update vir_to_qpu::set_src for v71
- broadcom/qpu_schedule: add process_raddr_deps
- broadcom/qpu: update disasm_raddr for v71
- broadcom/qpu: return false on qpu_writes_accumulatorXX helpers for v71
- broadcom/compiler: add support for varyings on nir to vir generation for v71
- broadcom/compiler: payload_w is loaded on rf3 for v71
- broadcom/qpu_schedule: update write deps for v71
- broadcom/compiler: update register classes to not include accumulators on v71
- broadcom/qpu: implement switch rules for fmin/fmax fadd/faddnf for v71
- broadcom/compiler: update one TMUWT restriction for v71
- broadcom/compiler: update ldunif/ldvary comment for v71
- broadcom/compiler: update payload registers handling when computing live intervals
- broadcom/qpu: new packing/conversion v71 instructions
- v3dv/meson: add v71 hw generation
- v3dv: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
- v3dv/cmd_buffer: emit TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1 for v71
- v3dvx/cmd_buffer: emit CLEAR_RENDER_TARGETS for v71
- v3dv/cmd_buffer: emit CLIPPER_XY_SCALING for v71
- v3dv/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
- v3dv/cmd_buffer: just don't fill up early-z fields for CFG_BITS for v71
- v3dv: default vertex attribute values are gen dependant
- v3dv/pipeline: default vertex attributes values are not needed for v71
- v3dv/pipeline: handle GL_SHADER_STATE_RECORD changed size on v71
- v3dv: no specific separate_segments flag for V3D 7.1
- v3dv: add support for TFU jobs in v71
- v3d: add v71 hw generation
- v3d: emit TILE_BINNING_MODE_CFG and TILE_RENDERING_MODE_CFG_COMMON for v71
- v3d: TILE_RENDERING_MODE_CFG_RENDER_TARGET_PART1
- v3d: emit CLEAR_RENDER_TARGETS for v71
- v3d: just don't fill up early-z fields for CFG_BITS for v71
- v3d: emit CLIPPER_XY_SCALING for v71
- v3d: no specific separate_segments flag for V3D 7.1
- v3d: default vertex attributes values are not needed for v71
- v3d/uniforms: update VIEWPORT_X/Y_SCALE uniforms for v71
- v3d: handle new texture state transfer functions in v71
- v3d: handle new TEXTURE_SHADER_STATE v71 YCbCr fields
- v3d: setup render pass color clears for any format bpp in v71
- v3d: GFX-1461 does not affect V3D 7.x
- v3d: don't convert floating point border colors in v71
- v3d: handle Z clipping in v71
- v3d: add support for TFU blit in v71
- v3dv: implement depthBounds support for v71
- doc/features: update after last v3d changes

Alex Denes (1):

- virgl: link VA driver with build-id

Alexander Orzechowski (1):

- radeonsi: Set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET for auxiliary contexts
337 |
|
---|
338 | Alyssa Rosenzweig (431):
|
---|
339 |
|
---|
340 | - zink: Switch to register intrinsics
|
---|
341 | - gallium/trace: Collect enums from multiple files
|
---|
342 | - gallium,util: Move blend enums to util/
|
---|
343 | - gallium,util: Move util_blend_dst_alpha_to_one
|
---|
344 | - util/blend: Add helpers for normalizing inverts
|
---|
345 | - vulkan: Add helpers for blend enum translation
|
---|
346 | - lvp: Use common blend/logicop translation
|
---|
347 | - nir/lower_blend: Use util enums
|
---|
348 | - panfrost: Convert to PIPE_BLEND enums internally
|
---|
349 | - gallium: Remove pipe->compiler BLEND enum translation
|
---|
350 | - compiler: Remove blend enums duplicating util
|
---|
351 | - nir/legacy: Fix fneg(load_reg) case
|
---|
352 | - nir/legacy: Fix handling of fsat(fabs)
|
---|
353 | - ntt: Switch to new-style registers and modifiers
|
---|
354 | - ir3: Convert to register intrinsics
|
---|
355 | - nir: Add fence_{pbe,mem}_to_tex(_pixel)_agx intrinsics
|
---|
356 | - nir: Devendor load_sample_mask
|
---|
357 | - nir: Promote tess_coord_r600 to tess_coord_xy
|
---|
358 | - nir: Add nir_lower_tess_coord_z pass
|
---|
359 | - r600: Use nir_lower_tess_coord_xy
|
---|
360 | - ir3: Use nir_lower_tess_coord_z
|
---|
361 | - nir: Initialize workgroup_size in builder_init_simple_shader
|
---|
362 | - v3dv: Rely on nir_builder setting workgroup size
|
---|
363 | - radv: Rely on workgroup_size initialization
|
---|
364 | - panfrost: Fix transform feedback on v9
|
---|
365 | - r600/sfn: Remove nir_register unit tests
|
---|
366 | - panfrost: Lower vertex_id for XFB
|
---|
367 | - panfrost: Fix transform feedback on v9 harder
|
---|
368 | - asahi: Augment fake drm_asahi_params_global
|
---|
369 | - asahi: Use nir_builder_at more
|
---|
370 | - asahi: Remove unused #define
|
---|
371 | - asahi: Refactor PBE upload routine
|
---|
372 | - asahi: Extract shader_initialize helper
|
---|
373 | - asahi: Serialize NIR in memory
|
---|
374 | - asahi: Identify background/EOT counts
|
---|
375 | - asahi,agx: Set coherency bit for clustered targets
|
---|
376 | - ail: Page-align layers for writable images
|
---|
377 | - asahi: Mark writeable images as such
|
---|
378 | - asahi: Reallocate to set the writeable image flag
|
---|
379 | - asahi: Add agx_batch_track_image helper
|
---|
380 | - asahi: Add texture/image indexing lowering pass
|
---|
381 | - asahi: Upload at most the max texture state registers
|
---|
382 | - asahi: Upload image descriptors
|
---|
383 | - asahi: Make clear the non-sRGBness of EOT images
|
---|
384 | - asahi: Don't restrict sampler views
|
---|
385 | - asahi: Forbid 2D Linear with images
|
---|
386 | - agx: Add try_coalesce_with helper
|
---|
387 | - agx: Try to allocate phis compatibly with sources
|
---|
388 | - agx: Try to allocate phi sources with phis
|
---|
389 | - agx: Try to allocate phi sources with loop phis
|
---|
390 | - agx: Vectorize 16-bit parallel copies
|
---|
391 | - agx: Reduce un/packs with mem access lowering
|
---|
392 | - agx: Fix bogus assert
|
---|
393 | - asahi: Augment PBE descriptor for software access
|
---|
394 | - asahi: Extend PBE packing for image support
|
---|
395 | - asahi: Use nir_lower_robust_access
|
---|
396 | - agx: Legalize image LODs to be 16-bit
|
---|
397 | - agx: Lower image size to txs
|
---|
398 | - agx: Generalize texture/PBE packing
|
---|
399 | - agx: Add image write instruction
|
---|
400 | - agx: Model texture bindless base
|
---|
401 | - agx: Handle bindless properly for txs lowering
|
---|
402 | - agx: Pack bindless textures
|
---|
403 | - agx: Translate texture bindless handles
|
---|
404 | - agx: Translate image_store from NIR
|
---|
405 | - agx: Handle frag side effects without render targets
|
---|
406 | - agx: Wait for outstanding stores before barriers
|
---|
407 | - agx: Implement image barriers
|
---|
408 | - agx: Handle early_fragment_tests
|
---|
409 | - agx: Add interleave opcode
|
---|
410 | - agx: Extract coords_for_buffer_texture helper
|
---|
411 | - agx: Extract texture_descriptor_ptr_for_* helpers
|
---|
412 | - agx: Lower image atomics
|
---|
413 | - agx: Lower buffer images
|
---|
414 | - asahi,agx: Fix txf sampler
|
---|
415 | - agx: Add image_load opcode
|
---|
416 | - agx: Extract texture write mask handling
|
---|
417 | - agx: Implement image_load
|
---|
418 | - agx: Emit global memory barriers for images
|
---|
419 | - agx: Don't emit silly barriers
|
---|
420 | - agx: Implement fence_*_to_tex_agx intrinsics
|
---|
421 | - agx: Add simple image fencing pass
|
---|
422 | - agx: Require tag writes with side effects
|
---|
423 | - agx: Plumb in coverage mask
|
---|
424 | - asahi: Extract sampler_view_for_surface
|
---|
425 | - asahi: Introduce concept of spilled render targets
|
---|
426 | - asahi: Add agx_tilebuffer_spills query
|
---|
427 | - asahi: Do not support masking with spilled RTs
|
---|
428 | - asahi: Ignore spilled render targets in EOT shaders
|
---|
429 | - asahi: Ignore spilled render targets with partial renders
|
---|
430 | - asahi: Extract some tilebuffer lowering code
|
---|
431 | - asahi: Lower tilebuffer access for spilled RTs
|
---|
432 | - asahi: Lower multisample image stores
|
---|
433 | - asahi: Permit meta shaders to use preambles
|
---|
434 | - asahi: Ignore spilled render targets for background load
|
---|
435 | - asahi: Offset clear colour uniform by 4
|
---|
436 | - asahi: Execute preambles for background programs
|
---|
437 | - asahi: Advertise Z16_UNORM
|
---|
438 | - ir2: Switch to nir_legacy
|
---|
439 | - intel/fs: Don't read reg.base_offset
|
---|
440 | - panfrost: Remove unused helpers
|
---|
441 | - nir: Remove nir_lower_locals_to_regs
|
---|
442 | - nir: Rename lower_locals_to_reg_intrinsics back
|
---|
443 | - nir: Remove register arrays
|
---|
444 | - asahi: Don't depend on glibc to decode
|
---|
445 | - pan/bi: Remove leftover include
|
---|
446 | - nir/trivialize: Handle more RaW hazards
|
---|
447 | - panfrost: Disable blending for no-op logic ops
|
---|
448 | - nir/lower_blend: Fix 32-bit logicops
|
---|
449 | - nir/lower_blend: Optimize out PIPE_LOGICOP_NOOP
|
---|
450 | - clang-format: Ignore original panfrost commit
|
---|
451 | - nir/schedule: Assume no old-style registers
|
---|
452 | - gallium/u_simple_shaders: Optimize out ffloors
|
---|
453 | - gallium/u_transfer_helper: Remove dead forward decl
|
---|
454 | - nir/loop_analyze: Drop unused inverse_comparison
|
---|
455 | - nir/passthrough_gs: Drop unused array_size_for_prim
|
---|
456 | - panfrost: Add missing static inline annotation
|
---|
457 | - pan/decode: Drop unused debug function
|
---|
458 | - pan/mdg: Add missing static inline annotation
|
---|
459 | - panfrost: Drop unused decode_position for samples
|
---|
460 | - panfrost: Only define pan_blitter_get_blend_shaders for midgard
|
---|
461 | - panfrost: Add missing inline
|
---|
462 | - panfrost: Gate overdraw_alpha on Bifrost+
|
---|
463 | - nir: Rename scoped_barrier -> barrier
|
---|
464 | - nir: Remove lower_to_source_mods
|
---|
465 | - nir: Remove lower_vec_to_movs
|
---|
466 | - nir: Remove reg_intrinsics parameter to convert_from_ssa
|
---|
467 | - nir: Remove register load/store builders
|
---|
468 | - r600/sfn: Stop referencing legacy functionality
|
---|
469 | - r600/sfn: Ignore instruction write masks
|
---|
470 | - nouveau/codegen: Drop writemask check
|
---|
471 | - vc4,broadcom/compiler: Drop write_mask handling
|
---|
472 | - zink: Collapse is_ssa check
|
---|
473 | - nir: Add {...} before case
|
---|
474 | - nir/from_ssa: Drop legacy reg support
|
---|
475 | - nir/schedule: Drop nir_schedule_dest_pressure
|
---|
476 | - nir: Drop NIR reg create/destroy
|
---|
477 | - nir: Remove nir_index_local_regs and callers
|
---|
478 | - nir/schedule: Drop more nir_register handling
|
---|
479 | - nir: Remove nir_foreach_register
|
---|
480 | - nir: remove nir_{src,dest}_for_reg
|
---|
481 | - ntt: Drop nir_register reference
|
---|
482 | - nir/print: Assume SSA
|
---|
483 | - nir/clone: Assume SSA
|
---|
484 | - nir/serialize: Drop legacy NIR
|
---|
485 | - nir/validate: Assume SSA
|
---|
486 | - nir: Remove impl->{registers,reg_alloc}
|
---|
487 | - nir: Remove nir_alu_dest::saturate
|
---|
488 | - treewide: Drop is_ssa asserts
|
---|
489 | - nir: Collapse some SSA checks
|
---|
490 | - treewide: Remove more is_ssa asserts
|
---|
491 | - nir: Remove reg-only dest manipulation
|
---|
492 | - nir: Remove stale todo
|
---|
493 | - nir/print: Drop legacy NIR
|
---|
494 | - nir: Drop nir_alu_src::{negate,abs}
|
---|
495 | - treewide: sed out more is_ssa
|
---|
496 | - pan/mdg: Assume SSA
|
---|
497 | - treewide: Drop some is_ssa if's
|
---|
498 | - nir: Drop trivial reg handling
|
---|
499 | - aco: Remove is_ssa check
|
---|
500 | - intel: Collapse is_ssa checks
|
---|
501 | - llvmpipe: Assume SSA
|
---|
502 | - ir3: Collapse is_ssa checks
|
---|
503 | - lima: Collapse is_ssa checks
|
---|
504 | - radeonsi: Collapse SSA check
|
---|
505 | - nir/gather_ssa_types: Collapse SSA checks
|
---|
506 | - nir/worklist: Assume SSA
|
---|
507 | - nir/range_analysis: Assume SSA
|
---|
508 | - treewide: Collapse more SSA checks
|
---|
509 | - nir/instr_set: Assume SSA
|
---|
510 | - nir: Collapse more SSA checks
|
---|
511 | - nir: Remove def_is_register
|
---|
512 | - nir: Do not init dests
|
---|
513 | - nir: Initialize source as a NULL SSA def
|
---|
514 | - nir: Collapse more SSA checks
|
---|
515 | - nir: Remove nir_{src,dest}::is_ssa
|
---|
516 | - nir: Drop nir_register
|
---|
517 | - nir/from_ssa: Remove pointless union
|
---|
518 | - ir3: Drop write_mask handling
|
---|
519 | - rogue: Stop reading write masks
|
---|
520 | - etnaviv: Don't use alu->dest.write_mask
|
---|
521 | - etnaviv: What if we just didn't have a compiler?
|
---|
522 | - intel/vec4: Don't use legacy write mask
|
---|
523 | - ntt: Evaluate write_mask check
|
---|
524 | - nir: Remove nir_alu_dest::write_mask
|
---|
525 | - nir: Remove nir_foreach_def
|
---|
526 | - lima: Clean up after deleting asserts
|
---|
527 | - nir: Remove no-op remove_def_cb
|
---|
528 | - nir: Drop no-op all_srcs_are_ssa
|
---|
529 | - nir: Simplify alu_instr_is_copy
|
---|
530 | - nir: Add load_coefficients_agx intrinsic
|
---|
531 | - agx: Implement nir_intrinsic_load_coefficients_agx
|
---|
532 | - agx: Allow more varying slots
|
---|
533 | - agx: Set lower_fisnormal
|
---|
534 | - agx: Forcibly vectorize pointcoord coeffs
|
---|
535 | - agx: Add interpolateAtOffset lowering pass
|
---|
536 | - agx: Lower flat shading in NIR
|
---|
537 | - asahi: Stub num_dies
|
---|
538 | - asahi: Move a bunch of helpers to common
|
---|
539 | - agx: Lower 8-bit ALU
|
---|
540 | - agx: Handle 8-bit vecs
|
---|
541 | - asahi,agx: Respect no16 even for I/O
|
---|
542 | - agx: Don't lower load_local_invocation_index
|
---|
543 | - agx/dce: Use the helper
|
---|
544 | - agx: Fix atomics with no destination
|
---|
545 | - agx: Fix shader info with sample mask writes
|
---|
546 | - agx: Do not move bindless handles
|
---|
547 | - agx: Put else instructions in the right block
|
---|
548 | - agx: Use unconditional else instruction
|
---|
549 | - agx: Optimize out pointless else instructions
|
---|
550 | - agx: Fix length bit confusion
|
---|
551 | - agx: Require an immediate for \`nest\`
|
---|
552 | - agx: Use compressed fadd/fmul encodings
|
---|
553 | - agx: Optimize swaps of 2x16 channels
|
---|
554 | - agx: Optimize logical_end removal
|
---|
555 | - agx: Fix AGX_MESA_DEBUG=demand
|
---|
556 | - agx: Maintain ctx->max_reg while assigning regs
|
---|
557 | - agx: Allow 64-bit memory regs
|
---|
558 | - agx: Fix accounting for phis
|
---|
559 | - agx: Set phi sources in predecessors
|
---|
560 | - agx: Stop setting registers after the shader
|
---|
561 | - agx: Use agx_replace_src
|
---|
562 | - agx: Assert invariant stated in the comment
|
---|
563 | - agx: Don't use ssa_to_reg across blocks
|
---|
564 | - agx: Don't reuse ssa_to_reg across blocks
|
---|
565 | - agx: Remove unused allocation
|
---|
566 | - agx: Stop setting forwarding bit
|
---|
567 | - agx: Handle blocks with no predecessors
|
---|
568 | - agx: Lower f2u8/f2i8
|
---|
569 | - agx: Handle conversions to 8-bit
|
---|
570 | - agx: Fix uadd_sat packing
|
---|
571 | - agx: Fix 64-bit immediate moves
|
---|
572 | - agx: Lower f2f16_rtz
|
---|
573 | - agx: Handle f2f16_rtne like f2f16
|
---|
574 | - agx: Handle <32-bit local memory access
|
---|
575 | - agx: Do not allow creating vec8
|
---|
576 | - asahi: Legalize compression before blitting
|
---|
577 | - nir: Drop "SSA" from NIR language
|
---|
578 | - agx: Stop passing nir_dest around
|
---|
579 | - agx: Remove agx_nir_ssa_index
|
---|
580 | - pan/mdg: Don't reference nir_dest
|
---|
581 | - pan/bi: Don't reference nir_dest
|
---|
582 | - asahi: Do not reference nir_dest
|
---|
583 | - panfrost: Do not reference nir_dest
|
---|
584 | - zink: Do not reference nir_dest
|
---|
585 | - ir3: Do not reference nir_dest
|
---|
586 | - dxil: Do not reference nir_dest
|
---|
587 | - nir: Drop nir_dest_init
|
---|
588 | - panfrost: Pack stride at CSO create time on v9
|
---|
589 | - lvp,nir/lower_input_attachments: Use nir_trim_vector
|
---|
590 | - broadcom/compiler: Use nir_trim_vector explicitly
|
---|
591 | - nir: Assert that nir_ssa_for_src components matches
|
---|
592 | - nir: Add nir_shader_intrinsics_pass
|
---|
593 | - nir: Lower fquantize2f16
|
---|
594 | - agx: Lower fquantize2f16
|
---|
595 | - nir/lower_helper_writes: Consider bindless images
|
---|
596 | - nir/passthrough_gs: Correctly set vertices_in
|
---|
597 | - nir/passthrough_gs: Fix array size
|
---|
598 | - nir/print: Print access qualifiers for intrinsics
|
---|
599 | - nir/lower_gs_intrinsics: Remove end primitive for points
|
---|
600 | - panfrost/ci: Disable T720
|
---|
601 | - nir: Add load_sysval_agx intrinsic
|
---|
602 | - agx: Fix extraneous bits with b2b32
|
---|
603 | - agx: Use more barriers
|
---|
604 | - asahi: Copy CSO stride
|
---|
605 | - agx: Assert vertex_id, instance_id are VS-only
|
---|
606 | - asahi: Keep drawoverhead from OOMing itself
|
---|
607 | - agx: Don't blow up when lowering textures twice
|
---|
608 | - agx/lower_vbo: Handle nonzero component
|
---|
609 | - agx: Allow loop headers without later preds
|
---|
610 | - agx: Handle b2i8
|
---|
611 | - agx: Convert 8-bit comparisons
|
---|
612 | - agx: Implement imul_high
|
---|
613 | - asahi: Advertise OpenGL ES 3.1!
|
---|
614 | - asahi/decode: Turn assert into error
|
---|
615 | - asahi: Report local_size from compiler
|
---|
616 | - asahi: Use local_size from compiler directly
|
---|
617 | - asahi: Pass layer stride in pixels, not elements
|
---|
618 | - agx: Clear sample count after lowering MSAA
|
---|
619 | - agx: Clear image_array after lowering
|
---|
620 | - asahi: Preserve atomic ops when rewriting image to bindless
|
---|
621 | - agx: Use 16-bit reg for pixel_coord
|
---|
622 | - asahi: Generalize query logic
|
---|
623 | - asahi: Simplify occlusion query batch tracking
|
---|
624 | - asahi: Refactor agx_get_query_result
|
---|
625 | - asahi: Only touch batch->occlusion_queries for occlusion
|
---|
626 | - asahi: Sync when beginning a query
|
---|
627 | - asahi: Add non-occlusion query tracking
|
---|
628 | - asahi: Add get_query_address helper
|
---|
629 | - agx/fence_images: Use intrinsics_pass
|
---|
630 | - agx: Do not fence write-only images
|
---|
631 | - asahi: Add missing LOD source for agx_meta's txfs
|
---|
632 | - agx: Do some texture lowering early
|
---|
633 | - agx: Add helper returning if a descriptor crawl is needed
|
---|
634 | - nir,asahi: Remove texture_base_agx
|
---|
635 | - asahi: Move UBO lowering into GL driver
|
---|
636 | - asahi: Add sysval tables for each shader stage
|
---|
637 | - asahi: Split out per-stage sysvals
|
---|
638 | - asahi: Collapse grid_info
|
---|
639 | - asahi: Extract agx_upload_textures
|
---|
640 | - asahi: Upload a single draw_uniforms per draw
|
---|
641 | - asahi: Add real per-stage dirty flags
|
---|
642 | - asahi: Extract sampler upload
|
---|
643 | - asahi: Put unuploaded uniforms on the batch
|
---|
644 | - asahi: Decouple sysval lowering from uniform assignment
|
---|
645 | - asahi: Use finer dirty tracking for blend constant
|
---|
646 | - asahi: Use proper dirty tracking for VBOs
|
---|
647 | - asahi: Dirty track VBOs + blend const separately
|
---|
648 | - asahi: Dirty the shader stage when the shader changes
|
---|
649 | - asahi: Fix shader stage dirtying
|
---|
650 | - treewide: Use nir_shader_intrinsics_pass sometimes
|
---|
651 | - treewide: Also handle struct nir_builder form
|
---|
652 | - nir/lower_shader_calls: Fix warning with clang
|
---|
653 | - nir: Add nir_before/after_impl cursors
|
---|
654 | - treewide: Use nir_before/after_impl in easy cases
|
---|
655 | - treewide: Use nir_before/after_impl for more elaborate cases
|
---|
656 | - radv: Use before/after_cf_list for entrypoints
|
---|
657 | - ci: Disable known broken Bifrost Vulkan job
|
---|
658 | - ci: Disable WHL jobs
|
---|
659 | - nir/opt_if: Simplify if's with general conditions
|
---|
660 | - asahi: Fixes for clang-warnings
|
---|
661 | - agx: Fix jmp_exec_none encoding
|
---|
662 | - agx/validate: Print to stderr
|
---|
663 | - agx: Annotate opcodes with a scheduling class
|
---|
664 | - agx: Add schedule-specialized get_sr variants
|
---|
665 | - agx: Include schedule class in the opcode info
|
---|
666 | - agx: Schedule for register pressure
|
---|
667 | - agx: Lower pack_32_4x8_split
|
---|
668 | - asahi: Force translucency for ignored render targets
|
---|
669 | - agx: Remove logical_end instructions
|
---|
670 | - agx: Lower pseudo-ops later
|
---|
671 | - agx: Expand nest
|
---|
672 | - agx: Lower nest later
|
---|
673 | - agx: Split nest instruction into begin_cf + break
|
---|
674 | - agx: Add break_if_*cmp instructions
|
---|
675 | - agx: Add agx_first/last_instr helpers
|
---|
676 | - agx: Use agx_first_instr
|
---|
677 | - agx: Detect conditional breaks
|
---|
678 | - agx: Omit push_exec at top level
|
---|
679 | - agx: Omit while_icmp without continue
|
---|
680 | - agx: Add helper to determine if a NIR loop uses continue
|
---|
681 | - agx: Only use nest by 1 for loops w/o continue
|
---|
682 | - agx: Add pseudo-instructions for icmp/fcmp
|
---|
683 | - agx: Generate unfused comparison pseudo ops
|
---|
684 | - agx: Fuse conditions into if's
|
---|
685 | - agx: Fuse compares into selects
|
---|
686 | - agx: Add unit test for if_cmp fusing
|
---|
687 | - agx: Add unit test for cmp+sel fusing
|
---|
688 | - asahi: Translate cube array dimension
|
---|
689 | - ail: Force page-alignment for layered attachments
|
---|
690 | - agx: Handle cube arrays when clamping arrays
|
---|
691 | - agx: Lower coordinates for cube map array images
|
---|
692 | - agx: Run opt_idiv_const after lowering texture
|
---|
693 | - asahi: Forbid linear 1D Array images
|
---|
694 | - asahi: Handle linear 1D Arrays
|
---|
695 | - asahi: Conditionally expose cube arrays
|
---|
696 | - gallium,mesa/st: Add PIPE_CONTEXT_NO_LOD_BIAS flag
|
---|
697 | - asahi: Skip LOD bias lowering for GLES
|
---|
698 | - nir: Add nir_function_instructions_pass helper
|
---|
699 | - nir: Add NIR_OP_IS_DERIVATIVE property
|
---|
700 | - nir: Hoist nir_op_is_derivative
|
---|
701 | - nir/opt_preamble: Use nir_op_is_derivative
|
---|
702 | - nir/opt_gcm: Use nir_op_is_derivative more
|
---|
703 | - nir/gather_info: Use nir_op_is_derivative
|
---|
704 | - nir/opt_sink: Sink load_constant_agx
|
---|
705 | - nir/opt_sink: Sink load_local_pixel_agx
|
---|
706 | - nir/opt_sink: Sink frag coord instructions
|
---|
707 | - nir/opt_sink: Do not move derivatives
|
---|
708 | - nir/opt_sink: Move ALU with constant sources
|
---|
709 | - nir/opt_sink: Also consider load_preamble as const
|
---|
710 | - agx: Enable sinking ALU
|
---|
711 | - treewide: Drop nir_ssa_for_src users
|
---|
712 | - treewide: Remove remaining nir_ssa_for_src
|
---|
713 | - nir: Remove nir_ssa_for_src
|
---|
714 | - asahi: Clamp index buffer extent to what's read
|
---|
715 | - agx: Align the reg file for 256-bit vectors
|
---|
716 | - agx: Hoist sample_mask/zs_emit
|
---|
717 | - agx: Set PIPE_SHADER_CAP_CONT_SUPPORTED
|
---|
718 | - agx: Augment if/else/while_cmp with a target
|
---|
719 | - agx: Add jumps to block ends
|
---|
720 | - agx: Add agx_prev_block helper
|
---|
721 | - agx: Insert jmp_exec_none instructions
|
---|
722 | - nir: Add layer_id_written_agx sysval
|
---|
723 | - nir: Support arrays in block_image_store_agx
|
---|
724 | - agx/nir_lower_texture: Allow disabling layer clamping
|
---|
725 | - agx: Pack block image store dim correctly
|
---|
726 | - agx: Handle layered block image stores
|
---|
727 | - agx: Add pass to lower layer ID writes
|
---|
728 | - asahi: Add helper to get layer id in internal program
|
---|
729 | - asahi,agx: Select layered rendering outputs
|
---|
730 | - agx: Support packed layered rendering writes
|
---|
731 | - agx/tilebuffer: Support layered layouts
|
---|
732 | - agx/lower_tilebuffer: Support spilled layered RTs
|
---|
733 | - asahi: Use layered layouts
|
---|
734 | - asahi: Expose VS_LAYER_VIEWPORT behind a flag
|
---|
735 | - asahi: Account for layering for attachment views
|
---|
736 | - asahi: Assume LAYER is flat-shaded
|
---|
737 | - asahi: Add pass to predicate layer ID reads
|
---|
738 | - asahi: Predicate layer ID reads
|
---|
739 | - asahi: Write to cubes/etc attachments as 2D array
|
---|
740 | - asahi: Use a 2D Array texture for array render targets
|
---|
741 | - asahi: Generate layered EOT programs
|
---|
742 | - asahi: Handle layered background programs
|
---|
743 | - lima/pp: Do not use union undefined behaviour
|
---|
744 | - nir: Add trivial nir_src_* getters
|
---|
745 | - nir: Use set_parent_instr internally
|
---|
746 | - nir: Use getters for nir_src::parent_*
|
---|
747 | - nir: Assert the nir_src union is used safely
|
---|
748 | - nir: Use a tagged pointer for nir_src parents
|
---|
749 | - nir: Add ACCESS_CAN_SPECULATE
|
---|
750 | - ir3: Set CAN_SPECULATE before opt_preamble
|
---|
751 | - ir3: Model cost of phi nodes for opt_preamble
|
---|
752 | - nir/opt_preamble: Walk cf_list manually
|
---|
753 | - nir/opt_preamble: Preserve IR when replacing phis
|
---|
754 | - nir/opt_preamble: Unify foreach_use logic
|
---|
755 | - nir/opt_preamble: Move phis for movable if's
|
---|
756 | - nir/opt_preamble: Respect ACCESS_CAN_SPECULATE
|
---|
757 | - freedreno/ci: Minetest
|
---|
758 | - r600/sfn: Handle load_global_constant
|
---|
759 | - nir/opt_phi_precision: Work with libraries
|
---|
760 | - nir/legalize_16bit_sampler_srcs: Use instr_pass
|
---|
761 | - nir/print: Handle KERNEL
|
---|
762 | - nir/lower_io: Use load_global_constant for OpenCL
|
---|
763 | - nir/opt_algebraic: Reduce int64
|
---|
764 | - nir/opt_algebraic: Optimize LLVM booleans
|
---|
765 | - nir/trivialize_registers: Handle obscure load hazard
|
---|
766 | - hasvk: Support building on non-Intel
|
---|
767 | - crocus: Support building on non-Intel
|
---|
768 | - meson: Add vulkan-drivers=all option
|
---|
769 | - meson: Add gallium-drivers=all option
|
---|
770 | - agx: Fix fragment side effects scheduling
|
---|
771 |
|
---|
772 | Amber (7):
|
---|
773 |
|
---|
774 | - ir3: make wave_granularity configurable
|
---|
775 | - turnip: Add support for devices not supporting double thread size.
|
---|
776 | - turnip: make sampler_minmax support configurable.
|
---|
777 | - freedreno, turnip: set correct reg_size_vec4 for a6xx_gen1_low
|
---|
778 | - ir3: handle non-uniform case for atomic image/ssbo intrinsics
|
---|
779 | - freedreno: Add support for devices not supporting double thread size.
|
---|
780 | - turnip: Add debug option to allow non-conforming features.
|
---|
781 |
|
---|
782 | Andrew Randrianasulu (1):
|
---|
783 |
|
---|
784 | - nv50/ir: Remove few nvc0 specific defines from nv50-specific header.
|
---|
785 |
|
---|
786 | Antonio Gomes (9):
|
---|
787 |
|
---|
788 | - rusticl/kernel: Removing unnecessary clone in kernel launch
|
---|
789 | - rusticl/kernel: Add CsoWrapper
|
---|
790 | - rusticl/compiler: Add NirPrintfInfo
|
---|
791 | - rusticl: Move Cso to Program
|
---|
792 | - rusticl/compiler: Remove unnecessary functions
|
---|
793 | - rusticl: Move NirKernelBuild to ProgramDevBuild
|
---|
794 | - rusticl/program: New helper functions to NirKernelBuild
|
---|
795 | - rusticl/core: Delete KernelDevState and KernelDevStateInner
|
---|
796 | - rusticl/core: Make convert_spirv_to_nir output pair (KernelInfo, NirShader)
|
---|
797 |
|
---|
798 | Asahi Lina (29):
|
---|
799 |
|
---|
800 | - docs/tgsi: Specify that depth texture fetches are replicated
|
---|
801 | - asahi: Add synctvb debug flag
|
---|
802 | - asahi: Add smalltile debug option
|
---|
803 | - asahi: Add nomsaa debug flag
|
---|
804 | - asahi: decode: Add a params argument to pass through
|
---|
805 | - asahi: Add extra CDM header block for G14X
|
---|
806 | - asahi: wrap: Handle freeing shmems
|
---|
807 | - asahi: decode: Refactor to always copy GPU mem to local buffers
|
---|
808 | - asahi: decode: Add a function to construct decode_params from a chip_id
|
---|
809 | - asahi: Add a shared library interface for decode
|
---|
810 | - asahi: Add a noshadow debug flag
|
---|
811 | - asahi: Do not overallocate BOs by more than 2x
|
---|
812 | - asahi: Fix race in BO stats accounting
|
---|
813 | - asahi: Always use resource size, not BO size
|
---|
814 | - asahi: Print info about shadowed resources
|
---|
815 | - asahi: Impose limits on resource shadowing
|
---|
816 | - asahi: Force linear for SHARED buffers with no/implicit modifier
|
---|
817 | - asahi: Enable explicit coherency for G14D (multi-die)
|
---|
818 | - asahi: Handle non-written RTs correctly
|
---|
819 | - asahi: Fix incorrect BO bitmap reallocations
|
---|
820 | - asahi: Allocate staging resources as staging
|
---|
821 | - asahi: cmdbuf: Identify call/ret bits
|
---|
822 | - asahi: decode: Implement VDM call/ret
|
---|
823 | - asahi: decode: Do not assert on buffer overruns
|
---|
824 | - asahi: Fix VDM pipeline field width
|
---|
825 | - asahi: Add scaffolding for supporting driconf options
|
---|
826 | - asahi: Add and support the no_fp16 driconf flag
|
---|
827 | - driconf: Disable fp16 for browsers
|
---|
828 | - asahi: Allow no16 flag for disk cache
|
---|
829 |
|
---|
830 | Bas Nieuwenhuizen (16):
|
---|
831 |
|
---|
832 | - aco: fix nir_op_vec8/16 with 16-bit elements.
|
---|
833 | - aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
|
---|
834 | - nir: Fix 16-component nir_replicate.
|
---|
835 | - radv: Expose VK_EXT_external_memory_acquire_unmodified.
|
---|
836 | - util/perf: Add gpuvis integration.
|
---|
837 | - egl,venus,vulkan,turnip,freedreno: Update CPU trace init to init more than perfetto.
|
---|
838 | - vulkan: Add CPU tracing for vkWaitForFences.
|
---|
839 | - docs: Add documentation for gpuvis.
|
---|
840 | - vulkan: Add trace points for more Vulkan waiting functions.
|
---|
841 | - radv: Use a double jump to limit nops in DGC for dynamic sequence count.
|
---|
842 | - nir: Add AMD cooperative matrix intrinsics.
|
---|
843 | - aco: Add WMMA instructions.
|
---|
844 | - aco: Make RA understand WMMA instructions.
|
---|
845 | - radv: Don't transparently use wave32 with cooperative matrices.
|
---|
846 | - radv: Add cooperative matrix lowering.
|
---|
847 | - radv: Expose VK_KHR_cooperative_matrix.
|
---|
848 |
|
---|
849 | Benjamin Cheng (10):
|
---|
850 |
|
---|
851 | - radv/video: use app provided hevc scaling list order
|
---|
852 | - radv/video: copy from correct H264 scaling lists
|
---|
853 | - anv/video: copy from correct H264 scaling lists
|
---|
854 | - vulkan/video: add helper to derive H264 scaling lists
|
---|
855 | - radv/video: use vk_video_derive_h264_scaling_list
|
---|
856 | - anv/video: use vk_video_derive_h264_scaling_list
|
---|
857 | - util/vl: extract gallium vl scanning data to shared code
|
---|
858 | - radv/video: send h264 scaling list in raster order
|
---|
859 | - anv/video: send h264 scaling list in raster order
|
---|
860 | - radv/video: find SPS with pps_seq_parameter_set_id
|
---|
861 |
|
---|
862 | Benjamin Lee (1):
|
---|
863 |
|
---|
864 | - nvk: Fix segfault when opening DRI device file returns error
|
---|
865 |
|
---|
866 | Biswapriyo Nath (1):
|
---|
867 |
|
---|
868 | - radv/video: Match function definitions to declarations
|
---|
869 |
|
---|
870 | Boris Brezillon (1):
|
---|
871 |
|
---|
872 | - panfrost: Flag the right shader when updating images
|
---|
873 |
|
---|
874 | Boyuan Zhang (3):
|
---|
875 |
|
---|
876 | - virgl: Add vp9 picture desc
|
---|
877 | - virgl: Implement vp9 hardware decode
|
---|
878 | - radeonsi/vcn: disable tmz ctx buffer for VCN_2_2_0
|
---|
879 |
|
---|
880 | Caio Oliveira (134):
|
---|
881 |
|
---|
882 | - nir: Use instructions_pass() for nir_fixup_deref_modes()
|
---|
883 | - meson: Ensure that LLVMSPIRVLib is not required for Clover
|
---|
884 | - nir: Let nir_fixup_deref_modes() fix deref_casts when possible
|
---|
885 | - nir: Add nir_opt_reuse_constants()
|
---|
886 | - radv: Use nir_opt_reuse_constants()
|
---|
887 | - compiler/types: Use ralloc for the key in array_types
|
---|
888 | - compiler/types: Use smaller keys for array_types table
|
---|
889 | - compiler/types: Extract get_explicit_matrix_instance() function
|
---|
890 | - compiler/types: Use smaller keys for explicit_matrix_types table
|
---|
891 | - anv/tests: Refactor state_pool_test_helper to not use macros for parametrization
|
---|
892 | - anv/tests: Link a single anv_tests binary using gtest
|
---|
893 | - anv/tests: Propagate failures to gtest
|
---|
894 | - hasvk/tests: Refactor state_pool_test_helper to not use macros for parametrization
|
---|
895 | - hasvk/tests: Link a single hasvk_tests binary using gtest
|
---|
896 | - hasvk/tests: Propagate failures to gtest
|
---|
897 | - util: Add convenience macros for linear allocator
|
---|
898 | - compiler/types: Use right hash for function types
|
---|
899 | - compiler/types: Don't duplicate empty string
|
---|
900 | - compiler/types: Constify a couple of pointers in glsl_type
|
---|
901 | - compiler/types: Remove unused GLSL_TYPE_FUNCTION and related functions
|
---|
902 | - compiler/types: Move GLSL specific builtin structs into glsl/
|
---|
903 | - glsl: Add missing glsl_types initialization to test_optpass
|
---|
904 | - glsl: Don't create struct type builtins
|
---|
905 | - compiler/types: Add extra level of macro to builtin_macros
|
---|
906 | - compiler/types: Use designated initializer syntax to specify builtins
|
---|
907 | - compiler/types: Move local cache details to implementation file
|
---|
908 | - compiler/types: Add a mem_ctx for the glsl_type_cache
|
---|
909 | - compiler/types: Use type cache mem_ctx for hash tables
|
---|
910 | - compiler/types: Don't store a mem_ctx per type
|
---|
911 | - compiler/types: Simplify clearing the glsl_type_cache
|
---|
912 | - compiler/types: Move static asserts about glsl_type to a central place
|
---|
913 | - compiler/types: Store builtin types directly as data
|
---|
914 | - compiler/types: Use a linear (arena) allocator for glsl_types
|
---|
915 | - compiler/types: Make struct glsl_type visible to C code
|
---|
916 | - compiler/types: Add workaround to use builtin_type_macros.h in C
|
---|
917 | - compiler/types: Move builtin type initialization to C
|
---|
918 | - glsl: Annotate _mesa_glsl_error() with PRINTFLIKE
|
---|
919 | - compiler/types: Fix array name dimension flipping for unsized arrays
|
---|
920 | - compiler/types: Use Python to generate code for builtin types
|
---|
921 | - compiler/types: Use glsl_get_type_name() to access the type name
|
---|
922 | - compiler/types: Change glsl_type::name to be an uintptr_t
|
---|
923 | - compiler/types: Use a string table for builtin type names
|
---|
924 | - intel/compiler/xe2: Account for reg_unit() in TCS intrinsics
|
---|
925 | - intel/compiler/xe2: Account for reg_unit() in TES intrinsics
|
---|
926 | - intel/fs/xe2+: Update BS payload setup for Xe2 reg size.
|
---|
927 | - intel/fs/xe2+: Update TASK/MESH payload setup for Xe2 reg size.
|
---|
928 | - compiler: Use a meson dependency for libcompiler
|
---|
929 | - meson: Remove unnecessary inc_compiler mentions
|
---|
930 | - rusticl: Ensure NIR generated headers will be available
|
---|
931 | - clover: Hide SPIR-V related code behind HAVE_CLOVER_SPIRV
|
---|
932 | - clover: Only compile/depend libclspirv and libclnir when using SPIR-V support
|
---|
933 | - compiler: Only enable mesaclc helper if we have OpenCL SPIR-V support
|
---|
934 | - intel/compiler: Don't allocate memory for SIMD select error handling
|
---|
935 | - microsoft/compiler: Fix printf formatting string issues
|
---|
936 | - util: Add more PRINTFLIKE and MALLOCLIKE annotations
|
---|
937 | - util: Remove ralloc_parent from linear_header
|
---|
938 | - util: Use linear parent to (r)allocated extra nodes
|
---|
939 | - util: Remove size from linear_parent creation
|
---|
940 | - util: Make DECLARE_LINEAR_ALLOC_* macros assume no destructors
|
---|
941 | - util: Use an opaque type for linear context
|
---|
942 | - util: Remove usages of linear_realloc()
|
---|
943 | - util: Remove linear_realloc()
|
---|
944 | - util: Remove size information from child allocations
|
---|
945 | - util: Remove per-buffer header in linear alloc for release mode
|
---|
946 | - util: Add a few basic tests for linear_alloc
|
---|
947 | - util: Fix bookkeeping of linear node sizes
|
---|
948 | - intel/compiler: Don't store stage name and abbrev
|
---|
949 | - intel/compiler/xe2: URB fence uses LSC now
|
---|
950 | - intel/compiler/xe2: Fix URB writes in TCS
|
---|
951 | - intel/compiler/xe2: Update TCS ICP handle code to support SIMD16
|
---|
952 | - compiler/types: Add support for Cooperative Matrix types
|
---|
953 | - nir: Add new intrinsics for Cooperative Matrix
|
---|
954 | - nir: Handle cooperative matrix in various passes
|
---|
955 | - spirv: Expose some memory related functions in vtn_private.h
|
---|
956 | - spirv: Let vtn_ssa_value hold references to variables
|
---|
957 | - spirv: Implement SPV_KHR_cooperative_matrix
|
---|
958 | - compiler/types: Remove private related declarations
|
---|
959 | - compiler/types: Remove use of new/delete
|
---|
960 | - compiler/types: Remove use of references
|
---|
961 | - compiler/types: Remove use of auto
|
---|
962 | - compiler/types: Use C compatible cast syntax
|
---|
963 | - compiler/types: Spell struct and enum in type names
|
---|
964 | - compiler/types: Add void parameter to ensure these are valid C prototypes
|
---|
965 | - intel/fs: Tweak default case of fs_inst::size_read()
|
---|
966 | - compiler/types: Move the C++ inline functions in glsl_type out of the struct body
|
---|
967 | - compiler/types: Move C declarations into glsl_types.h
|
---|
968 | - compiler/types: Flip wrapping of base_type checks
|
---|
969 | - compiler/types: Flip wrapping of various type identification checks
|
---|
970 | - compiler/types: Flip wrapping of convenience accessors for vector types
|
---|
971 | - compiler/types: Flip wrapping of basic "get type" functions
|
---|
972 | - rusticl: Add Rust bindings for inline glsl_types functions
|
---|
973 | - util: Add size to ralloc_header in debug mode
|
---|
974 | - util: Add a canary to identify gc_ctx in debug mode
|
---|
975 | - util: Add function to print information about a ralloc tree
|
---|
976 | - util: Avoid waste space when linear alloc'ing large sizes
|
---|
977 | - spirv: Expose stage enum conversion in vtn_private.h
|
---|
978 | - spirv: Change spirv2nir to use the shorter shader name abbreviations
|
---|
979 | - spirv: List entry-points in spirv2nir when unsure what to use
|
---|
980 | - spirv: Let spirv2nir find out the shader to use
|
---|
981 | - intel/compiler: Don't emit calls to validate() in release build
|
---|
982 | - compiler/types: Flip wrapping of "type contains?" predicate functions
|
---|
983 | - compiler/types: Flip wrapping of array related functions
|
---|
984 | - compiler/types: Flip wrapping of cmat related functions
|
---|
985 | - compiler/types: Flip wrapping of CL related functions
|
---|
986 | - compiler/types: Flip wrapping of size related functions
|
---|
987 | - compiler/types: Flip wrapping of struct related functions
|
---|
988 | - compiler/types: Flip wrapping of interface related functions
|
---|
989 | - compiler/types: Flip wrapping of layout related functions
|
---|
990 | - compiler/types: Flip wrapping of record_compare
|
---|
991 | - compiler/types: Flip wrapping of get_instance()
|
---|
992 | - compiler/types: Flip wrapping of texture/sampler/image get instance functions
|
---|
993 | - compiler/types: Flip wrapping of various get instance functions
|
---|
994 | - compiler/types: Flip wrapping of get row/column type helpers
|
---|
995 | - compiler/types: Flip wrapping of remaining non-trivial type getters
|
---|
996 | - compiler/types: Flip wrapping of remaining small data getters
|
---|
997 | - compiler/types: Flip wrapping of numeric type conversion functions
|
---|
998 | - compiler/types: Move remaining code from nir_types to glsl_types
|
---|
999 | - rusticl: Add bindings for glsl_vector_type()
|
---|
1000 | - compiler/types: Add more glsl_contains_*() functions and use them in C++
|
---|
1001 | - compiler/types: Add glsl_get_mul_type() and use it in C++
|
---|
1002 | - compiler/types: Add glsl_type_compare_no_precision() and use it in C++
|
---|
1003 | - compiler/types: Add glsl_type_uniform_locations() and use it in C++
|
---|
1004 | - compiler/types: Add glsl_get_std430_array_stride() and use it in C++
|
---|
1005 | - compiler/types: Add glsl_get_explicit_*() functions and use them in C++
|
---|
1006 | - compiler/types: Implement glsl_type::field_type() in terms of existing functions
|
---|
1007 | - compiler/types: Add glsl_simple_explicit_type() and simplify glsl_simple_type()
|
---|
1008 | - compiler/types: Add remaining type extraction functions and use them in C++
|
---|
1009 | - compiler/types: Use C instead of C++ constants for builtin types
|
---|
1010 | - compiler/types: Remove usages of C++ members in glsl_types.cpp
|
---|
1011 | - compiler/types: Annotate extern "C" only once in glsl_types.cpp
|
---|
1012 | - compiler/types: Rename glsl_types.cpp to glsl_types.c
|
---|
1013 | - compiler/types: Remove warnings about potential fallthrough
|
---|
1014 | - compiler/types: Move comments and reorganize declarations
|
---|
1015 | - anv: Fix leak when compiling internal kernels
|
---|
1016 |
|
---|
1017 | Carsten Haitzler (2):
|
---|
1018 |
|
---|
1019 | - kmsro: Add hdlcd DPU
|
---|
1020 | - panfrost: Add GPU variant of G57 to the set of known ids
|
---|
1021 |
|
---|
1022 | Charles Giessen (1):
|
---|
1023 |
|
---|
1024 | - panvk: Use 1.0 in ICD Manifest json
|
---|
1025 |
|
---|
1026 | Charmaine Lee (8):
|
---|
1027 |
|
---|
1028 | - svga: set clear_texture to NULL for vgpu9
|
---|
1029 | - svga: fix stride used in vertex declaration
|
---|
1030 | - svga: fix persistent mapped surface update to constant buffer
|
---|
1031 | - svga: restrict use of rawbuf for constant buffer access to GL43 device
|
---|
1032 | - svga: fix immediates used in rawbuf for constant buffer
|
---|
1033 | - svga: use srv raw buffer for accessing readonly shader buffer
|
---|
1034 | - svga: sync resource content from backing resource before image upload
|
---|
1035 | - svga: ignore sampler view resource if not used by shaders
|
---|
1036 |
|
---|
1037 | Chia-I Wu (38):
|
---|
1038 |
|
---|
1039 | - radv: fix separate depth/stencil layouts in fb state
|
---|
1040 | - radv: fix separate depth/stencil layouts in resolve meta
|
---|
1041 | - radv: refactor depth clear in clear meta
|
---|
1042 | - radv: fix separate depth/stencil layouts in clear meta
|
---|
1043 | - amd/ci: update radv-stoney-aco-fails.txt for depth/stencil clear
|
---|
1044 | - radv: disable tc-compat htile for layered images on gfx8
|
---|
1045 | - amd/ci: update radv-stoney-aco-fails.txt for depth/stencil resolve
|
---|
1046 | - winsys/amdgpu: fix a race between import and destroy
|
---|
1047 | - ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
|
---|
1048 | - winsys/radeon: fix a race between bo import and destroy
|
---|
1049 | - vulkan/runtime: add a helper for ETC2 emulation
|
---|
1050 | - radv: use vk_texcompress_etc2 from the runtime
|
---|
1051 | - vulkan/runtime: fix image type check for ETC2 emulation
|
---|
1052 | - vulkan/runtime: fix a harmless typo for ETC2 emulation
|
---|
1053 | - vulkan/runtime, radv: remove 1D support from ETC2 emulation
|
---|
1054 | - radv: add radv_is_format_emulated
|
---|
1055 | - radv: simplify view format override for emulated formats
|
---|
1056 | - radv: hard code format features for emulated formats
|
---|
1057 | - mesa: make astc_decoder.glsl vk-compatible
|
---|
1058 | - radv, drirc: rename radv_require_{etc2,astc}
|
---|
1059 | - anv: remove unused field from anv_image_view
|
---|
1060 | - anv: add anv_image_view_{init,finish}
|
---|
1061 | - anv: support image views with surface state stream
|
---|
1062 | - anv: add anv_push_descriptor_set_{init,finish}
|
---|
1063 | - anv: support alternative push descriptor sets
|
---|
1064 | - anv: add anv_descriptor_set_write
|
---|
1065 | - anv: add anv_cmd_buffer_{save,restore}_state
|
---|
1066 | - anv: add anv_is_format_emulated
|
---|
1067 | - anv: add a hidden plane for emulated formats
|
---|
1068 | - anv: decompress on upload for emulated formats
|
---|
1069 | - anv: fix up image views for emulated formats
|
---|
1070 | - anv: fix up blit src for emulated formats
|
---|
1071 | - anv: advertise emulated formats
|
---|
1072 | - anv: add support for vk_require_astc driconf
|
---|
1073 | - util: improve BITFIELD_MASK and BITFIELD64_MASK on clang
|
---|
1074 | - anv: prep for gen9 astc workaround
|
---|
1075 | - anv: add gen9 astc workaround
|
---|
1076 | - radv: fix image view extent override for astc
|
---|
1077 |
|
---|
1078 | Chris Spencer (9):
|
---|
1079 |
|
---|
1080 | - radv: initialize result when pipeline cache creation fails
|
---|
1081 | - anv/android: Fix importing hardware buffers with planar formats
|
---|
1082 | - anv/android: Add support for AHARDWAREBUFFER_FORMAT_YV12
|
---|
1083 | - anv: Advertise Vulkan 1.3 on Android 13
|
---|
1084 | - anv: Don't reject Android image format if external props not supplied
|
---|
1085 | - android: Add explanatory comment to u_gralloc
|
---|
1086 | - anv/android: Enable shared presentable image support
|
---|
1087 | - anv/video: use correct enum value for max level IDC
|
---|
1088 | - radv/video: use correct enum value for max level IDC
|
---|
1089 |
|
---|
1090 | Christian Gmeiner (41):
|
---|
1091 |
|
---|
1092 | - nir/print: print instr pass_flags
|
---|
1093 | - etnaviv: move nir texture lowerings into one pass
|
---|
1094 | - nir: add etna specific intrinsic used for txs lowering
|
---|
1095 | - etnaviv: nir: support intrinsic used for txs lowering
|
---|
1096 | - etnaviv: nir: lower nir_texop_txs
|
---|
1097 | - ci/etnaviv: update ci expectations
|
---|
1098 | - etnaviv: make use of BITFIELD_BIT(..) macro
|
---|
1099 | - etnaviv: name the enum used for pass_flags
|
---|
1100 | - etnaviv: add is_dead_instruction(..) helper
|
---|
1101 | - etnaviv: extend etna_pass_flags with source modifiers
|
---|
1102 | - etnaviv: do not clear all pass_flags before RA
|
---|
1103 | - etnaviv: nir: look at parent instr in lower_alu(..)
|
---|
1104 | - etnaviv: nir: add etna_nir_lower_to_source_mods(..)
|
---|
1105 | - etnaviv: nir: switch to etna_nir_lower_to_source_mods(..)
|
---|
1106 | - etnaviv: nir: convert to new-style NIR registers
|
---|
1107 | - freedreno/regs: remove double assignment of self.current_domain
|
---|
1108 | - freedreno/regs: remove not used variable
|
---|
1109 | - freedreno/regs: remove dead code
|
---|
1110 | - freedreno/regs: python does not need ';'
|
---|
1111 | - etnaviv: switch to log2f(..)
|
---|
1112 | - etnaviv: switch to U_FIXED(..) macro
|
---|
1113 | - etnaviv: switch to S_FIXED(..) macro
|
---|
1114 | - etnaviv: fix null pointer dereference
|
---|
1115 | - etnaviv: switch to float_to_ubyte(..)
|
---|
1116 | - ci/etnaviv: update ci expectation
|
---|
1117 | - etnaviv: unbreak cmdline compiler
|
---|
1118 | - agx/lower_address: Use intrinsics_pass
|
---|
1119 | - agx/lower_address: Remove not used has_offset
|
---|
1120 | - isaspec: python does not need ';'
|
---|
1121 | - docs: Move isaspec out of drivers/freedreno
|
---|
1122 | - isaspec: Add support for templates
|
---|
1123 | - isaspec: encode: Correct used regex
|
---|
1124 | - isaspec: Add method to get all instructions
|
---|
1125 | - isaspec: Add support for custom meta information
|
---|
1126 | - isaspec: Add BitSetEnumValue object
|
---|
1127 | - spirv: Don't use libclc for rotate
|
---|
1128 | - docs: update etnaviv extensions
|
---|
1129 | - etnaviv: drm: Be able to mark end of context init
|
---|
1130 | - etnaviv: Skip 'empty' cmd streams
|
---|
1131 | - ci: Bump PyYAML to 6.0.1
|
---|
1132 | - etnaviv: Don't leak disk_cache
|
---|
1133 |
|
---|
1134 | Collabora's Gfx CI Team (2):
|
---|
1135 |
|
---|
1136 | - Uprev Piglit to ed58dfbd12be34fa3dab97a7a2987b890e0637f1
|
---|
1137 | - Uprev Piglit to f7db20b03de6896d013826c0a731bc4417c1a5a0
|
---|
1138 |
|
---|
1139 | Cong Liu (2):
|
---|
1140 |
|
---|
1141 | - r300: Fix out-of-bounds access in ntr_emit_store_output()
|
---|
1142 | - virgl: Fix ITEM_CPY macro pointer copy bug
|
---|
1143 |
|
---|
1144 | Connor Abbott (83):
|
---|
1145 |
|
---|
1146 | - afuc: Rework and significantly expand README.rst
|
---|
1147 | - tu: Fix vk2tu_*_stage flag type
|
---|
1148 | - tu: Fix and simplify execution dependency handling
|
---|
1149 | - tu, freedreno/a6xx: Remove has_ccu_flush_bug
|
---|
1150 | - ir3: Handle GS stream "mixing" with non-point output primitives
|
---|
1151 | - tu: Disable transformFeedbackPreservesProvokingVertex
|
---|
1152 | - isaspec: Add "displayname" for altering {NAME} when decoding
|
---|
1153 | - isaspec: Add support for "absolute" branches
|
---|
1154 | - isaspec: Add support for function and entrypoint labels
|
---|
1155 | - isaspec: Add "custom" field type
|
---|
1156 | - isaspec: Add callback after decoding an instruction
|
---|
1157 | - isaspec: Rename isa_decode() to isa_disasm()
|
---|
1158 | - isaspec: Add initial decoding support
|
---|
1159 | - afuc: Fix xmov lexer typo
|
---|
1160 | - afuc: Convert to isaspec
|
---|
1161 | - afuc: Add setbit/clrbit
|
---|
1162 | - afuc: Fix writing $00
|
---|
1163 | - freedreno/afuc: Initial a7xx support
|
---|
1164 | - ir3: Parse (eq) flag
|
---|
1165 | - ir3, freedreno, tu: Plumb through SP_FS_PREFETCH_CNTL::ENDOFQUAD
|
---|
1166 | - tu: Add missing last_baryf statistic
|
---|
1167 | - freedreno, tu, ir3: Add last_helper statistic
|
---|
1168 | - ir3: Gather pixlod status earlier
|
---|
1169 | - ir3: Implement helper invocation optimization
|
---|
1170 | - vk/graphic_state, tu: Use dynamic blend count from subpass
|
---|
1171 | - freedreno/a7xx: Add CP_RESET_CONTEXT_STATE
|
---|
1172 | - vk/graphics_state: Fix copying MS locations pipeline state
|
---|
1173 | - tu: Remove MSAA draw state
|
---|
1174 | - tu: Merge SAMPLE_LOCATIONS and SAMPLE_LOCATIONS_ENABLE draw states
|
---|
1175 | - tu: Merge PC_RASTER_CNTL into RAST draw state
|
---|
1176 | - tu: Stop reusing base Vulkan dynamic state enums
|
---|
1177 | - tu: Merge depth/stencil draw states
|
---|
1178 | - tu: Rename PrimID-related registers
|
---|
1179 | - tu, freedreno/a6xx: Don't use VS for PrimID passthru state
|
---|
1180 | - tu: Pull entangled shader state into program config
|
---|
1181 | - ir3: Add ir3_find_input_loc() helper
|
---|
1182 | - tu: Split up tu6_emit_vpc()
|
---|
1183 | - freedreno, ir3, tu: Constify various uses of ir3_shader_variant
|
---|
1184 | - ir3: Add helper to determine when variant exceeds safe constlen
|
---|
1185 | - tu: Split program draw state into per-shader states
|
---|
1186 | - tu: Fix per-view viewport state propagation
|
---|
1187 | - tu: Fix tu6_emit_*_fdm size call
|
---|
1188 | - tu: Fix assert in FDM state emission
|
---|
1189 | - tu: Actually emit patchpoint for viewports with FDM
|
---|
1190 | - nir/lower_subgroups: Don't do multiple lowerings at once
|
---|
1191 | - nir/spirv: Add inverse_ballot intrinsic
|
---|
1192 | - amd: Use inverse ballot intrinsic if available
|
---|
1193 | - tu: Create singleton "empty" shaders
|
---|
1194 | - tu: Start tracking shaders independently of pipeline
|
---|
1195 | - tu: Move FS-specific pipeline information to the shader
|
---|
1196 | - tu: Use shader directly for VS/TCS output size and patch size
|
---|
1197 | - tu: Rewrite tessellation modes handling
|
---|
1198 | - tu: Rework passing shared consts
|
---|
1199 | - tu: Decouple program state from the pipeline
|
---|
1200 | - tu: Use pipeline feedback loop flag indirectly
|
---|
1201 | - tu: Rewrite remaining pipeline LRZ handling
|
---|
1202 | - tu: Don't reference pipeline for some draw states
|
---|
1203 | - tu: Make compute dispatch use the shader
|
---|
1204 | - tu: Don't use pipeline for dynamic draw states
|
---|
1205 | - tu: Don't use pipeline for bandwidth validity
|
---|
1206 | - tu: Don't use pipeline for per_view_viewport
|
---|
1207 | - tu: Don't use pipeline for active stages
|
---|
1208 | - tu: Remove pipeline from state
|
---|
1209 | - zink: Rework color clamping and conversion
|
---|
1210 | - freedreno/fdl: Use A8_UNORM HW format for sampling
|
---|
1211 | - tu: Support clearing A8_UNORM
|
---|
1212 | - freedreno/fdl: Support PIPE_FORMAT_R5G5B5A1_UNORM on a6xx
|
---|
1213 | - tu/clear_blit: Fix staging image view layer count
|
---|
1214 | - tu/clear_blit: Allow VK_REMAINING_ARRAY_LAYERS as layerCount
|
---|
1215 | - tu: Allow VK_WHOLE_SIZE in tu_CmdBindVertexBuffers2EXT pSizes
|
---|
1216 | - tu: Implement vkCmdBindIndexBuffer2KHR
|
---|
1217 | - tu: Implement vkGetImageSubresourceLayout2KHR and vkGetDeviceImageSubresourceLayoutKHR
|
---|
1218 | - tu: Implement vkGetRenderingAreaGranularityKHR
|
---|
1219 | - tu: Use new buffer usage flags
|
---|
1220 | - tu: Support VkPipelineCreateFlags2CreateInfoKHR
|
---|
1221 | - tu: Check for DEVICE_LOST in vkGetEventStatus()
|
---|
1222 | - tu: Add maintenance5 properties
|
---|
1223 | - freedreno/ci: Skip dEQP-VK.info.device_extensions
|
---|
1224 | - tu: Expose VK_KHR_maintenance5
|
---|
1225 | - freedreno/ci: Remove minetest trace
|
---|
1226 | - v3d/ci: Remove minetest trace
|
---|
1227 | - ir3/ra: Don't swap killed sources for early-clobber destination
|
---|
1228 | - tu: Fix re-emitting VS param state after it is re-enabled
|
---|
1229 |
|
---|
1230 | Corentin Noël (16):
|
---|
1231 |
|
---|
1232 | - ci: Add locked flag to bindgen-cli installation
|
---|
1233 | - virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
|
---|
1234 | - ci: disable Collabora's LAVA lab for maintenance
|
---|
1235 | - llvmpipe: make sure to initialize the lp_setup_context slots with the default values
|
---|
1236 | - virgl: Cover all the formats defined in the virgl definition
|
---|
1237 | - mesa: Ensure that the baselevel will never exceed the maximal supported number
|
---|
1238 | - ci: Uprev virglrenderer
|
---|
1239 | - freedreno/drm/virtio: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
|
---|
1240 | - tu: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
|
---|
1241 | - aux/tc: Use MESA_TRACE_SCOPE instead of _BEGIN/_END
|
---|
1242 | - venus: Change the only occurrence of VN_TRACE_BEGIN/END to VN_TRACE_SCOPE
|
---|
1243 | - util: Avoid the use of MESA_TRACE_BEGIN/END
|
---|
1244 | - util/perf: Remove the tracing categories
|
---|
1245 | - util: Remove MESA_TRACE_BEGIN/END
|
---|
1246 | - mesa/bufferobj: ensure that very large width+offset are always rejected
|
---|
1247 | - frontends/va: Remove wrong use of ProfileToPipe
|
---|
1248 |
|
---|
1249 | Daniel Schürmann (9):
|
---|
1250 |
|
---|
1251 | - nir/opt_move: fix handling of if-condition
|
---|
1252 | - aco: append p_logical_end after monolithic RT shaders
|
---|
1253 | - aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
|
---|
1254 | - aco: don't optimize cross-lane instructions across p_wqm
|
---|
1255 | - aco: make p_wqm a marker instruction without Operands/Definitions
|
---|
1256 | - aco: don't insert a copy when emitting p_wqm
|
---|
1257 | - aco: insert a single p_end_wqm after the last derivative calculation
|
---|
1258 | - aco/insert_exec_mask: Simplify WQM handling (1/2)
|
---|
1259 | - aco/insert_exec_mask: Simplify WQM handling (2/2)
|
---|
1260 |
|
---|
1261 | Daniel Stone (23):
|
---|
1262 |
|
---|
1263 | - dri: Support 1555/4444 formats
|
---|
1264 | - egl/dri2: Don't look up image extension twice
|
---|
1265 | - egl/wayland: Always initialise fd_display_gpu
|
---|
1266 | - egl/wayland: Add image loader extension for swrast
|
---|
1267 | - egl/wayland: Never use DRI2_LOADER extension
|
---|
1268 | - egl/wayland: Assume modern DRI interface versions
|
---|
1269 | - egl/drm: Use IMAGE_DRIVER instead of DRI2_LOADER
|
---|
1270 | - egl/drm: Assume modern DRI interface versions
|
---|
1271 | - ci: Disable nouveau CI
|
---|
1272 | - panfrost/vk: Use correct sampler dimensions for MSAA
|
---|
1273 | - ci: Declare stages before jobs
|
---|
1274 | - ci/radeonsi: Add new flake
|
---|
1275 | - ci/d3d12: Add new flake
|
---|
1276 | - ci/intel: Add new skqp flake
|
---|
1277 | - ci/zink: Add new zink-lvp flakes
|
---|
1278 | - ci/radeonsi: Skip more really slow tests
|
---|
1279 | - ci/zink: Add another conversion fail on a618
|
---|
1280 | - ci: Move farm-disable rules before anything else
|
---|
1281 | - ci: Always set user container jobs to manual
|
---|
1282 | - ci: Use container rules for containers
|
---|
1283 | - ci: Only look at file changes for MRs
|
---|
1284 | - ci: Fix pre-merge pipelines with no code changes
|
---|
1285 | - ci: Try really hard to print final result string
|
---|
1286 |
|
---|
1287 | Daniel van Vugt (1):
|
---|
1288 |
|
---|
1289 | - glx: Increment dpy->request before issuing an error that had no request
|
---|
1290 |
|
---|
1291 | Danylo Piliaiev (71):
|
---|
1292 |
|
---|
1293 | - freedreno/cffdec: Decode CP_DRAW_AUTO
|
---|
1294 | - freedreno, turnip: Clarify some RB_CCU_CNTL fields
|
---|
1295 | - freedreno,turnip: Make number of VSC pipes configurable
|
---|
1296 | - freedreno,turnip: Make CS shared memory size configurable
|
---|
1297 | - freedreno,turnip: Make VS input attr/binding count configurable
|
---|
1298 | - freedreno: Add A605, A608, A610, A612 GPUs definition
|
---|
1299 | - turnip: Make multiview support configurable per generation
|
---|
1300 | - ir3: Make FS tex prefetch optimization optional
|
---|
1301 | - ir3: Use NIR info to enable per sample shading
|
---|
1302 | - freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
|
---|
1303 | - ir3: Fix FS quad ops returning wrong values from helper invocations
|
---|
1304 | - tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
|
---|
1305 | - radv: fix unused non-xfb shader outputs not being removed
|
---|
1306 | - vulkan/nir: Add common helper to check if output is XFB
|
---|
1307 | - radv: Use common nir_vk_is_not_xfb_output
|
---|
1308 | - turnip: Use common nir_vk_is_not_xfb_output
|
---|
1309 | - freedreno/regs: Define unknown SP_FS_PREFETCH_CNTL fields
|
---|
1310 | - freedreno/registers: Refactor gen_header.py to allow more options
|
---|
1311 | - freedreno/registers: Generate python files with reg offsets
|
---|
1312 | - freedreno: Add a list of raw magic regs
|
---|
1313 | - freedreno: Fully define a730 and a740 device properties
|
---|
1314 | - ir3/tests: Use fd_dev_info to infer GPU generation
|
---|
1315 | - freedreno/computerator: Fix remaining issues with A7XX
|
---|
1316 | - isaspec: Make possible to obtain gpu_id in <expr> blocks
|
---|
1317 | - ir3/a7xx: cat5 mode1 has swapped tex/samp ids
|
---|
1318 | - ir3/a7xx: Don't multiply global mem instruction's offset by 4
|
---|
1319 | - ir3/a7xx: insert lock/unlock at the end of every compute shader
|
---|
1320 | - ir3/a7xx: Add ccinv instruction
|
---|
1321 | - ir3/a7xx: Use ccinv for data synchronization
|
---|
1322 | - ir3/a7xx: Disable shared consts for a7xx
|
---|
1323 | - tu/common: Generalize TU_GENX macro
|
---|
1324 | - tu: Basic a7xx support
|
---|
1325 | - freedreno/fdl: Set LOSSLESSCOMPEN for image when ubwc is enabled on a7xx
|
---|
1326 | - tu/a7xx: Fix geometry shaders
|
---|
1327 | - tu/a7xx: Fix tesselation shaders
|
---|
1328 | - tu/a7xx: Fix multiview
|
---|
1329 | - tu/a7xx: Fix flat shading
|
---|
1330 | - tu/a7xx: Fix occlusion query
|
---|
1331 | - tu/a7xx: Fix 3d blits after multiview usage
|
---|
1332 | - tu/a7xx: Fix CmdDrawIndirectByteCountEXT
|
---|
1333 | - tu/a7xx: Disable LRZ
|
---|
1334 | - ir3/lower_tex_prefetch: Fix crash with lowered load_barycentric_at_offset
|
---|
1335 | - tu: Exclude SP_UNKNOWN_AE73 from reg stomping
|
---|
1336 | - tu: Call tu_cs_dbg_stomp_regs with appropriate GPU gen
|
---|
1337 | - freedreno/replay: Add limited support for KGSL
|
---|
1338 | - freedreno/rddecompiler: Update to handle a7xx
|
---|
1339 | - freedreno/replay: Add "print" instr to ir3 asm to be used in replay
|
---|
1340 | - freedreno/replay: Add "gpu_print" function for command streams
|
---|
1341 | - tu/perfetto: Remove now unnecessary tu_perfetto_util
|
---|
1342 | - tu/perfetto: Allow gpu time to be passed into tu_perfetto_submit
|
---|
1343 | - tu/kgsl: Fix memory leak of tmp allocations during submissions
|
---|
1344 | - tu/kgsl: Support u_trace and perfetto
|
---|
1345 | - tu/a7xx: Correctly record timestamps for u_trace
|
---|
1346 | - tu/virtio: Fix incorrect call to tu_perfetto_submit
|
---|
1347 | - ci: Compile Turnip's virtio kmd in debian-arm64
|
---|
1348 | - freedreno/registers: Refine a7xx push consts registers
|
---|
1349 | - ir3,tu: Refactor push consts info plumbing
|
---|
1350 | - freedreno: Make possible to specify A7XX feature flags
|
---|
1351 | - turnip,ir3: Implement A7XX push consts load via preamble
|
---|
1352 | - tu: Add push_consts_per_stage debug option
|
---|
1353 | - tu: Fix VK_FORMAT_A8_UNORM_KHR using UBWC when !has_8bpp_ubwc
|
---|
1354 | - tu/kgsl: Fix field order in kgsl_command_object init
|
---|
1355 | - tu: Fix stale tu_render_pass_attachment::store_stencil with dyn rendering
|
---|
1356 | - tu: Zero init tu_render_pass and tu_subpass for dynamic rendering
|
---|
1357 | - tu: Disable preamble push consts when they are not used
|
---|
1358 | - ir3: Fix values of #wrmask not being compatible with ir3 parser
|
---|
1359 | - tu: Count a whole push consts range in constlen for PREAMBLE push consts
|
---|
1360 | - freedreno/rddecompiler: Use fd_dev_gen to pass gpu_id to ir3 disasm
|
---|
1361 | - freedreno/rddecompiler: Decompile repeated IBs
|
---|
1362 | - freedreno: Fix field size of A6XX_TEX_CONST[3].ARRAY_PITCH
|
---|
1363 | - tu: Fix reading of stale (V)PC_PRIMITIVE_CNTL_0
|
---|
1364 |
|
---|
1365 | Dave Airlie (163):
|
---|
1366 |
|
---|
1367 | - ci: remove binding model from the asan skips for lavapipe.
|
---|
1368 | - gallivm: fix atomic global temporary storage.
|
---|
1369 | - llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
|
---|
1370 | - nvk: add missing finish calls
|
---|
1371 | - nvk: add some initial wsi framework.
|
---|
1372 | - nvk: fix header guards to be less generic.
|
---|
1373 | - nvk: add bind buffer memory
|
---|
1374 | - nvk: Add initial queue
|
---|
1375 | - nvk: add cmd buffer framework
|
---|
1376 | - nvk: Reset pushbufs on command buffer reset
|
---|
1377 | - nvk: reindent descriptor sets to mesa std.
|
---|
1378 | - nvk: add initial descriptor pool framework.
|
---|
1379 | - nvk: some boilerplate for descriptor sets
|
---|
1380 | - nvk: add descriptor set bo allocation.
|
---|
1381 | - nvk: implement buffer address.
|
---|
1382 | - nvk: descriptor set freeing fix
|
---|
1383 | - nvk: move to new command stream generator.
|
---|
1384 | - nvk: port the blit and copy code to new command submission.
|
---|
1385 | - nouveau/ws: drop the old push generators.
|
---|
1386 | - nvk: link in codegen without gallium bits.
|
---|
1387 | - nvk: Initial wiring in of the compiler
|
---|
1388 | - nvk: Basic descriptor binding
|
---|
1389 | - nouveau/vk: add support for compute classes to generator.
|
---|
1390 | - nvk: retrieve gpc/mp counts from kernel.
|
---|
1391 | - nvk: add support for preamble and tls allocation.
|
---|
1392 | - nvk: add record result to cmd_buffer.
|
---|
1393 | - nvk: add command stream upload buffer.
|
---|
1394 | - nouveau/winsys: Add m2mf/compute objects
|
---|
1395 | - nvk: add some basic format wrapping framework
|
---|
1396 | - nvk: add some compute limits
|
---|
1397 | - nvk: add basic nve4+ compute support.
|
---|
1398 | - nvk: fix empty cmd submission.
|
---|
1399 | - nouveau/ws: add a push reset just for references.
|
---|
1400 | - nouveau/classes: add 906f header support.
|
---|
1401 | - nvk: add initial 8/16 byte clears.
|
---|
1402 | - nvk: fix pipeline pushbuf sizing
|
---|
1403 | - nvk: increase graphics cpu push buffer
|
---|
1404 | - nvk: fix depth emission ordering.
|
---|
1405 | - nvk: add some limits/features from binary driver.
|
---|
1406 | - nvk: add indexed draw support.
|
---|
1407 | - nvk: assign vertex locations according to input attrib index
|
---|
1408 | - nvk: lower io to temps to avoid output reads in vertex shaders
|
---|
1409 | - nvk: handle NULL to destroy descriptor pool
|
---|
1410 | - nvk: add basic primitive restart
|
---|
1411 | - nvk: fix copy lower address extraction
|
---|
1412 | - nvk: fix multiple pipelines failure allocation case.
|
---|
1413 | - nvk: init dev->physical_device earlier.
|
---|
1414 | - nvk/winsys: store device ptr into bo instead of ptr
|
---|
1415 | - nvk: set the device fd
|
---|
1416 | - nil: Fix image align and size constraints
|
---|
1417 | - nvk: Report image alignments from NIL
|
---|
1418 | - nouveau/winsys: allocate unique object handles across channels.
|
---|
1419 | - nvk/nil: don't ask for compressed image kind
|
---|
1420 | - nvk/barrier: handle host bit.
|
---|
1421 | - nvk: add compute support for ampere
|
---|
1422 | - nvk: add min_lod to spirv caps.
|
---|
1423 | - nvk: fix r32_sint format support
|
---|
1424 | - nvk: expose EXT_sampler_filter_minmax
|
---|
1425 | - nvk: fix transform feedback crash when optimiser removes things.
|
---|
1426 | - nvk: merge tess info between tcs/tes.
|
---|
1427 | - nvk: introduce an optimisation loop.
|
---|
1428 | - nvk: add support for D32_SFLOAT_S8_UINT
|
---|
1429 | - nvk/query: fix push buffer size for copy pool results.
|
---|
1430 | - nvk: init image fields for requirements
|
---|
1431 | - nvk: handle alignments in device memory
|
---|
1432 | - nvk/tess: don't emit patch control points in pipeline
|
---|
1433 | - nvk: align geometry clip setting with nvc0
|
---|
1434 | - nvk: fix independent color write masks.
|
---|
1435 | - nvk: enable rgb32 texel buffer support
|
---|
1436 | - nvk: enable EXT_depth_clip_control
|
---|
1437 | - nvk: enable EXT_depth_clip_enable
|
---|
1438 | - nvk: always sync internal cmd bufs for vma lifetimes.
|
---|
1439 | - nouveau/winsys: add support for the vma bind interfaces
|
---|
1440 | - nvk: Add support for sparse buffers
|
---|
1441 | - nvk: Add support for sparse images
|
---|
1442 | - nvk/queue: add support for syncobjs and sparse binds
|
---|
1443 | - nvk: Handle pre-turing indirect buffers with sparse
|
---|
1444 | - nvk: enable sparse features
|
---|
1445 | - nvk: enable a bunch of external fence/semaphore bits
|
---|
1446 | - nvk: enable sparse residency buffer on maxwell+
|
---|
1447 | - nvk: add new internal bo allocation flag.
|
---|
1448 | - docs: add two nvk exts to features.txt
|
---|
1449 | - zink: use fprintf instead of printf to align the requirements warnings
|
---|
1450 | - nvk: align sampler allocation counts with nvidia.
|
---|
1451 | - zink: turn off threaded cpu access if not visible.
|
---|
1452 | - nvk: add gart forced cmd pool side buffer.
|
---|
1453 | - nvk: add cond render upload buffer.
|
---|
1454 | - nvk: enable KHR_shader_clock.
|
---|
1455 | - nvk: NOUVEAU_WS_BO_LOCAL is a trap.
|
---|
1456 | - gallivm: drop unused info parameter
|
---|
1457 | - llvmpipe/fs: drop cbuf 0 since it's lowered now.
|
---|
1458 | - gallivm/nir: avoid using params->info
|
---|
1459 | - llvmpipe/fs: move some tgsi checks in nir path to nir code.
|
---|
1460 | - llvmpipe/cs: convert to using tgsi->nir
|
---|
1461 | - llvmpipe/cs: drop tgsi for compute/mesh/task shader internals.
|
---|
1462 | - lavapipe: use vk_buffer common code.
|
---|
1463 | - lavapipe: use vk_buffer_range common code.
|
---|
1464 | - llvmpipe/fs: switch to using tgsi->nir instead of handling tgsi
|
---|
1465 | - llvmpipe/analyse: drop TGSI path.
|
---|
1466 | - llvmpipe/fs: start using nir info in some places.
|
---|
1467 | - llvmpipe/fs: drop the simple shader logic
|
---|
1468 | - llvmpipe/fs: rewrite output finding using nir.
|
---|
1469 | - nvk: add build_id linker argument.
|
---|
1470 | - nir/gather: add support for fbfetch and bindless image loads.
|
---|
1471 | - llvmpipe/cs: further cleanups after tgsi removal.
|
---|
1472 | - llvmpipe: move to nir lowering for fquantize2f16
|
---|
1473 | - rusticl: don't store ptrs to nir_variables across opt passes.
|
---|
1474 | - llvmpipe: enable f16 paths on aarch64.
|
---|
1475 | - clover/llvm: move to modern pass manager.
|
---|
1476 | - nir: use a _clone so users calling their variable clone don't get a warning
|
---|
1477 | - nir: rename nir_inline_functions.c to nir_functions.c
|
---|
1478 | - nir: use nir_function_instructions_pass in the inliner.
|
---|
1479 | - nir: move the libclc lowering over to functions file.
|
---|
1480 | - nir/functions: use helper to get function for a name.
|
---|
1481 | - nir/functions: put link state into a struct
|
---|
1482 | - nir/functions: move linker pass to new helper
|
---|
1483 | - nir: add nir function clone
|
---|
1484 | - nir: don't inline linked functions
|
---|
1485 | - gallivm/nir: split prepasses out to make per-function work easier.
|
---|
1486 | - gallivm: rework translator to allow per-impl work.
|
---|
1487 | - spirv/nir: parse function control and store in nir.
|
---|
1488 | - nir: add driver_functions option to avoid inlining.
|
---|
1489 | - nir: add a function usage tracker
|
---|
1490 | - rusticl: use cleanup funcs
|
---|
1491 | - gallivm: add support for function calling
|
---|
1492 | - llvmpipe/cs: add support for function calls.
|
---|
1493 | - llvmpipe: enable driver functions.
|
---|
1494 | - radv: don't emit event code on video queues.
|
---|
1495 | - spirv: use a pointer sized int type for opencl event_t
|
---|
1496 | - clover: fix parameter arguments since recent translator changes.
|
---|
1497 | - radv/video: take db alignment into account when allocating images.
|
---|
1498 | - ac,radeonsi: move vcn enc structs to common
|
---|
1499 | - ac,radeonsi: move vcn enc av1 default cdf file to common
|
---|
1500 | - nir: add a deref slot counter that handles compact
|
---|
1501 | - llvmpipe/linear: drop tgsi path.
|
---|
1502 | - gallivm: drop tgsi aos paths.
|
---|
1503 | - llvmpipe/nir: call gather info to update inputs read properly
|
---|
1504 | - llvmpipe/fs: start converting interp/input paths to nir.
|
---|
1505 | - llvmpipe/fs: start converting derived state to nir based.
|
---|
1506 | - llvmpipe/linear: convert to using nir for output.
|
---|
1507 | - llvmpipe/linear: move to nir inputs
|
---|
1508 | - draw/mesh: reset some user state values on mesh draws.
|
---|
1509 | - llvmpipe/fs: fix regression in sample mask handling from tgsi removal.
|
---|
1510 | - llvmpipe: reset viewport_index_slot in fb bind
|
---|
1511 | - llvmpipe/cs: migrate to generic jit texture from pipe code.
|
---|
1512 | - llvmpipe/cs: migrate cs image handle to common jit code.
|
---|
1513 | - lavapipe: fix some whitespace in advance of other changes.
|
---|
1514 | - lavapipe: fix subresource layers asserts
|
---|
1515 | - lavapipe: support host image copying on compressed texture formats
|
---|
1516 | - llvmpipe: don't create texture functions for planar textures.
|
---|
1517 | - lavapipe: don't emit blit src/dst for subsampled formats.
|
---|
1518 | - llvmpipe: don't support planar formats for buffers.
|
---|
1519 | - lavapipe: convert sampler to use vk base class.
|
---|
1520 | - lavapipe: cleanup copy code to use a local region variable.
|
---|
1521 | - lavapipe: start introducing planes structure.
|
---|
1522 | - lavapipe: allocate image and image view planes.
|
---|
1523 | - lavapipe: handle planes in copies
|
---|
1524 | - lavapipe: handle planes in get image sub resource
|
---|
1525 | - lavapipe: add descriptor sets bindings for planar images
|
---|
1526 | - lavapipe: handle planes in texture lowering.
|
---|
1527 | - lavapipe: expose planar ycbcr formats and new ycbcr features
|
---|
1528 | - lavapipe + docs: update ycbcr extension enables.
|
---|
1529 | - intel-clc: avoid using spirv-linker.
|
---|
1530 |
|
---|
1531 | David Heidelberg (82):
|
---|
1532 |
|
---|
1533 | - ci/freedreno: update a530 flakes
|
---|
1534 | - ci: build kernel in gfx-ci/linux and just use binaries in Mesa3D CI
|
---|
1535 | - ci: update kernel to 6.3.13
|
---|
1536 | - ci/freedreno: add fails introduced by upreving to 6.3.13
|
---|
1537 | - Revert "lima/ci: temporarily disable deqp-egl tests due to timeouts"
|
---|
1538 | - ci/radeonsi: stoney arb_timer_query got fixed between kernel 6.3.1..13
|
---|
1539 | - ci/lima: EGL testing was disabled when fp16 fail was removed
|
---|
1540 | - ci/freedreno: fix unexpectedpass flake on a630
|
---|
1541 | - ci/freedreno: add another a530 flakes
|
---|
1542 | - ci: add quirk for GitLab assuming changes is always true for scheduled runs
|
---|
1543 | - ci/microsoft: when re-enabling Windows Farm, always run the container
|
---|
1544 | - ci/freedreno: add a530 flakes, remove one fail which recently started passing
|
---|
1545 | - ci/panfrost: introduce OpenGL testing with Mali-G57 MP5 on Asurada chromebook
|
---|
1546 | - ci/freedreno: cover all texture gather flakes
|
---|
1547 | - ci/freedreno: add a530 flake vs-lessthanequal-uvec4-uvec4
|
---|
1548 | - ci/farms: always compare the code against main repository
|
---|
1549 | - Revert "ci/farms: always compare the code against main repository"
|
---|
1550 | - ci/kernel: add amd patch to prevent crashes when starting X
|
---|
1551 | - ci/kdl: remove extra-verbose ls command
|
---|
1552 | - ci/nouveau: add 20 minutes timeout to gk20a and align gm20b
|
---|
1553 | - ci/freedreno: document another mapbuffer flake on a530
|
---|
1554 | - ci/amd: fix timeouting radeonsi-raven-va-full job
|
---|
1555 | - docs/ci: default to port 80 for the caching proxy
|
---|
1556 | - docs/ci: update to systemd and used version of the trace for testing
|
---|
1557 | - docs/ci: remove default nginx config, which we don't need for proxy
|
---|
1558 | - bin/ci: handle errors more gracefully in update_traces_checksum script
|
---|
1559 | - ci/freedreno: document another flakes on Adreno 530
|
---|
1560 | - ci: add perfetto into mesa git-cache
|
---|
1561 | - ci/panfrost: re-enable t760 and t860 traces as a nightly job
|
---|
1562 | - CI: Re-enable G52 Vulkan testing
|
---|
1563 | - ci/panfrost: t760-gles is nightly job, test also GLES 3 and 3.1
|
---|
1564 | - ci/zink: Add flake seen in the wild
|
---|
1565 | - ci/build: limit debian-build-testing to 30 minutes
|
---|
1566 | - ci/amd: add glx\@glx-visuals-depth flake to raven
|
---|
1567 | - ci/freedreno: document vs-nested-return-sibling-loop2 flake on Adreno 530
|
---|
1568 | - ci/farms: enabled Microsoft job only when conditions are met
|
---|
1569 | - ci/deqp: really remove the uncompressed results.csv file
|
---|
1570 | - ci/baremetal: do not install curl, it's already there
|
---|
1571 | - ci/baremetal: shorten BM_KERNEL to filename and BM_DTB to name only
|
---|
1572 | - ci/freedreno: document another a530 flake batch
|
---|
1573 | - ci: remove LAVA prefix from variables which can be used also elsewhere
|
---|
1574 | - ci/zink: drop a630, which we currently have very low amount available
|
---|
1575 | - ci/freedreno: the tag belongs to the apq8016 only
|
---|
1576 | - ci/freedreno: switch references, the farm-rules takes care about this
|
---|
1577 | - ci/freedreno: handle disabling farm properly for each FD/Collabora farm
|
---|
1578 | - ci/freedreno: another batch of Adreno 530 flakes
|
---|
1579 | - gtest: backport ansi color fix
|
---|
1580 | - ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
|
---|
1581 | - panfrost/ci: revert Disable T720
|
---|
1582 | - ci/piglit: add extra space on top to prevent single quote getting into URL
|
---|
1583 | - ci/freedreno: There is only one King of Town.
|
---|
1584 | - ci: switch to 6.4 kernel, improving Adreno 660 reliability
|
---|
1585 | - ci/iris: add GL46.arrays_of_arrays_gl.SizedDeclarationsPrimitive timeout
|
---|
1586 | - ci/panfrost: add G52 flakes
|
---|
1587 | - ci/panfrost: we have enough device, parallelize Vulkan tests
|
---|
1588 | - ci/virgl: flakes in functional.draw_buffers_indexed group
|
---|
1589 | - ci/freedreno: add another a530 flake
|
---|
1590 | - ci/panfrost: add G52 simple_tests.partial_image_pot_same_format_noclear flake
|
---|
1591 | - panvk: architecture isn't invalid, just unsupported
|
---|
1592 | - panvk: catch unsupported arch in the panvk_physical_device_init
|
---|
1593 | - Revert "ci: disable a660 jobs"
|
---|
1594 | - docs: add LAVA farm informations
|
---|
1595 | - ci: disable Google Freedreno farm, currently timeouting on all jobs
|
---|
1596 | - Revert "ci: disable Google Freedreno farm, currently timeouting on all jobs"
|
---|
1597 | - ci/farms: no need to check RUNNER_TAG for Collabora farm
|
---|
1598 | - ci/traces: extend no-output timeout by 5 minutes
|
---|
1599 | - ci/venus: add fragment.32B_in_memory_with_vec4_s32 flake
|
---|
1600 | - iris: do not mention specifically clover for OpenCL support
|
---|
1601 | - ci/freedreno: disable broke cheza (Adreno 630) runners
|
---|
1602 | - ci/bare-metal: correct workaround for R8152 issue while retrieving TFTP data
|
---|
1603 | - ci/bare-metal: drop unused imports, sort, use SPDX license
|
---|
1604 | - ci/lima: farm is down, disable for now
|
---|
1605 | - ci: do not report failed job when flakes reporting fails
|
---|
1606 | - ci/freedreno: re-enable Cheza (Adreno 630) runners
|
---|
1607 | - ci/traces: upload only missing trace images
|
---|
1608 | - ci/traces: keep images for every job except the performance testing
|
---|
1609 | - ci/traces: rename upload function to reflect it works with S3
|
---|
1610 | - ci/traces: always export piglit EXTRA_ARGS
|
---|
1611 | - ci: ci_marge_queue.py
|
---|
1612 | - ci/freedreno: fix copy paste causing a618_gl being run only in manual pipeline
|
---|
1613 | - ci/freedreno: disable Adreno 660 Vulkan pre-merge
|
---|
1614 | - ci/traces: drop the freedoom-phase2-gl-high.trace
|
---|
1615 |
|
---|
1616 | David Rosca (70):
|
---|
1617 |
|
---|
1618 | - radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
|
---|
1619 | - frontends/va: Skip processing buffers already converted with EFC
|
---|
1620 | - frontends/va: Don't use EFC with scaling or filtering enabled
|
---|
1621 | - radeonsi/vcn: Don't use chroma in AV1 encode with RGB input
|
---|
1622 | - frontends/va: Parse H264 SPS for video signal parameters
|
---|
1623 | - frontends/va: Parse HEVC SPS for video signal parameters
|
---|
1624 | - frontends/va: Add postproc support for converting to full range
|
---|
1625 | - radeonsi/vcn: Set H264 video signal parameters in bitstream
|
---|
1626 | - radeonsi/vcn: Set HEVC video signal parameters in bitstream
|
---|
1627 | - radeonsi/vcn: Enable full/limited range support for H264/HEVC/AV1
|
---|
1628 | - radeonsi/vcn: Fix setting color range in AV1 bitstream
|
---|
1629 | - gallium/auxiliary/vl: Fix RGB->YCbCr full range matrix
|
---|
1630 | - gallium/auxiliary/vl: Handle UV subsampling in compute_shader_yuv
|
---|
1631 | - gallium/auxiliary/vl: Fix blurry output of compute_shader_yuv
|
---|
1632 | - frontends/va: Add YUV420 to NV12 postproc conversion
|
---|
1633 | - gallium/auxiliary/vl: Fix chroma and blurry output of cs video_buffer
|
---|
1634 | - gallium/auxiliary/vl: Fix chroma offset of compute_shader_weave
|
---|
1635 | - frontends/va: Also map VAImageBufferType for reading
|
---|
1636 | - frontends/va: Alloc interlaced surface for interlaced pics
|
---|
1637 | - frontends/vdpau: Alloc interlaced surface for interlaced pics
|
---|
1638 | - radeonsi: Don't prefer interlaced for video decode
|
---|
1639 | - ci/amd: Skip VAAPI CreateSurfacesWithConfigAttribs/1121 test
|
---|
1640 | - frontends/va: Don't allow multi-plane derive without driver support
|
---|
1641 | - frontends/va: Init view_resources array in vlVaPut/GetImage
|
---|
1642 | - radeonsi: Copy all planes with multi-plane staging textures
|
---|
1643 | - radeonsi: Enable PIPE_VIDEO_CAP_SUPPORTS_CONTIGUOUS_PLANES_MAP
|
---|
1644 | - ci/amd: Skip all VAAPI tests that creates too many huge surfaces
|
---|
1645 | - radeonsi/vcn: Update rate control when framerate changes with HEVC
|
---|
1646 | - frontends/va: Ignore requested size when creating VAEncCodedBufferType
|
---|
1647 | - gallium/auxiliary/vl: Set correct csc matrix in set_buffer_layer
|
---|
1648 | - radeonsi/vcn: Fix leaking fences in decode
|
---|
1649 | - gallium/auxiliary/vl: Add BT.709 full csc matrix
|
---|
1650 | - frontends/va: Set csc matrix in postproc
|
---|
1651 | - gallium/auxiliary/vl: Don't set csc matrix in video_buffer/rgb_to_yuv_layer
|
---|
1652 | - frontends/va: Add BT.709 as supported postproc color standard
|
---|
1653 | - Revert "radeonsi/vcn: add an exception of field case for h264 decoding"
|
---|
1654 | - gallium/auxiliary/vl: Set vertex element src_stride in vl_deint_filter
|
---|
1655 | - gallium/auxiliary: Fix util_compute_blit half texel offset with scaling
|
---|
1656 | - gallium/auxiliary/vl: Map range when updating constants
|
---|
1657 | - gallium/auxiliary/vl: Clamp coordinates in compute shaders
|
---|
1658 | - gallium/auxiliary/vl: Support chroma sample location in compute shaders
|
---|
1659 | - frontends/va: Support chroma sample location in postproc
|
---|
1660 | - frontends/va: Flush after unmapping VAImageBufferType
|
---|
1661 | - frontends/va: Parse chroma sample location in H264/HEVC SPS
|
---|
1662 | - radeonsi/vcn: Set H264/HEVC chroma sample location in bitstream
|
---|
1663 | - radeonsi/vcn: Don't hang GPU when using DCC surface as encoder input
|
---|
1664 | - frontends/va: Track surfaces in context
|
---|
1665 | - frontends/va: Destroy fences when destroying surface or context
|
---|
1666 | - radeonsi/vcn: Implement destroy_fence vfunc
|
---|
1667 | - frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
|
---|
1668 | - frontends/va: Set default rate control values once when creating encoder
|
---|
1669 | - gallium/auxiliary/vl: Add RGB to YUV compute shader
|
---|
1670 | - gallium/auxiliary/vl: Use chroma offset in YUV to RGB weave compute shader
|
---|
1671 | - gallium/auxiliary/vl: Fix YUV to RGB bob compute shader deinterlacing
|
---|
1672 | - gallium/auxiliary/vl: Only map the shader constants buffer in render
|
---|
1673 | - frontends/va: Add High Quality preset mode
|
---|
1674 | - radeonsi/vcn: Add High Quality encoding preset for AV1
|
---|
1675 | - radeonsi: Fix plane size in si_copy_multi_plane_texture
|
---|
1676 | - frontends/va: Implement vaMapBuffer2
|
---|
1677 | - frontends/va: Fix locking in vlVaBeginPicture
|
---|
1678 | - frontends/va: Parse H264 SPS for max_num_reorder_frames
|
---|
1679 | - util/vl: Fix vl_rbsp parser with bitstreams without emulation bytes
|
---|
1680 | - frontends/va: Fix parsing packed headers without emulation bytes
|
---|
1681 | - radeonsi/vcn: Add encode support for H264 B-frames
|
---|
1682 | - frontends/va: Map decoder and postproc surfaces for reading
|
---|
1683 | - radeonsi: Fix offset for linear surfaces on GFX < 9
|
---|
1684 | - gallium/auxiliary/vl: Fix coordinates clamp in compute shaders
|
---|
1685 | - gallium/auxiliary: Fix coordinates clamp in util_compute_blit
|
---|
1686 | - gallium/auxiliary/vl: Scale dst_rect x0/y0 when rendering chroma plane
|
---|
1687 | - util/rbsp: Fill bits twice if reading more than 16 bits
|
---|
1688 |
|
---|
1689 | Derek Foreman (2):
|
---|
1690 |
|
---|
1691 | - vulkan/wsi: Allow binding presentation_timing when software rendering
|
---|
1692 | - vulkan/wsi: warn about unset present_mode in PresentModeCompatibilityExt
|
---|
1693 |
|
---|
1694 | Dmitry Baryshkov (3):
|
---|
1695 |
|
---|
1696 | - gallium: move kmsro definition to the bottom of the file
|
---|
1697 | - gallium: unbreak kmsro/freedreno case
|
---|
1698 | - tu: Pass real size of prime buffers to allocator
|
---|
1699 |
|
---|
1700 | Dmitry Osipenko (3):
|
---|
1701 |
|
---|
1702 | - util/cache_test: Re-add test for disabled cache
|
---|
1703 | - util/cache_test: Fix disabled cache test using SHADER_CACHE_DISABLE_BY_DEFAULT
|
---|
1704 | - util/cache_test: Add test for get/put() with disabled cache
|
---|
1705 |
|
---|
1706 | Dor Askayo (1):
|
---|
1707 |
|
---|
1708 | - nouveau: add exported GEM handles to the global list
|
---|
1709 |
|
---|
1710 | Dr. David Alan Gilbert (6):
|
---|
1711 |
|
---|
1712 | - rusticl/core: Add profiling time storage (queued) to event
|
---|
1713 | - rusticl: Wire the 'queued' profiling time up
|
---|
1714 | - rusticl: Wire the 'submit' profiling time up
|
---|
1715 | - rusticl: Wrap pipe queries
|
---|
1716 | - rusticl: Wrap pipe query reads
|
---|
1717 | - rusticl: Wire the 'start' and 'end' profiling times up
|
---|
1718 |
|
---|
1719 | Dylan Baker (4):
|
---|
1720 |
|
---|
1721 | - VERSION: bump to 23.3.0-devel
|
---|
1722 | - docs: Update release calendar for 23.2.0-rc1
|
---|
1723 | - docs: truncate feature list for 23.3-devel
|
---|
1724 | - meson: use a single dependency call for lua
|
---|
1725 |
|
---|
1726 | Echo J (5):
|
---|
1727 |
|
---|
1728 | - nvk: Fix some cast defines
|
---|
1729 | - nvk: Add A8B8G8R8_*_PACK32 format support
|
---|
1730 | - nvk: Add bufferImageGranularity limit
|
---|
1731 | - nvk: Reset offset value in ResetDescriptorPool
|
---|
1732 | - nil: Add A4B4G4R4_UNORM format support
|
---|
1733 |
|
---|
1734 | Emma Anholt (111):
|
---|
1735 |
|
---|
1736 | - ci/radv: Clarify when the ANGLE GS failures started happening.
|
---|
1737 | - ci: Uprev ANGLE to 0518a3ff4d4e ("Android: Simplify power metrics collection")
|
---|
1738 | - ci/tgl: Improve the info for ANGLE's MSAA regression on TGL.
|
---|
1739 | - ci/tu: Add more crash cases for the multithreading bugs caught on a630.
|
---|
1740 | - ci/tu: Mark descriptor_buffer.basic.limits as failing in gmem too.
|
---|
1741 | - ci/tu: Drop some xfails for !24086
|
---|
1742 | - tu: Fix data race in userspace VMA management.
|
---|
1743 | - ci/a5xx: Add another GPU hanging piglit test to the skips.
|
---|
1744 | - Revert "ci: Disable nouveau CI"
|
---|
1745 | - nvk: Avoid strict aliasing warning in the pushbuffer encoding.
|
---|
1746 | - nvk: Fix uninitialized result usage in NVK_DEBUG_ZERO_MEMORY.
|
---|
1747 | - nvk: Fix unused result warnings in pushbuf resets.
|
---|
1748 | - nvk: Remove duplicate (disabled) point sprite setup.
|
---|
1749 | - nvk: Fix missing init of the stages to sync against.
|
---|
1750 | - nvk: Use depth_clamp_enable to select PIXEL_*_Z_CLAMP.
|
---|
1751 | - nouveau/winsys: Fix an undefined use in the error path.
|
---|
1752 | - nvk: Quiet a compiler warning.
|
---|
1753 | - nvk: Clean up redundant vendor checking for physical device creation.
|
---|
1754 | - nvk: Add support for probing as a platform device.
|
---|
1755 | - nvk: Disable shaderStorageImageReadWithoutFormat pre-Maxwell.
|
---|
1756 | - freedreno/a5xx: Fix border color structure size.
|
---|
1757 | - freedreno/a5xx: Skip emitting unused texture descriptors for images.
|
---|
1758 | - freedreno/ir3: Move pvtmem per-fiber size alignment to the compiler.
|
---|
1759 | - ci/freedreno: Drop a bunch of stale a530 xfails.
|
---|
1760 | - ci/freedreno: Sort another a530 xfail with its friends.
|
---|
1761 | - ci/freedreno: Update comments for some a530 xfails.
|
---|
1762 | - ci/freedreno: Add some more db820c xfails.
|
---|
1763 | - freedreno/devices: Move fibers_per_sp to the common info struct.
|
---|
1764 | - freedreno/devices: Set num_sp_cores explicitly for pre-gen6.
|
---|
1765 | - freedreno/a6xx: Move pvtmem allocation to ir3_gallium.
|
---|
1766 | - freedreno/a3xx: Add the shift for MEMSIZEPERITEM according to db410c docs.
|
---|
1767 | - freedreno/a5xx: Refactor SHADER_OBJ emit to a helper function.
|
---|
1768 | - freedreno/a5xx: Set num_sp_cores and set PC/VFD_POWER_CNTL accordingly.
|
---|
1769 | - freedreno/a5xx: Add private mem support.
|
---|
1770 | - freedreno/cffdec: Fix decode on pixel 2 blob's COMPUTE_CHECKPOINT
|
---|
1771 | - ci/freedreno: Add a regression test for decoding a540 blob's compute shaders.
|
---|
1772 | - freedreno: Fix crashdec pre-a6xx.
|
---|
1773 | - freedreno/a5xx: Skip SSBO emit when none are enabled.
|
---|
1774 | - vulkan/util: Make multialloc succeed with 0 allocations.
|
---|
1775 | - turnip: Track the first/last subpass an attachment is used in.
|
---|
1776 | - turnip: Skip emitting empty CP_COND_REG_EXEC.
|
---|
1777 | - turnip: Save the renderpass's clear values in the cmdbuf state.
|
---|
1778 | - turnip: Move gmem clears and loads to the first subpass that uses them.
|
---|
1779 | - turnip: Move sysmem clears to the first subpass that uses them.
|
---|
1780 | - ci/freedreno: Skip some tests on a5xx that destabilize other tests.
|
---|
1781 | - freedreno/a3-5xx: Don't try to emit ISAM for SSBO loads.
|
---|
1782 | - ci/turnip: Add a660 VK coverage.
|
---|
1783 | - disk_cache: Disable the "List" test for RO disk cache.
|
---|
1784 | - blorp: Disable unaligned partial HIZ fast clears for HIZ_CCS too.
|
---|
1785 | - intel/fs: Move defin/defout setup to the start of the loop.
|
---|
1786 | - intel/fs: Move the defin[]/defout[] screening up to livein[]/liveout[] setup.
|
---|
1787 | - intel/fs: Simplify compute_start_end().
|
---|
1788 | - ci/freedreno: Add another excessive-constlen UBO skip.
|
---|
1789 | - ci/anv: Drop DEQP_VER:vk setting.
|
---|
1790 | - ci/anv: Drop "-vk" from the job name.
|
---|
1791 | - ci/anv: Add a manual full VK run for TGL.
|
---|
1792 | - ci/anv: Add testing on JSL.
|
---|
1793 | - freedreno: Build drm subdir before perfcntrs, which uses it.
|
---|
1794 | - ci/intel: Add various updates from our nightly runs.
|
---|
1795 | - ci/virgl: Disable virgl-iris-traces.
|
---|
1796 | - ci/zink: Add a few updates for anv/tgl from the nightly runs.
|
---|
1797 | - ci/fastboot: Use a case insensitive match for a fastboot line.
|
---|
1798 | - ci/etnaviv: Skip some tests that hang the GPU and knock out other tests.
|
---|
1799 | - ci/etnaviv: Drop some gc2k flakes that I think are resolved.
|
---|
1800 | - ci/anv: Drop incorrect xfail addition for TGL
|
---|
1801 | - ci/anv: Drop the 16bit.scalar.13 skip.
|
---|
1802 | - ci/etnaviv: Minor xfail/flake polishing.
|
---|
1803 | - ci/etnaviv: Skip a GLES2 test that times out the asan job.
|
---|
1804 | - ci/zink: Skip more doubles tests on anv that flake at 3 minute timeouts.
|
---|
1805 | - ci/docker: Clear the results file before starting a new deqp test run.
|
---|
1806 | - ci/crocus: Add a related flake to a known one.
|
---|
1807 | - ci/etnaviv: return gl-1.4-tex1d-2dborder as a known flake
|
---|
1808 | - ci/crocus: Add known piglit flakes
|
---|
1809 | - ci/hasvk: Add a bunch of new CTS border color fails.
|
---|
1810 | - i915: Re-clang-format and enforce it in CI.
|
---|
1811 | - i915: Print the relevant counts vs limits when throwing errors.
|
---|
1812 | - i915: Don't log I915_DEBUG=fs output for blit shaders.
|
---|
1813 | - i915: Save fragment program compile error messages in the fragment shader.
|
---|
1814 | - i915: Do a test compile at glLinkShader() time.
|
---|
1815 | - i915: Make exceeding tex indirect count fatal.
|
---|
1816 | - i915: Use nir_group_loads() to reduce texture indirection phases.
|
---|
1817 | - ci/crocus: Generalize the drawarrays-vertex-count flakes.
|
---|
1818 | - ci/zink: Skip 3-minute-long glx-visuals timeouts.
|
---|
1819 | - ci/zink: Skip dmat[34] op tests in general, as well
|
---|
1820 | - ci/crocus: Disable flaky unvanquished-ultra trace
|
---|
1821 | - nir/print: Decode system values in the variable declarations.
|
---|
1822 | - ci/zink: Add a TGL flake that's showed up in nightlies recently.
|
---|
1823 | - ci/radeonsi: Drop an xfail for vangogh.
|
---|
1824 | - i915: Make I915_DEBUG=fs log shaders that fail to link due to CF.
|
---|
1825 | - nir: Flatten ifs with discards in nir_opt_peephole_select for HW without CF.
|
---|
1826 | - glsl: Remove lower_discard().
|
---|
1827 | - ci/zink: Only test half of piglit pre-merge on anv.
|
---|
1828 | - ci: Stop doing internal retries in bare-metal.
|
---|
1829 | - ci/bare-metal: Drop the 2 vs 1 exit code from poe_run.
|
---|
1830 | - ci/bare-metal: Default our boards to a 20-minute timeout for the whole job.
|
---|
1831 | - ci/iris: Drop parallel on kbl piglit to 2.
|
---|
1832 | - ci/freedreno: Fold a630_egl into a630_gl.
|
---|
1833 | - ci/freedreno: Move skqp testing to a618.
|
---|
1834 | - ci/zink: Cut zink-lvp coverage in half.
|
---|
1835 | - ci/freedreno: Generalize the implicit_unmap timeouts.
|
---|
1836 | - ci_run_n_monitor: Poll mesa/mesa and user/mesa for pipelines at the same time.
|
---|
1837 | - glx: Delete support for GLX_OML_swap_method.
|
---|
1838 | - ci: drop skip for glx-swap-copy.
|
---|
1839 | - dri: Drop a duplicate mesa vs pipe format table.
|
---|
1840 | - docs/ci: Drop old instructions for farm disabling
|
---|
1841 | - docs/ci: Add some links in the CI docs to how to track job flakes
|
---|
1842 | - glsl: Remove int64 div/mod lowering.
|
---|
1843 | - llvmpipe: Set nir_lower_dround_even.
|
---|
1844 | - nir: Add nir_lower_dsign as 64-bit fsign lowering.
|
---|
1845 | - glsl: Retire dround lowering.
|
---|
1846 | - ci_run_n_monitor: Always resolve --rev arguments for looking up pipelines.
|
---|
1847 |
|
---|
1848 | Eric Engestrom (194):
|
---|
1849 |
|
---|
1850 | - ci: avoid running hardware jobs if lint fails - now on LAVA too!
|
---|
1851 | - ci: avoid running hardware jobs if lint fails - now on Windows too!
|
---|
1852 | - ci: replace copy of nouveau rules with reference
|
---|
1853 | - ci: drop leftover kernel configs
|
---|
1854 | - ci: use !reference for scheduled_pipeline retry rule
|
---|
1855 | - ci: add .llvmpipe-manual-rules and use it
|
---|
1856 | - ci: add .gallium-core-rules and use it instead of gallium_core_file_list anchor
|
---|
1857 | - ci: replace llvmpipe_file_list anchor with reference
|
---|
1858 | - ci: replace softpipe_file_list anchor with reference
|
---|
1859 | - ci: replace lavapipe_file_list anchor with reference
|
---|
1860 | - ci: replace iris_file_list anchor with reference
|
---|
1861 | - ci: replace radv_file_list anchor with reference
|
---|
1862 | - ci: replace radeonsi_file_list anchor with reference
|
---|
1863 | - ci: replace virgl_file_list anchor with reference
|
---|
1864 | - ci: move etnaviv files rules to src/etnaviv/ci/gitlab-ci.yml
|
---|
1865 | - ci: move freedreno files rules to src/freedreno/ci/gitlab-ci.yml
|
---|
1866 | - ci: move nouveau files rules to src/gallium/drivers/nouveau/ci/gitlab-ci.yml
|
---|
1867 | - ci: move panfrost files rules to src/panfrost/ci/gitlab-ci.yml
|
---|
1868 | - ci: move broadcom files rules to src/broadcom/ci/gitlab-ci.yml
|
---|
1869 | - ci: move lima files rules to src/gallium/drivers/lima/ci/gitlab-ci.yml
|
---|
1870 | - ci: move amd files rules to src/amd/ci/gitlab-ci.yml
|
---|
1871 | - ci: move microsoft files rules to src/microsoft/ci/gitlab-ci.yml
|
---|
1872 | - ci: move zink files rules to src/gallium/drivers/zink/ci/gitlab-ci.yml
|
---|
1873 | - ci: move virtio files rules to src/virtio/ci/gitlab-ci.yml
|
---|
1874 | - ci: move intel files rules to src/intel/ci/gitlab-ci.yml
|
---|
1875 | - ci: move virgl files rules to src/gallium/drivers/virgl/ci/gitlab-ci.yml
|
---|
1876 | - ci: move llvmpipe files rules to src/gallium/drivers/llvmpipe/ci/gitlab-ci.yml
|
---|
1877 | - ci: move softpipe files rules to src/gallium/drivers/softpipe/ci/gitlab-ci.yml
|
---|
1878 | - ci: move lavapipe files rules to src/gallium/drivers/lavapipe/ci/gitlab-ci.yml
|
---|
1879 | - ci: delete install.tar after extracting it to avoid re-uploading it
|
---|
1880 | - docs: add release notes for 23.1.4
|
---|
1881 | - docs: add sha256sum for 23.1.4
|
---|
1882 | - docs: update calendar for 23.1.4
|
---|
1883 | - asahi: drop unused include paths
|
---|
1884 | - ci/lint: deduplicate formatting check jobs
|
---|
1885 | - ci/lint: also print a diff for rust format issues
|
---|
1886 | - ci: allow hw jobs even if lint jobs fail for non-Marge pipelines
|
---|
1887 | - ci: print rustfmt's version
|
---|
1888 | - ci: print clang-format's version
|
---|
1889 | - bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match
|
---|
1890 | - lavapipe/ci: use tighter changes: rules
|
---|
1891 | - ci: add a 10min job timeout to formatting checks
|
---|
1892 | - ci: reduce bare-metal retries of poe_run to only 3 attempts
|
---|
1893 | - broadcom/ci: reduce vc4-rpi3-gl timeout to 30min (instead of 1h)
|
---|
1894 | - broadcom/ci: reduce v3d-rpi4-gl timeout to 30min (instead of 1h)
|
---|
1895 | - broadcom/ci: reduce v3d-rpi4-traces timeout to 30min (instead of 1h)
|
---|
1896 | - broadcom/ci: reduce v3dv-rpi4-vk timeout to 30min (instead of 1h)
|
---|
1897 | - ci: add .core-rules to .gallium-core-rules
|
---|
1898 | - ci: drop rule for non-existent src/include/
|
---|
1899 | - docs: add release notes for 23.1.5
|
---|
1900 | - docs: add sha256sum for 23.1.5
|
---|
1901 | - docs: update calendar for 23.1.5
|
---|
1902 | - ci: include some timing information in the git cache download script
|
---|
1903 | - docs/ci: stop trying to enumerate drivers that are tested using VK-GL-CTS
|
---|
1904 | - docs/ci: in paragraph about the CI being overwhelmed, mention our tool to help with that
|
---|
1905 | - docs/ci: drop mention of build systems variants in the CI
|
---|
1906 | - docs/ci: expand the description of test suites
|
---|
1907 | - bin: add wrapper to run scripts in a python venv
|
---|
1908 | - bin/ci/ci_run_n_monitor: use venv wrapper
|
---|
1909 | - bin/ci/gitlab_gql: use venv wrapper
|
---|
1910 | - bin/ci/update_traces_checksum: use venv wrapper
|
---|
1911 | - bin/pick-ui: use venv wrapper
|
---|
1912 | - ci: include mold in x86_64_test-base & rootfs images
|
---|
1913 | - ci: use mold to build deqp
|
---|
1914 | - zink/ci: set the default timeout for zink jobs to 30min instead of 1h
|
---|
1915 | - egl: make _eglFilterConfigArray static
|
---|
1916 | - egl: fixup _eglFilterConfigArray() params and drop _eglFallbackMatch() wrapper
|
---|
1917 | - ci: build nvk
|
---|
1918 | - ci: document max image tag length
|
---|
1919 | - docs/radv: mark VK_EXT_tooling_info as implemented
|
---|
1920 | - docs/radv: mark VK_INTEL_shader_integer_functions2 as implemented
|
---|
1921 | - git-blame-ignore-revs: repeat instruction on how to enable to avoid having to look for it
|
---|
1922 | - git-blame-ignore-revs: add radv formatting commit
|
---|
1923 | - git-blame-ignore-revs: add pvr formatting commit
|
---|
1924 | - meson: fix indentation
|
---|
1925 | - docs/v3dv: mark direct display extensions as implemented
|
---|
1926 | - ci: reorder vk drivers alphabetically in debian-vulkan job
|
---|
1927 | - ci: build hasvk in debian-vulkan job
|
---|
1928 | - ci/zink+radv: set a timeout of 2x the normal runtime
|
---|
1929 | - amd/ci: drop duplicate test expectations
|
---|
1930 | - panfrost: upcast uint8/uint16 before shifting them beyond their range
|
---|
1931 | - ci/a530: document piglit flake
|
---|
1932 | - docs: add release notes for 23.1.6
|
---|
1933 | - docs: add sha256sum for 23.1.6
|
---|
1934 | - docs: update calendar for 23.1.6
|
---|
1935 | - docs: add one more 23.1.x release
|
---|
1936 | - ci: rename \*.log to \*.txt to work around gitlab bug
|
---|
1937 | - ci/freedreno: reuse freedreno_gl_file_list instead of re-defining it
|
---|
1938 | - egl: bump extension string length
|
---|
1939 | - vc4: drop duplicate .lower_ldexp
|
---|
1940 | - zink: fix format in zink_make_{image,texture}_handle_resident()
|
---|
1941 | - v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
|
---|
1942 | - v3dv: fix copy/pasted type of \`sample`
|
---|
1943 | - v3dv: fix shader stage name in error message
|
---|
1944 | - v3d/qpu: fix type of function argument
|
---|
1945 | - ci/deqp: backport fix for dEQP-EGL.functional.wide_color.*_888_colorspace_*
|
---|
1946 | - ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
|
---|
1947 | - bin/ci_run_n_monitor: error out if both --project and --pipeline-url are passed
|
---|
1948 | - ci: document farm rules
|
---|
1949 | - ci/b2c: skip install.tar extraction if the tarball is not present
|
---|
1950 | - ci/b2c: don't allow failures in test script preparation
|
---|
1951 | - ci/b2c: assert that install folder is present whether or not the tarball was extracted
|
---|
1952 | - ci/amd: split the polaris10 rules into one for each farm
|
---|
1953 | - ci: skip containers & build jobs when disabling a farm
|
---|
1954 | - docs: add release notes for 23.1.7
|
---|
1955 | - docs: add sha256sum for 23.1.7
|
---|
1956 | - docs: update calendar for 23.1.7
|
---|
1957 | - docs: add one more 23.1.x release
|
---|
1958 | - ci: taking igalia farm offline
|
---|
1959 | - ci/b2c: drop logic to remove install.tar
|
---|
1960 | - ci: drop clover leftover
|
---|
1961 | - Revert "ci: taking igalia farm offline"
|
---|
1962 | - bin/ci_run_n_monitor: print in which repo we're looking for the pipeline
|
---|
1963 | - bin/ci_run_n_monitor: automatically pick MR pipelines when they exist
|
---|
1964 | - ci: remove duplicate fork pipeline in MRs
|
---|
1965 | - ci_run_n_monitor: add comment to explain "MR > fork" logic
|
---|
1966 | - ci: don't run everything just because a farm gets re-enabled
|
---|
1967 | - ci/windows: centralize definition of windows runners tags
|
---|
1968 | - ci/windows: add windows docker runner tags to .windows-docker-vs2019
|
---|
1969 | - ci/windows: drop build rules from test jobs
|
---|
1970 | - ci: document which image tags need to be bumped when updating piglit
|
---|
1971 | - ci: document which image tags need to be bumped when updating {alpine,debian,fedora}/x86_64
|
---|
1972 | - ci/farm-rules: rename .disable-farm-mr-rules to make it clear it's only about MRs
|
---|
1973 | - ci/farm-rules: re-add "run every container and build job when a farm gets re-enabled"
|
---|
1974 | - ci/zink: drop redundant \`MESA_LOADER_DRIVER_OVERRIDE: zink`
|
---|
1975 | - docs: add release notes for 23.1.8
|
---|
1976 | - docs: add sha256sum for 23.1.8
|
---|
1977 | - docs: update calendar for 23.1.8
|
---|
1978 | - docs: add another 23.1.x
|
---|
1979 | - ci: limit build jobs to 30min so that they can retry when they go wrong
|
---|
1980 | - docs: drop outdated and redundant note about the minimum meson version
|
---|
1981 | - ci/zink+radv: specify that zink-radv-navi10-valve should run in the mupuf farm
|
---|
1982 | - ci/zink+radv: bump the timeout of zink-radv-navi10-valve by 10 minutes
|
---|
1983 | - docs: add calendar for 23.3
|
---|
1984 | - ci: unify container and build jobs rules
|
---|
1985 | - docs/meson: drop mention that our meson is ready
|
---|
1986 | - ci/docs: drop extra overwritten rules
|
---|
1987 | - ci/zink+radv: document flake
|
---|
1988 | - docs: document the merging process and what is allowed or not
|
---|
1989 | - ci: drop unused shader-db clone + build from alpine image
|
---|
1990 | - ci: drop unused shader-db clone + build from fedora image
|
---|
1991 | - ci: move shader-db clone/build into its own script
|
---|
1992 | - ci/deqp-runner: fix indentation
|
---|
1993 | - ci/deqp-runner: restore exit-on-error after getting deqp-runner's exit code
|
---|
1994 | - ci: fix shebang in build-deqp-runner.sh
|
---|
1995 | - docs: add release notes for 23.1.9
|
---|
1996 | - docs: add sha256sum for 23.1.9
|
---|
1997 | - docs: update calendar for 23.1.9
|
---|
1998 | - ci: drop unused ephemeral packages in alpine image
|
---|
1999 | - docs/ci: rewrite the "farm maintenance ^ other change" rule to mean what we actually meant
|
---|
2000 | - ci: skip dEQP-VK.api.driver_properties.conformance_version for everyone
|
---|
2001 | - pick-ui: use assignment expressions
|
---|
2002 | - pick-ui: use more expressive variable names
|
---|
2003 | - pick-ui: add \`Backport-to: XX.Y` nomination
|
---|
2004 | - v3d/ci: move traces job to wayland
|
---|
2005 | - ci: print deqp version in the job log
|
---|
2006 | - ci/b2c: move to the shiny new \`gfx-ci/ci-tron` repo
|
---|
2007 | - ci/b2c: use latest mesa-trigger image
|
---|
2008 | - include/dri_interface.h: restore define mistakenly removed in !25587
|
---|
2009 | - ci_run_n_monitor: dependency jobs must always be started
|
---|
2010 | - util/xmlconfig: drop driInjectDataDir() now that DRIRC_CONFIGDIR is always supported
|
---|
2011 | - util/xmlconfig: inline datadir
|
---|
2012 | - ci/b2c: change artifacts path to match baremetal and LAVA
|
---|
2013 | - VERSION: bump for rc1
|
---|
2014 | - .pick_status.json: Update to e64a97694ac9dc97f65e1a8e91a5c9789109fd2c
|
---|
2015 | - .pick_status.json: Update to 4cdd094ae1e97d857a6b9dbc291d7bbe6ea266ac
|
---|
2016 | - .pick_status.json: Update to e4a1bc70dd739ca8addddc940af08312b038e288
|
---|
2017 | - .pick_status.json: Update to faed5d647f2416bb0ce3a9d33a3955169c70dc52
|
---|
2018 | - VERSION: bump for 23.3.0-rc2
|
---|
2019 | - .pick_status.json: Update to 1f1ec1c6bcc2a32a3c1df8c2cc7a2f4e7139b7ec
|
---|
2020 | - .pick_status.json: Mark 8dda860f83ac30d042dc6beb4438cc925d1fd130 as denominated
|
---|
2021 | - .pick_status.json: Update to 7d6f9ccfbeab050c26775d5e03578a01526cbfcb
|
---|
2022 | - .pick_status.json: Update to aa33ca0a52591961f8ae01dc253354462ed17c18
|
---|
2023 | - .pick_status.json: Update to a77ea9555aa00cc12f3d1c440252e940ff552500
|
---|
2024 | - .pick_status.json: Mark 227300345ed38377190b0eaf08694d5c42ee7e60 as denominated
|
---|
2025 | - VERSION: bump for 23.3.0-rc3
|
---|
2026 | - .pick_status.json: Update to 56451ce773c11094a8c08fdc6b500bb8bdcf37e1
|
---|
2027 | - .pick_status.json: Mark fa7ec4226bdf48bf63438e303af83ecd58ec95f2 as denominated
|
---|
2028 | - .pick_status.json: Update to 08f851f4361cfbdb211dc70d03cf3ebff331c3ee
|
---|
2029 | - .pick_status.json: Update to 03a7cb261828b350dd9b56bd74850197ca9eba33
|
---|
2030 | - .pick_status.json: Mark fcfa68a632e5711cc657b103c9a0384928e9bf49 as denominated
|
---|
2031 | - VERSION: bump for 23.3.0-rc4
|
---|
2032 | - .pick_status.json: Update to f05688aa3299a27430119b27e45181a6f415bff8
|
---|
2033 | - egl/dri2: increase NUM_ATTRIBS to fit all the attributes
|
---|
2034 | - .pick_status.json: Update to f39ed0063b4cd3e5a71efad2d43ce31f574c698d
|
---|
2035 | - .pick_status.json: Update to b07a58157d0b110dbc09a42cffe7046c3200dd3b
|
---|
2036 | - VERSION: bump for 23.3.0-rc5
|
---|
2037 | - .pick_status.json: Update to f843b14c171299e1696ca6d971ccaa496f60c3ab
|
---|
2038 | - intel/perf: fix regex escaping
|
---|
2039 | - intel/ci: fix .hasvk-manual-rules
|
---|
2040 | - VERSION: bump for 23.3.0
|
---|
2041 | - Revert "VERSION: bump for 23.3.0"
|
---|
2042 | - docs: add release notes for 23.3.0
|
---|
2043 | - Revert "docs: add release notes for 23.3.0"
|
---|
2044 |
|
---|
2045 | Erico Nunes (10):
|
---|
2046 |
|
---|
2047 | - lima/ppir: don't optimize loads with different block successors
|
---|
2048 | - lima/ppir: convert to nir_legacy
|
---|
2049 | - lima/gpir: switch to register intrinsics
|
---|
2050 | - egl/drm: fix EGL_EXT_buffer_age with gbm contexts
|
---|
2051 | - lima: fix plbu block stride calculation
|
---|
2052 | - ci: disable lima LAVA lab for maintenance
|
---|
2053 | - Revert "ci: disable lima LAVA lab for maintenance"
|
---|
2054 | - v3dv: allow headless device without display device
|
---|
2055 | - Revert "ci/lima: farm is down, disable for now"
|
---|
2056 | - v3dv: Rework to remove drm authentication for wsi
|
---|
2057 |
|
---|
2058 | Erik Faye-Lund (30):
|
---|
2059 |
|
---|
2060 | - meson: report with_glvnd in summary
|
---|
2061 | - docs: upgrade bootstrap to 5.3.1
|
---|
2062 | - docs: expand mobile-menu without js
|
---|
2063 | - panfrost: delete stale editorconfig file
|
---|
2064 | - docs/panfrost: link to lima
|
---|
2065 | - docs/panfrost: use code-blocks with wrapping for long blocks
|
---|
2066 | - docs/panfrost: use math-role to denote powers of two
|
---|
2067 | - docs: fix linkcheck
|
---|
2068 | - docs: update a few links to https
|
---|
2069 | - docs: update anchor for link
|
---|
2070 | - docs: update link to git-wiki
|
---|
2071 | - docs: link to upstream etnaviv
|
---|
2072 | - docs: apply some trivial redirects
|
---|
2073 | - docs: use doc-role when linking to lists article
|
---|
2074 | - docs: keep up with intels ever-moving documentation
|
---|
2075 | - docs: mark some redirects as allowed
|
---|
2076 | - docs: only link to old docs from html
|
---|
2077 | - docs: use html_static_path for static files
|
---|
2078 | - ci/etnaviv: update ci expectation
|
---|
2079 | - ci/etnaviv: allow failure on failing test
|
---|
2080 | - zink: fix wording of warning
|
---|
2081 | - ci/etnaviv: move failure to flake
|
---|
2082 | - meson: add wayland-protocols from meson wrapdb
|
---|
2083 | - util/xmlconfig: add an env-var for overriding drirc search dir
|
---|
2084 | - meson: add src/util to the drirc search path
|
---|
2085 | - docs/relnotes: remove cruft from end of lines
|
---|
2086 | - docs/ci: escape at-symbols
|
---|
2087 | - docs/relnotes: escape some at-symbols
|
---|
2088 | - bin/gen_release_notes: escape at-symbols
|
---|
2089 | - panfrost: use perf_debug instead of open-coding
|
---|
2090 |
|
---|
2091 | Faith Ekstrand (809):
|
---|
2092 |
|
---|
2093 | - nv50/ir: Convert to new-style NIR registers
|
---|
2094 | - nv50/ir: Support vector movs
|
---|
2095 | - intel/fs: Add support for new-style registers
|
---|
2096 | - intel/vec4: Assume get_nir_dest() provides a sane write-mask
|
---|
2097 | - intel/vec4: Add support for new-style registers
|
---|
2098 | - intel: Switch to intrinsic-based registers
|
---|
2099 | - intel/fs: Drop support for nir_register
|
---|
2100 | - intel/vec4: Drop support for nir_register
|
---|
2101 | - anv,hasvk,iris: sampler_prog_key::swizzles is only used on crocus
|
---|
2102 | - nir: Properly handle divergence for load_reg
|
---|
2103 | - nir/trivialize: Maintain divergence information
|
---|
2104 | - nir/trivialize: Trivialize cross-block loads
|
---|
2105 | - vc4: Convert to new-style NIR registers
|
---|
2106 | - nir/schedule: Support load/store_reg
|
---|
2107 | - broadcom/compiler: Convert to new-style NIR registers
|
---|
2108 | - intel/fs: Use write masks from store_reg intrinsics
|
---|
2109 | - intel/fs: Rework the overlapping mov/vec case
|
---|
2110 | - intel/fs: Assume NIR is in SSA form
|
---|
2111 | - nir: Add a backend_flags field to nir_tex_instr
|
---|
2112 | - intel/fs: Add a parameter to speed up register spilling
|
---|
2113 | - nir/builder: Allow tex helpers on image types
|
---|
2114 | - nir/builder: Add a nir_txs_deref() helper
|
---|
2115 | - vulkan: Add a core vk_buffer_view struct
|
---|
2116 | - vulkan: Add a more direct way to use a NIR shader
|
---|
2117 | - vulkan: Add a vk_query_pool base object
|
---|
2118 | - vulkan: Add common vkCmdBegin/EndQuery wrappers
|
---|
2119 | - vulkan/format: Add the remaining 1-plane YCbCr formats
|
---|
2120 | - vulkan: Add a core vk_sampler struct
|
---|
2121 | - nv50/nir: Lower to scratch AFTER optimization
|
---|
2122 | - nouveau: Allow GLSL_SAMPLER_DIM_SUBPASS*
|
---|
2123 | - nouveau/nir: Implement support for compact arrays
|
---|
2124 | - nouveau/codegen: Handle/indirect goes before sample index
|
---|
2125 | - nouveau/codegen: Use a NULL format for PIPE_FORMAT_NONE for images
|
---|
2126 | - nouveau/codegen: Don't convertSurfaceFormat for unknown formats
|
---|
2127 | - nv50/ir: Run nir_divergence_analysis before out-of-SSA
|
---|
2128 | - anv: Use vk_sampler
|
---|
2129 | - anv: Use vk_buffer_view
|
---|
2130 | - vulkan: Add init/finish helpers for vk_query_pool
|
---|
2131 | - anv: Use vk_query_pool
|
---|
2132 | - anv: Use the common versions of vkBegin/EndQuery()
|
---|
2133 | - nir/builder: Don't assume we have compiler options
|
---|
2134 | - Revert "mesa, compiler: Move gl_texture_index to glsl_types.h"
|
---|
2135 | - Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h"
|
---|
2136 | - vulkan: Use VkBufferUsageFlags2 in vk_buffer
|
---|
2137 | - clang-format: Set ColumnLimit to 78
|
---|
2138 | - nvk: Implement EnumerateInstanceVersion
|
---|
2139 | - nvk: Add stub implementations of VkImage and VkImageView
|
---|
2140 | - nvk: Add stub implementation of VkSampler
|
---|
2141 | - nvk: Add a stub implementation of VkBuffer
|
---|
2142 | - nvk: Implement VkDescriptorSetLayout
|
---|
2143 | - nvk: Implement VkPipelineLayout
|
---|
2144 | - nvk: Add initial descriptor set lowering
|
---|
2145 | - nvk: Implement vkUpdateDescriptorSets
|
---|
2146 | - nvk: Expose nvk_descriptor_stride_align_for_type
|
---|
2147 | - nvk: Re-format descriptor set layouts
|
---|
2148 | - nvk: Re-format pipeline layouts
|
---|
2149 | - nvk: Re-format descriptor sets some more
|
---|
2150 | - nvk/buffer: Take an offset in nvk_buffer_address
|
---|
2151 | - nvk/buffer: Add a push_buffer_ref helper
|
---|
2152 | - nvk/copy: Use nvk_buffer_address in CmdCopyBuffer
|
---|
2153 | - nvk/image: Add image address helpers
|
---|
2154 | - nvk/copy: Use nvk_image_base_address()
|
---|
2155 | - nvk: Add an nvk_device_physical helper
|
---|
2156 | - nvk: Add a skeleton for pipelines
|
---|
2157 | - nvk: Re-arrange nvk_descriptor_set.h a bit
|
---|
2158 | - nvk: Reformat nvk_nir_lower_descriptors
|
---|
2159 | - nvk: Add a couple descriptor set address helpers
|
---|
2160 | - nvk: Move nvk_cmd_pool cast definitions
|
---|
2161 | - nvk: Rework whitespace in nvk_cmd_buffer.c
|
---|
2162 | - nvk: Add a root descriptor table
|
---|
2163 | - nvk: Fetch descriptor set addresses from the root table
|
---|
2164 | - nvk: Re-arrange nir_lower_explicit_io a bit
|
---|
2165 | - nvk: Lower load_global_constant_offset
|
---|
2166 | - nvk: Drop image_view_init
|
---|
2167 | - nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable
|
---|
2168 | - nvk: Allow R32_UINT
|
---|
2169 | - nvk: Mark nvk_push_descriptor_set_ref() inline
|
---|
2170 | - nvk: Add a descriptor table data structure
|
---|
2171 | - nvk: Copy in the nouveau TIC format table
|
---|
2172 | - nvk/image_view: Reformat and fix Create/DestroyImageView
|
---|
2173 | - nvk: Add an image descriptor table to the device
|
---|
2174 | - nvk: Fill out TIC table entries for image views
|
---|
2175 | - nvk: Set b->cursor when lowering image intrinsics
|
---|
2176 | - nvk: Unify descriptor loading in lower_descriptors
|
---|
2177 | - nvk: Re-format nvk_image_view.h a bit
|
---|
2178 | - nvk: Re-format nvk_buffer.c a bit
|
---|
2179 | - nvk: Add a stub implementation of buffer views
|
---|
2180 | - nvk: Make texture descriptors a bit more acceptable to codegen
|
---|
2181 | - nvk: GART is host-cache-coherent
|
---|
2182 | - nvk: Reserve a null image descriptor
|
---|
2183 | - nvk: Rework descriptor writes
|
---|
2184 | - nouveau: Add stubs for an image layout library called NIL
|
---|
2185 | - nil: Create images
|
---|
2186 | - nil: Add the TIC format table from nouveau
|
---|
2187 | - nil: Add a nil_view and code to fill out TIC entries
|
---|
2188 | - nvk: Add an nvk_get_format helper
|
---|
2189 | - nvk: Use helpers for push_ref
|
---|
2190 | - nvk: Align arguments consistently in copy/blit code
|
---|
2191 | - nvk: Move Fill/UpdateBuffer to nvk_cmd_copy
|
---|
2192 | - Revert "nvk: Stop returning VK_ERROR_FORMAT_NOT_SUPPORTED for non-blitable"
|
---|
2193 | - nvk: Manually offset for array layers in copy/blit
|
---|
2194 | - nvk: Convert to using NIL for image layout
|
---|
2195 | - nvk: Re-indent image entrypoints
|
---|
2196 | - nvk: Implement VkGetImageSubresourceLayout
|
---|
2197 | - nvk: Reset and properly clean up command buffer upload areas
|
---|
2198 | - nvk: Rework format features queries
|
---|
2199 | - nvk: Add a more competent GetPhysicalDeviceImageFormatProperties
|
---|
2200 | - nvk: Support compressed images in copy commands
|
---|
2201 | - nvk: Drop vk_sync BO refs after push_submit
|
---|
2202 | - nil: Drop miptail support for now
|
---|
2203 | - nil: Don't minify image dimensions when setting up TIC
|
---|
2204 | - nil: Refactor TIC image extent setup
|
---|
2205 | - nil: Fix image array layer alignments
|
---|
2206 | - nvk: Texture pool sizes are maximums not sizes
|
---|
2207 | - nvk: Re-format nvk_sampler.c
|
---|
2208 | - nvk: Implement samplers
|
---|
2209 | - nil: Add a helper for filling out buffer TIC entries
|
---|
2210 | - nvk: Move is_storage_image_format to nvk_format.c
|
---|
2211 | - nvk: Implement buffer views
|
---|
2212 | - nvk: Advertise KHR_dedicated_allocation
|
---|
2213 | - nvk: Use the correct root descriptor table size for CmdDispatch
|
---|
2214 | - nvk: Add support for dynamic buffers
|
---|
2215 | - nvk: Better advertise image format features
|
---|
2216 | - nvk: Advertise descriptor array indexing
|
---|
2217 | - nvk: Advertise non-zero descriptor set limits
|
---|
2218 | - nvk: Use a descriptor type instead of a hand-rolled thing
|
---|
2219 | - nvk: Handle cube storage images properly
|
---|
2220 | - nvk: Load the requested descriptor size
|
---|
2221 | - nvk: Implement push constants
|
---|
2222 | - nvk: Properly indent a comment
|
---|
2223 | - nvk: Fix descriptor offset alignment
|
---|
2224 | - nvk: Use a switch for descriptor types in load_descriptor
|
---|
2225 | - nvk: Support inline uniform blocks
|
---|
2226 | - nvk: Delete the storage TIC in nvk_image_view_destroy
|
---|
2227 | - nvk: Assert that we don't double-free descriptors
|
---|
2228 | - nvk: Initial vkCmdClearImage support
|
---|
2229 | - nvk: Unconditionally zero image format properties
|
---|
2230 | - nvk: No-op sparse image format properties
|
---|
2231 | - nvk: Advertise minUniformBufferOffsetAlignment
|
---|
2232 | - nvk: Rework OOM handling for descriptor pools
|
---|
2233 | - nvk: Bind immutable samplers on descriptor set creation
|
---|
2234 | - nvk: Pad shader BOs by 4K to avoid I-cache overflow
|
---|
2235 | - nvk: Include nvk_private.h in everything
|
---|
2236 | - nvk: Make image/buffer address helpers const
|
---|
2237 | - nouveau/push: Add a P_INLINE_FLOAT helper
|
---|
2238 | - nvk: Init WSI after setting up supported_sync_types
|
---|
2239 | - nouveau/parser: Fix an integer overflow and a typo
|
---|
2240 | - nouveau/parser: Properly dump most arrays used by 3D
|
---|
2241 | - nouveau/parser: Better dump float data
|
---|
2242 | - nouveau/parser: Handle arrays properly in P_IMMD()
|
---|
2243 | - nouveau/push: Make P_IMMD more versatile
|
---|
2244 | - nouveau: Null terminate the debug flag list
|
---|
2245 | - nouveau: Generate 3D headers
|
---|
2246 | - nvk: Add graphics state to command buffers
|
---|
2247 | - nvk: Split pipeline binding into helpers
|
---|
2248 | - nvk: Switch to vk_pipeline_shader_stage_to_nir
|
---|
2249 | - nvk: Don't free the NIR in nvk_compile_nir
|
---|
2250 | - nvk: Add an nvk_shader_address helper
|
---|
2251 | - nvk: Free pipeline shader BOs
|
---|
2252 | - nvk: Expose pipeline alloc/free functions
|
---|
2253 | - nvk: Make shader_upload take an nvk_device
|
---|
2254 | - nvk/shader: Assign I/O locations and gather info
|
---|
2255 | - nvk/shader: Populate headers for vertex and fragment shaders
|
---|
2256 | - nvk: Add a nvk_cmd_buffer_device() helper
|
---|
2257 | - nvk: Import 3D context init code from nouveau
|
---|
2258 | - nil/format: Add helpers for render formats
|
---|
2259 | - nvk: Add boilerplate for Begin/EndRendering
|
---|
2260 | - nvk: Misc. additional state setup
|
---|
2261 | - nvk: Emit dynamic graphics state
|
---|
2262 | - nvk: Implement push constants and descriptors for graphics
|
---|
2263 | - nouveau: Add CPU push buffers
|
---|
2264 | - nvk: Graphics pipelines
|
---|
2265 | - nvk: Implement vkCmdDraw()
|
---|
2266 | - nvk: Color attachments clears via image clears
|
---|
2267 | - vulkan/meta: Add the start of a meta framework
|
---|
2268 | - vulkan/meta: Add an object tracking list
|
---|
2269 | - vulkan/meta: Add a concept of rect pipelines
|
---|
2270 | - vulkan/meta: Implement attachment clears
|
---|
2271 | - vulkan/meta: Implement start-of-rendering clears
|
---|
2272 | - vulkan/meta: Add implementations of Clear*Image
|
---|
2273 | - nvk: Add an attachment format even for secondaries
|
---|
2274 | - nvk: Add an addr field to nvk_buffer
|
---|
2275 | - nvk: Expose a bind_vertex_buffer helper
|
---|
2276 | - nvk: Use vk_meta for CmdClearAttachments
|
---|
2277 | - nvk: Stop using vk_cmd_set_dynamic_graphics_state in meta_end()
|
---|
2278 | - nvk: Enable all the dynamic state features
|
---|
2279 | - nouveau: Fix pushbuf ref reset for user command buffers
|
---|
2280 | - nvk: add linear image creation support.
|
---|
2281 | - nvk: Use max alignment for descriptor pool sizes
|
---|
2282 | - nil: Switch to using the new headers for TIC entries
|
---|
2283 | - nvk: Use meta for CmdClear*Image
|
---|
2284 | - nvk: Zero client memory objects
|
---|
2285 | - nvk: Bind texture and sampler header pools for 3D
|
---|
2286 | - nvk: Use the new headers for samplers
|
---|
2287 | - nvk: Implement nir_intrinsic_load_frag_coord
|
---|
2288 | - vulkan/meta_clear: Populate VkRenderingInfo::renderArea
|
---|
2289 | - nvk: Don't assert when there are no attachments
|
---|
2290 | - nvk: Track and reference all device memory objects
|
---|
2291 | - vulkan: Allow scissors or viewports to be set without counts
|
---|
2292 | - nvk/copy: Make bpp part of nouveau_copy_buffer
|
---|
2293 | - nvk: Implement copies for D24_UNORM_S8_UINT images
|
---|
2294 | - nvk: Drop sample locations structs
|
---|
2295 | - nvk/meta: Save and restore VI state
|
---|
2296 | - nvk: Re-initialize dynamic_graphics_state.vi when recycling
|
---|
2297 | - nvk: Move the vertex format table into nvk_format.h
|
---|
2298 | - nvk: Advertise vertex buffer format features
|
---|
2299 | - nvk: Clean up try_create_physical_device error handling
|
---|
2300 | - nouveau/parser: Dump more fields as float
|
---|
2301 | - nvk: Depth bounds need fui()
|
---|
2302 | - nouveau: Add class information to nouveau_ws_device
|
---|
2303 | - nil: Properly depend on nouveau winsys and nvidia-headers
|
---|
2304 | - nil: Use nvidia headers for texture format enums
|
---|
2305 | - nil: Use the nvidia headers for render target format enums
|
---|
2306 | - nil: Use nvidia headers for ZS format enums
|
---|
2307 | - nil: Rename rt to czt in the format info struct
|
---|
2308 | - nil: Rename rendering to color_target
|
---|
2309 | - nil: Re-introduce the format capabilities
|
---|
2310 | - nil: Add more format support helpers
|
---|
2311 | - nvk: Advertise more format features
|
---|
2312 | - nvk: Clear dynamic state dirty after flushing it all
|
---|
2313 | - vulkan/meta: Make stencil reference dynamic for clears
|
---|
2314 | - nvk: Depth buffers don't allow Z-tiling
|
---|
2315 | - nvk: Disable sparse Z on Maxwell+
|
---|
2316 | - nil: Compute PTE kinds and tile modes for images
|
---|
2317 | - nouveau: Add a function to allocate a tiled buffer
|
---|
2318 | - nvk: Add internal helpers for device memory allocation
|
---|
2319 | - nvk: Do internal dedicated allocations for ZS images
|
---|
2320 | - nvk: Fix depth/stencil render pass clears
|
---|
2321 | - nvk: Fix viewport Z scale
|
---|
2322 | - nvk: Enable two-sided stencil
|
---|
2323 | - nvk: Flip the front-face setting
|
---|
2324 | - nvk: Advertise depth/stencil support
|
---|
2325 | - nvk: Don't destroy NULL descriptor pool BOs
|
---|
2326 | - nvk: Call nir_lower_input_attachments
|
---|
2327 | - nvk: Set GEOMETRY_SHADER_SELECTS_LAYER properly
|
---|
2328 | - nvk: Return OUT_OF_DEVICE_MEMORY if bo_new fails
|
---|
2329 | - nil: Add a PTE kind for Z32_FLOAT
|
---|
2330 | - nvk: Add nvk_queue_init/finish() helpers
|
---|
2331 | - nvk: Align descriptor buffers to NVK_MIN_UBO_ALIGNMENT
|
---|
2332 | - nvk: Re-flow a couple function prototypes
|
---|
2333 | - nvk: Assert samples == 1
|
---|
2334 | - nvk: Allocate descriptors for input attachments
|
---|
2335 | - nvk: Wire up early z and post depth coverage
|
---|
2336 | - nvk: Save/restore push constants around meta ops
|
---|
2337 | - nouveau/parser: Add array and float tags for clear values
|
---|
2338 | - nvk: Use hardware clears for attachment clears
|
---|
2339 | - nvk: Add image_view_init/finish functions
|
---|
2340 | - nvk: Implement vkCmdClear*Image directly
|
---|
2341 | - nvk: Use a UINT format to clear non-renderable images
|
---|
2342 | - nvk: Don't advertise tiling on non-power-of-two formats
|
---|
2343 | - nvk: Fix max anisotropy
|
---|
2344 | - nvk: Assert on CmdExecuteCommands
|
---|
2345 | - nvk: VkSamplerCreateInfo::mipLodBias is signed
|
---|
2346 | - nvk: Fix border color alpha
|
---|
2347 | - nil/format: Depth/stencil formats appear as red
|
---|
2348 | - nil: Fix max mip level
|
---|
2349 | - nil: Fix nonnormalized coordinates
|
---|
2350 | - nvk: Set up clip and cull distances
|
---|
2351 | - nvk: Fix dynamic buffer descriptor copies
|
---|
2352 | - nvk: Inline nouveau_copy_linear
|
---|
2353 | - nvk/copy: Rename push to p
|
---|
2354 | - nvk/blit: Rename push to p
|
---|
2355 | - nvk/dispatch: Rename push to p
|
---|
2356 | - nvk: Drop most buffer tracking
|
---|
2357 | - nvk: Rework TLS/SLM and image/sampler table handling
|
---|
2358 | - nvk: Invalidate texture header and sampler caches each submit
|
---|
2359 | - nvk/sampler: Free descriptor table entries
|
---|
2360 | - nvk: Rework nvk_descriptor_table_add/remove
|
---|
2361 | - nvk: Implement descriptor table growing
|
---|
2362 | - nvk: Zero unused descriptors
|
---|
2363 | - nvk: Add some asserts for nv50 compiler image restrictions
|
---|
2364 | - nvk: Update to the new command buffer infrastructure
|
---|
2365 | - nvk: Split nvk_queue into its own file
|
---|
2366 | - nvk: Start every command buffer with a nop
|
---|
2367 | - nvk: Initialize fixed draw/default state once
|
---|
2368 | - nouveau/parser: Convert to mako
|
---|
2369 | - nouveau/parser: Use more idiomatic python
|
---|
2370 | - nouveau/parser: Put the dump helpers in C files
|
---|
2371 | - nvk: Use f for extension features
|
---|
2372 | - nvk: Drop a TODO
|
---|
2373 | - nvk: Use VK_IMAGE_USAGE_*_ATTACHMENT_BIT for image clears
|
---|
2374 | - nvk: Increase the graphics pipeline push space
|
---|
2375 | - nil: Don't claim texture support for 2-bit SNORM
|
---|
2376 | - nouveau/push: Fix a void pointer arithmetic bug
|
---|
2377 | - nouveau/parser: Add more arrays
|
---|
2378 | - nouveau/mme: Add basic structures for the Turing+ MME
|
---|
2379 | - nouveau/mme: Add isaspec XML for the Turing+ MME
|
---|
2380 | - nouveau/mme: Add an assembler and disassembler for the Turing+ MME
|
---|
2381 | - nouveau/mme: Add a builder for the Turing+ MME
|
---|
2382 | - nouveau/mme: Add a tiny simulator for the Turing+ MME
|
---|
2383 | - nouveau/mme: Add an isaspec-based dumper
|
---|
2384 | - nouveau/mme: Make the winsys headers C++ safe
|
---|
2385 | - nouveau/mme: Add unit tests for the Turing+ MME simulator
|
---|
2386 | - nvk: Add MME infrastructure
|
---|
2387 | - nvk: Use MME for clears
|
---|
2388 | - nouveau/mme: Add helper macros for setting fields
|
---|
2389 | - nvk: Use MME for vkCmdDraw[Indexed]()
|
---|
2390 | - nvk: Implement vkCmdDraw[Indexed]Indirect()
|
---|
2391 | - nvk: Use p for the nouveau_ws_push_buffer in zero_vram
|
---|
2392 | - nouveau: Add an nv_push struct
|
---|
2393 | - nouveau: Rename the fields of vk_push
|
---|
2394 | - nouveau: Move nv_push and helpers to their own header
|
---|
2395 | - nouveau/parser: Take a FILE* in DUMP_*_MTHD_DATA
|
---|
2396 | - nouveau: Move push validate to nv_push.c
|
---|
2397 | - nouveau: Move push dumping to nv_push.c
|
---|
2398 | - nvk: Use nv_push directly for graphics pipelines
|
---|
2399 | - nouveau: Add a nouveau_ws_bo_new_mapped helper
|
---|
2400 | - nvk: Use bo_new_mapped for the zero page
|
---|
2401 | - nvk: Always allocate empty_push
|
---|
2402 | - nvk: Move queue_submit to nvk_queue_drm_nouveau.c
|
---|
2403 | - nvk: Submit pushbufs directly
|
---|
2404 | - nvk: Use a regular BO for the empty push
|
---|
2405 | - nvk: Use a regular BO for the queue state push
|
---|
2406 | - nvk: Add an nvk_queue_submit_simple helper
|
---|
2407 | - nvk: Initialize the queue later in device setup
|
---|
2408 | - nvk: Use submit_simple for draw state init
|
---|
2409 | - nvk: Use queue_submit_simple for zero_vram
|
---|
2410 | - nvk: Break nvk_cmd_pool into its own file
|
---|
2411 | - nvk: Use cmd instead of cmd_buffer
|
---|
2412 | - nvk: Add BO recycling to the command pool
|
---|
2413 | - nvk: Return VkResult from nvk_cmd_buffer_upload_alloc
|
---|
2414 | - nvk: memcpy root descriptors for compute instead of doing a DMA
|
---|
2415 | - nvk: Fully populate QMDs before uploading
|
---|
2416 | - nvk: Constant buffer alignment is actually 64B
|
---|
2417 | - nvk: Rework side-band data upload
|
---|
2418 | - nvk: Add an nvk_cmd_buffer_push helper
|
---|
2419 | - nvk: Add an nvk_cmd_buffer_ref_bo helper
|
---|
2420 | - nvk: Allocate upload buffers from the command pool
|
---|
2421 | - nvk: Use nvk_cmd_bo for push bufs
|
---|
2422 | - nvk: Implement vkCmdExecuteCommands()
|
---|
2423 | - nvk: Remove remaining references to nouveau_push.h
|
---|
2424 | - nouveau: Use DRM interfaces directly in MME tests
|
---|
2425 | - nouveau: Drop nouveau_ws_push
|
---|
2426 | - nvk: Re-indent vk_instance.c
|
---|
2427 | - nvk: Use vk_object_zalloc/free for descriptor pools/sets
|
---|
2428 | - nvk: Fix up whitespace in nvk_descriptor_set.c
|
---|
2429 | - nvk: Implement VK_KHR_push_descriptor
|
---|
2430 | - nvk: Reference descriptor set layouts in the sets themselves
|
---|
2431 | - nvk: Embed a nv_device_info in nvk_physical_device
|
---|
2432 | - nvk: Add an nvk_queue_submit wrapper
|
---|
2433 | - nvk: Also store the push BO map in nvk_queue_state
|
---|
2434 | - nvk: Bring back push sync and dumping
|
---|
2435 | - nvk: drop nvk_nir.h
|
---|
2436 | - nvk: Add lowering for load_global_constant_bounded
|
---|
2437 | - nvk: Properly implement robustBufferAccess
|
---|
2438 | - vulkan/meta: Add key types
|
---|
2439 | - vulkan/meta: Add a helper for image view types
|
---|
2440 | - vulkan/meta: Add a create_sampler helper
|
---|
2441 | - vulkan/meta: Fixes for clear
|
---|
2442 | - vulkan/meta: Implement vkCmdBlitImage()
|
---|
2443 | - nvk: Support load_layer_id
|
---|
2444 | - nvk/meta: Save/restore descriptor set 0
|
---|
2445 | - nvk: Use meta for doing blits with the 3D hardware
|
---|
2446 | - nvk: WFI in pipeline barriers
|
---|
2447 | - util/vma: Allow initializing zero-size heaps
|
---|
2448 | - nvk: Rework nvk_queue_submit_simple()
|
---|
2449 | - nvk: Add a heap data structure
|
---|
2450 | - nvk: Return a VkResult from nvk_shader_upload()
|
---|
2451 | - nvk: Add a shader heap to nvk_device
|
---|
2452 | - nvk: Allocate shaders from a heap
|
---|
2453 | - nvk: Rework whitespace in nvk_device_memory.c
|
---|
2454 | - nvk: Style fixes in nvk_physical_device.c
|
---|
2455 | - nvk: Reset semaphore syncs on wait
|
---|
2456 | - nvk/wsi: Style fixes
|
---|
2457 | - nvk/wsi: Use the common present implementation
|
---|
2458 | - nouveau/parser: Parse all fields in each method
|
---|
2459 | - nvk: Add a query pool object
|
---|
2460 | - nvk: Implement timestamp queries
|
---|
2461 | - nvk: Implement pipeline statistics and occlusion queries
|
---|
2462 | - nouveau/mme: Allow ZERO as the destination of mme_load_to
|
---|
2463 | - nouveau/mme: Assert on OOB registers
|
---|
2464 | - nouveau/mme: Add support for freeing registers
|
---|
2465 | - nouveau/mme: Add a couple helpers for working 64-bit addresses
|
---|
2466 | - nouveau/mme: Add a helper for MME_DMA_READ_FIFOED
|
---|
2467 | - nvk: Use mme_tu104_read_fifoed()
|
---|
2468 | - nvk: Implement vkCmdCopyQueryPoolResults()
|
---|
2469 | - nvk: Handle large command buffer uploads better
|
---|
2470 | - nvk: Use a normal DMA for CmdUpdateBuffer
|
---|
2471 | - nouveau/parser: Handle 6F methods
|
---|
2472 | - nvk: Use mme_load_addr64()
|
---|
2473 | - nvk: Use poll for BO waits
|
---|
2474 | - nvk: Events
|
---|
2475 | - nvk: Don't crash if we fail to allocate a push BO
|
---|
2476 | - nvk: Stop leaking command pool BOs
|
---|
2477 | - nvk: Enable VK_KHR_create_renderpass2
|
---|
2478 | - nvk: Advertise VK_KHR_imageless_framebuffer
|
---|
2479 | - nvk: Flush the current pushbuf before allocating a new one
|
---|
2480 | - nvk: Advertise VK_KHR_separate_depth_stencil_layout
|
---|
2481 | - nvk: Tell WSI we don't support legacy scanout
|
---|
2482 | - nouveau: Add PCI information to nv_device_info
|
---|
2483 | - nvk: Implement VK_EXT_pci_bus_info
|
---|
2484 | - nvk: Bind 3D images as 3D for clears
|
---|
2485 | - nvk: Support copies between 3D and 2D images
|
---|
2486 | - nil: Add a helper for getting 2D views of 3D images
|
---|
2487 | - nvk: Support 2D views of 3D images
|
---|
2488 | - nvk: Advertise VK_KHR_maintenance1
|
---|
2489 | - nvk: Use 2D array views for 3D storage images
|
---|
2490 | - nil: Fix include guards in nil_image.h
|
---|
2491 | - nvk: Advertise custom border color features
|
---|
2492 | - vulkan: Add a helper for swizzling color values
|
---|
2493 | - nvk: Implement VK_EXT_border_color_swizzle
|
---|
2494 | - nvk: Advertise VK_EXT_extended_dynamic_state3
|
---|
2495 | - nvk: Move more states to dynamic
|
---|
2496 | - nvk: Advertise VK_KHR_storage_buffer_storage_class
|
---|
2497 | - nvk: Add a helper for pushing descriptors
|
---|
2498 | - nouveau/headers: Add generated headers to dependencies
|
---|
2499 | - nvk: Implement VK_EXT/KHR_buffer_device_address
|
---|
2500 | - nvk: Break the guts of CmdDispatch into a helper
|
---|
2501 | - nvk: Implement DispatchIndirect
|
---|
2502 | - nouveau/mme: Add a mul64 helper
|
---|
2503 | - nvk: Implement CS invocations statistics queries
|
---|
2504 | - nil: Use ONE for the anisotropic coarse spread function
|
---|
2505 | - nil: Properly support MSAA
|
---|
2506 | - nil: Add an offset4d struct and some helpers
|
---|
2507 | - nouveau/parser: Sort METHOD_ARRAY_SIZES
|
---|
2508 | - nouveau/parser: Handle SET_ANTI_ALIAS_SAMPLE_POSITIONS
|
---|
2509 | - nvk: Stop asserting on MSAA
|
---|
2510 | - nvk: Handle zero color attachments better
|
---|
2511 | - nvk: Handle multisampled render targets properly
|
---|
2512 | - nvk: Support copies of MSAA images
|
---|
2513 | - nvk: Use the right view format for stencil texturing
|
---|
2514 | - nvk: Pass through a shader key for fragment shaders and MSAA
|
---|
2515 | - nvk: Set correct multisample regs for graphics pipelines
|
---|
2516 | - nvk: Stop creating a new upload BO every time
|
---|
2517 | - nvk: Fill out sample locations on Maxwell B+
|
---|
2518 | - vulkan/meta: Bind whole LODs of 3D blit destinations
|
---|
2519 | - vulkan/meta: Add a helper for building texture ops
|
---|
2520 | - vulkan/meta: Break the guts of blit into a helper
|
---|
2521 | - vulkan/meta: Support writing stencil as iterative discard
|
---|
2522 | - vulkan/meta: Rename vk_meta_blit.c to vk_meta_blit_resolve.c
|
---|
2523 | - vulkan/meta: Add support for MSAA resolves
|
---|
2524 | - nvk/meta: Fix restore for descriptor set 0
|
---|
2525 | - nvk: Use meta for MSAA resolves
|
---|
2526 | - nvk: Replace gl_SamplePosition with fract(gl_FragCoord.xy)
|
---|
2527 | - nvk: Stop advertising higher framebufferNoAttachmentsSampleCounts
|
---|
2528 | - nvk: Advertise MSAA via image format properties
|
---|
2529 | - nvk: Advertise VK_KHR_depth_stencil_resolve
|
---|
2530 | - nvk: Assert that descriptor buffer access stays in-bounds
|
---|
2531 | - nvk: Add a bo size to nvk_descriptor_set
|
---|
2532 | - nvk/format: Style fix for VkFormatProperties3KHR
|
---|
2533 | - nvk: Support VK_FORMAT_B10G11R11_UFLOAT_PACK32 for vertex buffers
|
---|
2534 | - nvk: Add a devenv ICD json file
|
---|
2535 | - nvk: Advertise EXT_vertex_attribute_divisor
|
---|
2536 | - nvk: Lower image_size to txs
|
---|
2537 | - nvk: Fix a comment
|
---|
2538 | - nvk: Add an nvk_buffer_addr_range helper
|
---|
2539 | - nvk: Use nvk_buffer_addr_range for buffer descriptors
|
---|
2540 | - nvk: Re-order Vulkan 1.0 feature bits
|
---|
2541 | - nvk: Enable inheritedQueries
|
---|
2542 | - nvk: Enable VK_EXT_provoking_vertex
|
---|
2543 | - nvk: Advertise samplerMirrorClampToEdge via 1.2 features
|
---|
2544 | - nvk: Advertise VK_KHR_bind_memory2
|
---|
2545 | - nvk: Enable KHR_dynamic_rendering
|
---|
2546 | - nvk: Advertise KHR_uniform_buffer_standard_layout
|
---|
2547 | - nvk: Advertise EXT_index_type_uint8
|
---|
2548 | - nvk: Advertise VK_EXT_separate_stencil_usage
|
---|
2549 | - nvk: Capitalize NVK in user exposed strings
|
---|
2550 | - nvk: Rename grid_size to group_count
|
---|
2551 | - nvk: Lower load_num_workgroups ourselves
|
---|
2552 | - nvk: Drop block_size from the root descriptor table
|
---|
2553 | - nvk: Add a helper for loading resource_index-based descriptors
|
---|
2554 | - nvk: Set maxMemoryAllocationCount
|
---|
2555 | - nouveau/winsys: Take a drmDevicePtr in nouveau_ws_device_new()
|
---|
2556 | - nouveau/winsys: Add an info to nouveau_ws_device
|
---|
2557 | - nouveau/winsys: Move device type into nv_device_info
|
---|
2558 | - nouveau/nil: Take an nv_device_info for image functions
|
---|
2559 | - nouveau/nil: Use nv_device_info for format queries
|
---|
2560 | - nouveau/mme: Invoke SET_OBJECT in the tests
|
---|
2561 | - nouveau/mme: Make alu_op_to_str static
|
---|
2562 | - nouveau/mme: Move mme_value into its own header
|
---|
2563 | - nouveau/mme: Add a mme_reg_alloc struct
|
---|
2564 | - nouveau/mme: Add an intermediate MME_ALU_OP enum
|
---|
2565 | - nouveau/mme: Add an intermediate MME_CMP_OP enum
|
---|
2566 | - nouveau/mme: Use mme_mov() for temp copies of register IMM32 sources
|
---|
2567 | - nouveau/mme: Make helpers less Turing specific
|
---|
2568 | - nouveau/mme: Break the Turing builder guts into a separate header
|
---|
2569 | - nouveau/mme: Move the guts of mme_merge_to() into mme_tu104_builder.c
|
---|
2570 | - nouveau/mme: Move the guts of mme_state_arr_to() into mme_tu104_builder.c
|
---|
2571 | - nouveau/mme: Drop the implicit_imm parameter from mme_alu_to()
|
---|
2572 | - nouveau/mme: Move the cf_stack struct to mme_builder.h
|
---|
2573 | - nouveau/mme: Prepare the builder for multiple GPU generations
|
---|
2574 | - nouveau/mme: Take an nv_device_info in mme_builder_init
|
---|
2575 | - Support immediates in MERGE
|
---|
2576 | - Add add immediate optimizations
|
---|
2577 | - nvk: Add support for contiguous heaps to nvk_heap
|
---|
2578 | - nvk: Use a contiguous shader heap pre-Volta
|
---|
2579 | - nvk: Disable indirect draw/dispatch and query copy MMEs for now
|
---|
2580 | - nvk: Free a couple regs in nvk_mme_build_draw_*()
|
---|
2581 | - nvk: Properly align root descriptor tables for pre-Pascal
|
---|
2582 | - nvk: Compile all NIR before running codegen
|
---|
2583 | - vulkan/meta: Insert a geometry shader when needed
|
---|
2584 | - nvk: Use a GS for layered rendering pre-MaxwellB
|
---|
2585 | - nvk: Handle zero-size index and vertex buffers pre-Turing
|
---|
2586 | - nvk: Cosmetic clean-ups to Create/DestroyDevice
|
---|
2587 | - nil: Only choose a PTE kind for tiled images
|
---|
2588 | - nouveau/mme: Fix is_int18 for negative numbers
|
---|
2589 | - nouveau/mme: Don't swap x and y in mme_fermi_merge_to()
|
---|
2590 | - nouveau/mme: Take a const nv_device_info in mme_builder_init
|
---|
2591 | - nouveau/mme: Unify some of the test framework
|
---|
2592 | - nouveau/mme: Add some generic builder tests
|
---|
2593 | - nouveau/mme: Add builder tests for SUB
|
---|
2594 | - nouveau/mme: Use a uint32_t for size in mme_fermi_bfe()
|
---|
2595 | - nouveau/mme: Add builder tests for SLL and SRL
|
---|
2596 | - nvk/drm: Take a byte offset/range in push_add_push
|
---|
2597 | - nvk: Rework nvk_cmd_push a bit
|
---|
2598 | - nvk: Add a helper for pushing indirect data
|
---|
2599 | - nvk: Make some MME builder names more consistent
|
---|
2600 | - nouveau/mme: Don't allow WaW dependencies in the same Turing instruction
|
---|
2601 | - nvk: Reduce register pressure in nvk_mme_build_draw*()
|
---|
2602 | - nouveau/push: Add an NV_PUSH_MAX_COUNT #define
|
---|
2603 | - nvk: Implement Draw*Indirect on pre-Turing
|
---|
2604 | - vulkan/meta: Use the new NIR texture helpers
|
---|
2605 | - nvk: Add a build test for MMEs
|
---|
2606 | - nvk: Don't over-size push descriptor sets
|
---|
2607 | - nvk: Return VK_ERROR_INCOMPATIBLE_DRIVER if the PCI vendor isn't NVIDIA
|
---|
2608 | - nvk: Bump init context batch size
|
---|
2609 | - nouveau/mme: Fix nested while instructions on Turing+
|
---|
2610 | - nouveau/mme: Add a helper to dump instructions
|
---|
2611 | - nvk: Rework extension enables
|
---|
2612 | - nvk: Rework features enables
|
---|
2613 | - nvk: Advertise shaderImageGatherExtended
|
---|
2614 | - nouveau/mme: Add a bfe helper
|
---|
2615 | - nouveau/mme: Ensure that zero-initialized mme_value is ZERO
|
---|
2616 | - nvk: De-duplicate MME code for setting draw params
|
---|
2617 | - nvk: Clamp viewport clip to max range
|
---|
2618 | - nvk: Use the same lock for the submit and the memory objects list
|
---|
2619 | - nvk: Advertise ICD/loader interface version 4
|
---|
2620 | - nvk: Add instance WSI entrypoints
|
---|
2621 | - nouveau/mme: Use ADD for ine with an immediate
|
---|
2622 | - nouveau/mme: Fix while loops pre-Turing
|
---|
2623 | - nvk: Add begin to mme_scratch
|
---|
2624 | - nvk: Use the new load/store_scratch helpers for DRAW_PAD_DW
|
---|
2625 | - nouveau/mme: Add a helper for re-allocating registers
|
---|
2626 | - nvk: Rework spill helpers and DRAW_COUNT spilling
|
---|
2627 | - nvk: Spill DRAW_IDX pre-Turing
|
---|
2628 | - nvk: Break the inner MME draw loop into a helper
|
---|
2629 | - nvk: Increase the push runout to 512 dwords
|
---|
2630 | - nil: Add a nil_image_for_level helper
|
---|
2631 | - nil: Add an image_level_as_uncompressed helper
|
---|
2632 | - nvk: Implement uncompressed views of compressed images
|
---|
2633 | - nvk: Set pointClippingBehavior
|
---|
2634 | - nvk: Expose VK_KHR_maintenance2
|
---|
2635 | - nvk: Add a separate #define for SSBO alignment
|
---|
2636 | - nvk: Set spirv_to_nir_options::min_*_alignment
|
---|
2637 | - nvk: Use vk_device_memory
|
---|
2638 | - nvk: Implement VK_KHR_map_memory2
|
---|
2639 | - nvk: Sort SPIR-V caps
|
---|
2640 | - nvk: Advertise EXT_shader_viewport_index_layer on MaxwellB+
|
---|
2641 | - nvk: Only use view_id for layer in multiview
|
---|
2642 | - nvk/heap: Set the right pitch for heap resize copies
|
---|
2643 | - nvk: Advertise shaderStorageImageReadWithoutFormat
|
---|
2644 | - nvk: Fix the NO_PREFETCH assert for CmdDrawIndirect
|
---|
2645 | - nvk: Advertise KHR_spirv_1_4
|
---|
2646 | - nvk: s/device/dev in nvk_image.c
|
---|
2647 | - nvk: Add helpers for binding image planes
|
---|
2648 | - nvk: Take an nvk_image_plane in nouveau_copy_rect_image
|
---|
2649 | - nvk: Use the max descriptor alignment in GetDescriptorSetLayoutSupport
|
---|
2650 | - nvk: Use NVIDIA_VENDOR_ID in pdev try_create()
|
---|
2651 | - nvk: Use abbreviated names in nvk_device_memory.c
|
---|
2652 | - nvk: Add device and driver UUIDs
|
---|
2653 | - nvk: Add external memory queries
|
---|
2654 | - nvk: Dedicated allocations override internal
|
---|
2655 | - nvk: Require dedicated allocations for external images
|
---|
2656 | - nouveau/winsys: Add dma-buf import support
|
---|
2657 | - nvk: Support dma-buf import
|
---|
2658 | - nvk: Support dma-buf export
|
---|
2659 | - nvk: Enable external memory extensions
|
---|
2660 | - nvk: Reformat nvk_buffer.c
|
---|
2661 | - nvk: Add a buffer alignment helper
|
---|
2662 | - nvk: Add an addr field to nvk_image_plane
|
---|
2663 | - nvk: Use canonical variable names in nvk_physical_device.c
|
---|
2664 | - nvk: Use canonical variable names in nvk_shader.c
|
---|
2665 | - nvk: Use canonical variable names in nvk_bo_sync.c
|
---|
2666 | - nvk: Use canonical variable names in nvk_sampler.c
|
---|
2667 | - nvk: Drop nvk_physical_device::instance
|
---|
2668 | - nvk: Only advertise EXT_pci_bus_info on discrete GPUs
|
---|
2669 | - nouveau: Put PCI info in a pci substruct in nv_device_info
|
---|
2670 | - nouveau: Stop using hex for SM numbers
|
---|
2671 | - nvk: Set deviceType based on nv_device_info::type
|
---|
2672 | - nouveau: Move more stuff into nv_device_info
|
---|
2673 | - nouveau: Move gart_size to nv_device_info
|
---|
2674 | - nvk: Use nv_device_info for class checks
|
---|
2675 | - nvk: Rename nvk_device::ctx to ws_ctx
|
---|
2676 | - nvk: Add a ws_dev to nvk_device and use it
|
---|
2677 | - nvk: Move the winsys device to nvk_device
|
---|
2678 | - nvk: Don't enumerate pre-Kepler GPUs
|
---|
2679 | - nvk: Implement VK_EXT_physical_device_drm
|
---|
2680 | - nvk: Require an environment variable for poorly tested hardware
|
---|
2681 | - nvk: Use the new core vk_sampler struct
|
---|
2682 | - Revert "vulkan: Allow scissors or viewports to be set without counts"
|
---|
2683 | - vulkan/meta: Add a get_pipeline_layout helper
|
---|
2684 | - vulkan/meta: Use vk_meta_get_pipeline_layout in blit/resolve
|
---|
2685 | - nvk: Bind 3D depth/stencil images as 2D arrays
|
---|
2686 | - nvk: Flush more state on VI_BINDINGS_VALID dirty
|
---|
2687 | - nvk: Don't skip zero-size bindings in GetDescriptorSetLayoutSupport
|
---|
2688 | - docs: Add a docs page for NVK
|
---|
2689 | - docs: Add NVK to features.txt
|
---|
2690 | - docs/relnotes: Stick something about NVK in new_features.txt
|
---|
2691 | - nouveau: Drop GART size from nv_device_info
|
---|
2692 | - nil: Add a nil_image_level_extent_px() helper
|
---|
2693 | - nvk: Use the new NIL helper for image level extents for copies
|
---|
2694 | - nvk: Improve image format properties and limits
|
---|
2695 | - nvk: Rework multi-plane format features a bit
|
---|
2696 | - nvk: Use nvk_root_descriptor_offset for drawInfoBase
|
---|
2697 | - nvk: Add a root_desc_addr to the root descriptor table
|
---|
2698 | - nvk: Add support for variable pointers
|
---|
2699 | - nvk: Enable the SPIR-V DeviceGroup capability
|
---|
2700 | - nvk: Separate the MME query copy code out a bit
|
---|
2701 | - nvk: Implement CopyQueryPoolResults with a compute shader
|
---|
2702 | - nvk: Misc. style nits
|
---|
2703 | - nvk: Rework memory requirements to handle aspects correctly
|
---|
2704 | - nvk: Implement the maintenance5 image layout queries
|
---|
2705 | - nvk: Use VkBufferUsageFlags2
|
---|
2706 | - nvk: Implement CmdBindIndexBuffer2KHR
|
---|
2707 | - nvk: Implement GetRenderingAreaGranularityKHR
|
---|
2708 | - nvk: Decorate CmdBegin/EndRendering entrypoints
|
---|
2709 | - nouveau: Move shader topology info to nv_device_info
|
---|
2710 | - drm-uapi: Import nouveau_drm.h
|
---|
2711 | - nouveau/winsys: Use the imported nouveau_drm.h headers
|
---|
2712 | - nvk: Use the imported nouveau_drm.h headers
|
---|
2713 | - nouveau/shim: Use the imported nouveau_drm.h headers
|
---|
2714 | - nouveau/mme: Support the new UAPI
|
---|
2715 | - nvk: Use an empty EXEC for the empty submit case
|
---|
2716 | - nouveau/winsys: Allow nouveau_ws_device_new() without VM_BIND
|
---|
2717 | - nvk: Print an error message if VM_BIND support is missing
|
---|
2718 | - nvk: Enable the new UAPI
|
---|
2719 | - nvk: Use more consistent device variable names
|
---|
2720 | - nvk: Call nir_lower_int64
|
---|
2721 | - nir/gl: Move glsl_type::sampler_target() into a helper in its one caller
|
---|
2722 | - nvk: Remove plane sources from tex instructions
|
---|
2723 | - nvk: Use common physical device properties
|
---|
2724 | - nv50/ir: Rework conversions for texture array indices
|
---|
2725 | - clang-format: Add nir_foreach_reg_*
|
---|
2726 | - clang-format: nir_foreach_src is not a foreach macro
|
---|
2727 | - clang-format: Set the default ColumnLimit to 0
|
---|
2728 | - nir: Re-align a couple enums and add clang-format comments
|
---|
2729 | - nir: Don't clang-format const_value helpers
|
---|
2730 | - nir: Don't clang-format a couple typedefs
|
---|
2731 | - nir: Don't clang-format debug print setup
|
---|
2732 | - nir: More manual formatting
|
---|
2733 | - nir: Pretty format type mapping helpers
|
---|
2734 | - nir: Wrap pass macros in braces
|
---|
2735 | - nir: Add a do to the do/while in nir_const_value_t_array()
|
---|
2736 | - nir: Add a .clang-format file
|
---|
2737 | - nir: clang-format src/compiler/nir/\*.[ch]
|
---|
2738 | - nvk: Don't use nir_ssa_for_src()
|
---|
2739 | - nir: Drop most instances of nir_ssa_dest_init()
|
---|
2740 | - nir: Drop more instances of nir_ssa_dest_init()
|
---|
2741 | - nir/clone: Clone nir_def not nir_dest
|
---|
2742 | - nir/serialize: [De]serialize nir_def not nir_dest
|
---|
2743 | - nir: Drop nir_ssa_dest_init()
|
---|
2744 | - nir: Drop nir_ssa_dest_init_for_type()
|
---|
2745 | - nir: nir_foreach_ssa_def() -> nir_foreach_def()
|
---|
2746 | - st,zink,sfn: Use nir_foreach_def instead of nir_foreach_dest
|
---|
2747 | - dxil: Use nir_foreach_def() instead of nir_foreach_dest()
|
---|
2748 | - nir/from_ssa: Use nir_foreach_def() instead of nir_foreach_dest()
|
---|
2749 | - nir: Drop nir_foreach_dest()
|
---|
2750 | - intel/vec4: Stop passing around nir_dest
|
---|
2751 | - intel/fs: Stop passing around nir_dest and nir_alu_dest
|
---|
2752 | - broadcom: Stop using nir_dest directly
|
---|
2753 | - vc4: Stop passing around nir_dest
|
---|
2754 | - nir,ntt,a2xx,lima: Stop using nir_dest directly
|
---|
2755 | - lima: Stop using nir_dest directly
|
---|
2756 | - etnaviv: Stop passing around nir_dest
|
---|
2757 | - r600/sfn: Stop passing around nir_dest and nir_alu_dest
|
---|
2758 | - nv50/ir: Stop passing around nir_dest and nir_alu_dest
|
---|
2759 | - nir/gather_types: Stop passing around nir_dest
|
---|
2760 | - nir/dce: Stop passing around nir_dest
|
---|
2761 | - nir/propagate_invariant: Stop passing around nir_dest
|
---|
2762 | - nir/validate: Replace all dest validation with validate_def
|
---|
2763 | - nir/print: Replace all dest printing with print_def
|
---|
2764 | - nir: Get rid of nir_dest_bit_size()
|
---|
2765 | - nir: Get rid of nir_dest_num_components()
|
---|
2766 | - nir: Get rid of nir_dest_is_divergent()
|
---|
2767 | - nir: Drop nir_alu_dest
|
---|
2768 | - nir: Drop nir_dest
|
---|
2769 | - util/format: 8-bit interleaved YUV formats are UNORM
|
---|
2770 | - gallivm: Support G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM
|
---|
2771 | - blorp: Use R8G8_UINT for YCRCB_* formats with CCS
|
---|
2772 | - anv: Disable CCS_E for ISL_FORMAT_YCRCB_*
|
---|
2773 | - vulkan/format: Use correct swizzle for 1-plane YCbCr formats
|
---|
2774 | - gallivm: Drop the Vulkan YUV format hacks
|
---|
2775 | - nir: Rename nir_instr_type_ssa_undef to nir_instr_type_undef
|
---|
2776 | - nir s/nir_get_ssa_scalar/nir_get_scalar/
|
---|
2777 | - nir: s/live_ssa_def/live_def/
|
---|
2778 | - nir: s/nir_instr_ssa_def/nir_instr_def/
|
---|
2779 | - nir: Rework nir_scalar_chase_movs a bit
|
---|
2780 | - nir: Fix nir_op_mov handling in nir_collect_src_uniforms
|
---|
2781 | - nir: Handle nir_op_mov properly in opt_shrink_vectors
|
---|
2782 | - nir: Don't handle nir_op_mov in get_undef_mask in opt_undef
|
---|
2783 | - nir: Clean up nir_op_is_vec() and its callers
|
---|
2784 | - nir/large_constants: Use nir_component_mask_t
|
---|
2785 | - nir/large_constants: Add read/write_const_values helpers
|
---|
2786 | - nir/opt_large_constants: Add Small constant handling
|
---|
2787 | - spirv: Re-emit constants at their uses
|
---|
2788 | - nir: Take a nir_def * in nir_tex_instr_add_src()
|
---|
2789 | - nir: Take a nir_def * in nir_phi_instr_add_src()
|
---|
2790 | - nir/opt_undef: Don't rewrite a bcsel to mov
|
---|
2791 | - nir: Add a nir_instr_clear_src() helper and use it
|
---|
2792 | - nir: Add and use a nir_instr_init_src() helper
|
---|
2793 | - nir: Drop nir_if_rewrite_condition()
|
---|
2794 | - nir: Drop most uses of nir_instr_rewrite_src_ssa()
|
---|
2795 | - nir: Drop nir_instr_rewrite_src_ssa()
|
---|
2796 | - nir: Drop most uses of nir_instr_rewrite_src()
|
---|
2797 | - nir: Drop nir_instr_rewrite_src()
|
---|
2798 | - nir: Drop nir_push_if_src()
|
---|
2799 | - nir: Fix metadata in nir_lower_is_helper_invocation
|
---|
2800 | - nir: Use nir_shader_intrinsic_pass() a few places
|
---|
2801 | - drm-uapi: Sync nouveau_drm.h
|
---|
2802 | - nvk: Plumb no_prefetch through to the DRM back-end
|
---|
2803 | - nouveau/mme: Fix a compile warning
|
---|
2804 | - intel/isl: Rename ISL_TILING_Yf/s to ISL_TILING_SKL_Yf/s
|
---|
2805 | - intel/isl: Add ICL variants of Yf and Ys tiling
|
---|
2806 | - intel/isl: Implement correct tile size calculations for Ys/Yf
|
---|
2807 | - intel/isl: Use the depth field of phys_level0_sa for GFX4_2D 3D surfaces
|
---|
2808 | - intel/isl: Fill out the correct phys_total_extent for Ys/Yf/Tile64
|
---|
2809 | - intel/isl: Indent uncompressed surface code
|
---|
2810 | - intel/isl: Support Ys, Yf & Tile64 in isl_surf_get_uncompressed_surf
|
---|
2811 | - intel/isl: Support Yf/Ys tiling in surf_fill_state
|
---|
2812 | - intel/isl: Support Yf/Ys tiling in emit_depth_stencil_hiz
|
---|
2813 | - intel/isl: Add initial data-structure support for miptails
|
---|
2814 | - intel/isl: Add support for computing offsets with miptails
|
---|
2815 | - intel/isl: Support miptails in isl_surf_get_uncompressed_surf
|
---|
2816 | - intel/isl: Start using miptails
|
---|
2817 | - intel/isl: Disallow CCS on 3D surfaces with miptails
|
---|
2818 | - intel/isl: Allow Ys tiling
|
---|
2819 | - anv: Align memory VA to support for Ys, Tile64 tiled images
|
---|
2820 | - nvk: Clean up includes
|
---|
2821 | - nvk: Add include guards to nvk_bo_sync.h
|
---|
2822 | - nvk: SPDX everything
|
---|
2823 | - nouveau/nil: SPDX everything
|
---|
2824 | - nouveau/mme: SPDX everything
|
---|
2825 | - nvk: Don't add a dummy attachment when gl_SampleMask is written
|
---|
2826 | - nvk: Set the discard bit for Z/S self-deps
|
---|
2827 | - nvk: Invalidate the texture cache in PipelineBarrier
|
---|
2828 | - nvk: Lower interp_at_sample to interp_at_offset
|
---|
2829 | - nvk: Disable statistics around meta ops
|
---|
2830 | - nvk: Clean up viewport math
|
---|
2831 | - nvk: Fix depth clipping parameters
|
---|
2832 | - nvk: Enable dynamic clip/clamp enable
|
---|
2833 | - nvk: Set GUARDBAND_Z_SCALE_1 when Z-clipping
|
---|
2834 | - r600: Use more auto-generated nir_builder helpers
|
---|
2835 | - r600: Use nir_builder helpers for load/store_shared_r600
|
---|
2836 | - nvk: Re-order physical device limits
|
---|
2837 | - nvk: Advertise maxMemoryAllocationCount = 4096
|
---|
2838 | - nvk: Advertise discreteQueuePriorities = 2
|
---|
2839 | - nvk: Rip out old UAPI support
|
---|
2840 | - nvk/drm: Drop the push_add_push_bo() helper
|
---|
2841 | - nvk/drm: Drop the push_add_bo() helper
|
---|
2842 | - nvk: Drop command buffer BO tracking
|
---|
2843 | - nvk: Drop memory object tracking
|
---|
2844 | - nvk: Drop the device-level mutex
|
---|
2845 | - nvk: Get rid of the tiled memory allocation helpers
|
---|
2846 | - nvk/drm: Restructure nvk_queue_submit_drm_nouveau()
|
---|
2847 | - nvk/drm: Split exec as needed for large command buffers
|
---|
2848 | - nvk: Don't store the descriptor pool BO in the set
|
---|
2849 | - nvk: Store a 20-bit driver_build_sha in nvk_instance
|
---|
2850 | - nvk: Hook up the disk cache
|
---|
2851 | - nvk: Re-structure early shader compilation a bit
|
---|
2852 | - nvk: Add a default pipeline cache
|
---|
2853 | - nvk: Cache NIR shaders
|
---|
2854 | - nvk: Init pipelineCacheUUID
|
---|
2855 | - drm-uapi: Sync nouveau_drm.h
|
---|
2856 | - nvk: Take GETPARAM_EXEC_PUSH_MAX into account
|
---|
2857 | - nvk: Handle zero-sized sparse buffers
|
---|
2858 | - nvk: Use align() and align64() instead of ALIGN_POT
|
---|
2859 | - nouveau: Generate headers for Maxwell B compute
|
---|
2860 | - nvk: Add a nvk_cmd_buffer_compute_cls() helper
|
---|
2861 | - nvk: Invalidate sampler/texture header caches in BeginCommandBuffer()
|
---|
2862 | - nvk: Invalidate SKED caches at the top of command buffers
|
---|
2863 | - nvk: Advertise more inline uniform block limits
|
---|
2864 | - nvk: Emit MME_DMA_SYSMEMBAR before indirect draw/dispatch
|
---|
2865 | - nvk: Set max descriptors to 2^20 for most descriptor types
|
---|
2866 | - nvk: Reset descriptor pool allocator when all sets are destroyed
|
---|
2867 | - nil/format: Use A for alpha blend
|
---|
2868 | - nil/format: Advertise R10G10B10A2_UINT texture buffer support
|
---|
2869 | - nvk: Disable depth or stencil tests when unbound
|
---|
2870 | - nvk: Always emit at least one color attachment
|
---|
2871 | - nvk: Improve address space and buffer size limits
|
---|
2872 | - nvk: Always set pixel_min/max_Z to CLAMP
|
---|
2873 | - nvk: Use nouveau_ws_bo_unmap() instead of munmap()
|
---|
2874 | - nvk: Free the disk cache
|
---|
2875 | - nvk: Add an nvk_shader_finish() helper
|
---|
2876 | - nvk: Handle unbinding images and buffers
|
---|
2877 | - nvk: Clean up the disk cache on physical device create fail path
|
---|
2878 | - vulkan/wsi: Allow for larger linear images
|
---|
2879 | - nvk: Add a nvk_cmd_buffer_dirty_render_pass() helper
|
---|
2880 | - nvk: Re-sort device features
|
---|
2881 | - nvk: Implement VK_EXT_depth_bias_control
|
---|
2882 | - nvk: Advertise VK_KHR_workgroup_memory_explicit_layout
|
---|
2883 | - nvk: Implement VK_EXT_image_sliced_view_of_3d
|
---|
2884 | - nvk: Advertise VK_EXT_primitive_topology_list_restart
|
---|
2885 | - nvk: Advertise VK_EXT_attachment_feedback_loop_layout
|
---|
2886 | - features: Mark VK_EXT_attachment_feedback_loop_layout done for NVK
|
---|
2887 | - nvk: Re-arrange Vulkan 1.2 features to match the header
|
---|
2888 | - nvk: Advertise shaderOutputLayer and shaderOutputViewportIndex
|
---|
2889 | - nvk: Enable descriptorIndexing
|
---|
2890 | - nvk: Implement VK_EXT_dynamic_rendering_unused_attachments
|
---|
2891 | - nir: Add a nir_ssa_def_all_uses_are_fsat() helper
|
---|
2892 | - nir: Add convert_alu_types to divergence analysis
|
---|
2893 | - nir/lower_tex: Add a lower_txd_clamp option
|
---|
2894 | - nir: Add a load_sysval_nv intrinsic
|
---|
2895 | - nir: Add NV-specific texture opcodes
|
---|
2896 | - nir: Add a load_barycentric_at_offset_nv intrinsic
|
---|
2897 | - nir: Add a range to most I/O intrinsics
|
---|
2898 | - nir: Add NVIDIA-specific I/O intrinsics
|
---|
2899 | - nir/lower_bit_size: Fix subgroup lowering for floats
|
---|
2900 | - nir: add deref follower builder for casts.
|
---|
2901 | - nir: Handle wildcards with casts in copy_prop_vars
|
---|
2902 |
|
---|
2903 | Felix DeGrood (12):
|
---|
2904 |
|
---|
2905 | - anv: save a shader source uint32_t hash in gfx/compute pipelines
|
---|
2906 | - anv: Add Source hash field to VkPipelineExecutableStatisticKHR
|
---|
2907 | - iris: save shader source sha1 in ish
|
---|
2908 | - mesa: propagate shader source sha1 from gl_shader to nir_shader
|
---|
2909 | - intel: use shader source hash in INTEL_MEASURE
|
---|
2910 | - intel/compiler: use shader source hash in shader dump code
|
---|
2911 | - anv: add fake sparse support
|
---|
2912 | - anv: enable fake sparse for Elden Ring
|
---|
2913 | - anv: debug messaging for sparse texture usage
|
---|
2914 | - anv: fix frame count reporting in INTEL_MEASURE
|
---|
2915 | - anv: set ComputeMode.PixelAsyncComputeThreadLimit = 4
|
---|
2916 | - anv: remove CS_FLUSH from query regression
|
---|
2917 |
|
---|
2918 | Feng Jiang (9):
|
---|
2919 |
|
---|
2920 | - virgl: Only PIPE_BUFFER with VIRGL_BIND_CUSTOM flag is considered busy during creation
|
---|
2921 | - meson: Export winsys function symbols for target va
|
---|
2922 | - frontends/va: Add slice_count to AV1 slice_parameter
|
---|
2923 | - virgl/video: Add definition of virgl_av1_picture_desc
|
---|
2924 | - virgl/video: Add support for AV1 decoding
|
---|
2925 | - virgl/video: Enable AV1 decoding
|
---|
2926 | - meson: Rename dri-vdpau.dyn to dri.dyn
|
---|
2927 | - CODEOWNERS: Add \@flynnjiang for VirGL video
|
---|
2928 | - meson: Move video to separate section in meson configuration summary
|
---|
2929 |
|
---|
2930 | Filip Gawin (1):
|
---|
2931 |
|
---|
2932 | - crocus: Avoid fast-clear with incompatible view
|
---|
2933 |
|
---|
2934 | Flora Cui (1):
|
---|
2935 |
|
---|
2936 | - radeonsi: limit CP DMA to skip holes in sparse bo
|
---|
2937 |
|
---|
2938 | Francisco Jerez (29):
|
---|
2939 |
|
---|
2940 | - intel/fs/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
|
---|
2941 | - intel/vec4/ra: Define REG_CLASS_COUNT constant specifying the number of register classes.
|
---|
2942 | - intel/compiler: Make MAX_VGRF_SIZE macro depend on devinfo and update it for Xe2.
|
---|
2943 | - intel/fs/ra/xe2: Scale up register allocation granularity by 2x on Xe2+ platforms.
|
---|
2944 | - intel/eu/xe2+: Fix encoding of various message descriptors for change in register size.
|
---|
2945 | - intel/fs: Fix signedness of payload_node_count argument of calculate_payload_ranges().
|
---|
2946 | - intel/fs/xe2+: Fix payload node live range calculations for change in register size.
|
---|
2947 | - intel/fs/xe2+: Fix grf_count in post-RA scheduling for updated register file size.
|
---|
2948 | - intel/fs/xe2+: Fixes for increased accumulator register width.
|
---|
2949 | - intel/fs/xe2+: Scale MAX_SAMPLER_MESSAGE_SIZE by native register size.
|
---|
2950 | - intel/eu/xe2+: Update validation of GRF region size to account for Xe2 reg size
|
---|
2951 | - intel/fs/xe2+: Allow increased SIMD width for various get_fpu_lowered_simd_width() restrictions.
|
---|
2952 | - intel/compiler/xe2+: Represent dispatch_grf_start_reg in native GRF units.
|
---|
2953 | - intel/fs/xe2+: Update encoding of FB write message payload.
|
---|
2954 | - intel/fs/xe2+: Round up fs_builder::vgrf() size calculation to HW register unit.
|
---|
2955 | - intel/fs/xe2+: Scale BRW_MAX_MSG_LENGTH by native register size.
|
---|
2956 | - intel/fs/xe2+: Fix payload layout of sampler messages for Xe2 reg size
|
---|
2957 | - intel/fs/xe2+: Update GS payload setup for Xe2 reg size.
|
---|
2958 | - intel/fs/xe2+: Update TCS payload setup for Xe2 reg size.
|
---|
2959 | - intel/fs/xe2+: Update TES payload setup for Xe2 reg size.
|
---|
2960 | - intel/fs: Lower unsupported regioning with non-trivial 2D regions on FIXED_GRFs.
|
---|
2961 | - intel/fs/xe2+: Update regioning lowering offset alignment checks for Xe2 regs.
|
---|
2962 | - intel/fs/xe2+: Fix execution width of SHADER_OPCODE_GET_BUFFER_SIZE for SIMD16 EU.
|
---|
2963 | - intel/fs/xe2+: Fix calculation of spill message width for Xe2 regs.
|
---|
2964 | - intel/xe2+: Round up size to reg_unit() in fs_reg_alloc::alloc_spill_reg().
|
---|
2965 | - intel/fs/xe2+: Fix URB writes with 0 data components.
|
---|
2966 | - intel/fs: Specify number of data components of logical URB writes via control immediate.
|
---|
2967 | - intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB writes.
|
---|
2968 | - intel/fs: Delete manual 'inst->mlen' calculations from all uses of logical URB reads.
|
---|
2969 |
|
---|
2970 | Frank Binns (10):
|
---|
2971 |
|
---|
2972 | - pvr: clang-format fixes
|
---|
2973 | - pvr: skip setting up SPM consts buffer when no const shared regs are used
|
---|
2974 | - pvr: cleanup SPM EOT dynarray after upload
|
---|
2975 | - pvr: treat VK_IMAGE_CREATE_MUTABLE_FORMAT_BIT as not supported
|
---|
2976 | - pvr: remove pvr_pbe_get_src_pos()
|
---|
2977 | - pvr: fix attachments segfault in pvr_is_stencil_store_load_needed()
|
---|
2978 | - pvr: fix allocation size of clear colour consts shared regs buffer
|
---|
2979 | - pvr: change a few places to use PVR_DW_TO_BYTES()
|
---|
2980 | - pvr: fix setup of load op unresolved msaa mask
|
---|
2981 | - pvr: emit PPP state when vis_test dirty bit is set
|
---|
2982 |
|
---|
2983 | Friedrich Vock (19):
|
---|
2984 |
|
---|
2985 | - radv/ci: Set DRIVER_NAME in LAVA raven vkcts jobs
|
---|
2986 | - radv: Handle VK_SUBOPTIMAL_KHR in trace layers
|
---|
2987 | - ac/msgpack: make fixstrs a const char
|
---|
2988 | - ac/sqtt,radv: Split internal and API hash in PSO correlations
|
---|
2989 | - ac/rgp: Write lds_size metadata
|
---|
2990 | - ac/rgp: Add metadata for separate-compiled RT stages
|
---|
2991 | - radv/sqtt: Move record filling to helper function
|
---|
2992 | - radv/sqtt: Unregister records based on hash
|
---|
2993 | - radv/sqtt: Write LDS size metadata in code objects
|
---|
2994 | - radv/sqtt: Handle separately-compiled RT pipelines
|
---|
2995 | - ac/sqtt,radv/sqtt: Add and use marker for separate RT compilation
|
---|
2996 | - nir/load_store_vectorize: Handle intrinsics with constant base
|
---|
2997 | - radv/rt: Pre-initialize instance address
|
---|
2998 | - radv: Initialize shader freelist on allocation
|
---|
2999 | - radv: Fix check in insert_block
|
---|
3000 | - radv/rt: Reject hits within 10ULP of previous hits in emulated RT
|
---|
3001 | - radv/rra: Recognize LPDDR memory
|
---|
3002 | - radv/rmv: Recognize LPDDR memory
|
---|
3003 | - vulkan: Don't use set_foreach_remove when destroying pipeline caches
|
---|
3004 |
|
---|
3005 | Ganesh Belgur Ramachandra (5):
|
---|
3006 |
|
---|
3007 | - radeonsi: stores bottom_edge_rule option in the rasterizer state
|
---|
3008 | - radeonsi: sets OPTIMAL_BIN_SELECTION to 0 if using bottom_edge_rule
|
---|
3009 | - radeonsi: "clear_render_target" shader in nir
|
---|
3010 | - radeonsi: "clear_render_target_1d_array" shader in nir
|
---|
3011 | - radeonsi: "clear_12bytes_buffer" shader in nir
|
---|
3012 |
|
---|
3013 | Georg Lehmann (39):
|
---|
3014 |
|
---|
3015 | - aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
|
---|
3016 | - aco: improve get_gfx11_true16_mask description
|
---|
3017 | - aco: combine a & ~b to bfi(b, 0, a)
|
---|
3018 | - aco/gfx11: use v_cmp_class_f16 with opsel for bitnz/bitz
|
---|
3019 | - aco: fix non constant 16bit bitnz/bitz
|
---|
3020 | - ac/nir: handle more special cases in ac_nir_unpack_arg
|
---|
3021 | - aco: use s_bitreplicate_b64_b32 to set exec to 0xffff0000ffff0000
|
---|
3022 | - nir/opt_intrinsics: optimize (exclusive_scan(op, a) op a) to inclusive scan
|
---|
3023 | - aco: always use rtne for fquantize2f16
|
---|
3024 | - nir/opt_if: also rewrite uniform uses for read_invocation
|
---|
3025 | - nir: unify lower_bitfield_insert with has_{bfm,bfi,bitfield_select}
|
---|
3026 | - nir: unify lower_bitfield_extract with has_bfe
|
---|
3027 | - nir: unify lower_find_msb with has_{find_msb_rev,uclz}
|
---|
3028 | - aco: fix u2f16 with 32bit input
|
---|
3029 | - aco: combine a | ~b to bfi(b, a, -1)
|
---|
3030 | - aco: use v_cvt_f32_ubyte for signed casts too
|
---|
3031 | - nir: add nir_scalar intrinsic helpers
|
---|
3032 | - nir: add nir_scalar_equal
|
---|
3033 | - aco: implement some exclusive scans with inclusive scans
|
---|
3034 | - aco/gfx11: don't use bfe for local_invocation_id if the others are always 0
|
---|
3035 | - nir/opt_algebraic: remove broken fddx/fddy patterns
|
---|
3036 | - aco: simplify masked swizzle dpp selection by removing or_mask first
|
---|
3037 | - aco: fix p_extract with v1 dst and s1 operand
|
---|
3038 | - aco: implement 64bit div find_lsb
|
---|
3039 | - nir: scalarize masked_swizzle_amd created from shuffle_xor
|
---|
3040 | - aco/optimizer: check if we can use omod before labeling it
|
---|
3041 | - aco/optimizer: copy propagate to output modifier instructions
|
---|
3042 | - aco: remove -0.0 for 32 bit fsign with mul_legacy/omod when denorms are flushed
|
---|
3043 | - nir: make quad intrinsic dst bit size match src0
|
---|
3044 | - nir/lower_subgroups: use intrinsic builder more
|
---|
3045 | - aco: assume new generations are unsupported by clrx
|
---|
3046 | - aco: assume newer generation will use GFX11 wait_imm packing
|
---|
3047 | - aco: print final ir instead if printing asm is unsupported
|
---|
3048 | - aco/gfx11: optimize dual source export
|
---|
3049 | - aco/gfx11: apply clamp/omod to vinterp
|
---|
3050 | - aco: support v_fma_f32_dpp as fma_mix
|
---|
3051 | - aco/gfx11: support vinterp as fma_mix
|
---|
3052 | - aco: add missing scc def for SALU quad broadcast
|
---|
3053 | - aco/sched: treat p_dual_src_export_gfx11 like export
|
---|
3054 |
|
---|
3055 | George Ouzounoudis (38):
|
---|
3056 |
|
---|
3057 | - nouveau/codegen: Support compact clip distances with arrayed_io
|
---|
3058 | - nouveau/codegen: Handle nir op amul
|
---|
3059 | - nouveau/codegen: Fix compact patch varyings in case of NIR
|
---|
3060 | - nouveau/codegen: Add capability to pre-specify tessellation domain
|
---|
3061 | - nvk: Do not increment instance id across draws
|
---|
3062 | - nvk: Add a macro for root descriptor table byte offsets
|
---|
3063 | - nvk: Set base vertex state in sequential mme draw
|
---|
3064 | - nvk: Support base instance in instanced draw calls
|
---|
3065 | - nvk: Switch point rasterization to point sprites
|
---|
3066 | - nvk: Support large points
|
---|
3067 | - nvk: Compile geometry shaders
|
---|
3068 | - nouveau/mme: Keep device info in mme_builder
|
---|
3069 | - nvk: Simplify mme build function argument
|
---|
3070 | - nvk: Support VK_KHR_shader_draw_parameters
|
---|
3071 | - nvk: Support for vertex shader transform feedback
|
---|
3072 | - nvk: Support transform feedback indirect draws
|
---|
3073 | - nvk: Support transform feedback geometry streams
|
---|
3074 | - nvk: Support transform feedback queries
|
---|
3075 | - nvk: Support vertex shader transform feedback on Fermi
|
---|
3076 | - nvk: Disable PRIMITIVE_RESTART_VERTEX_ARRAY by default
|
---|
3077 | - nvk: Fix geometry shader active stream mask
|
---|
3078 | - nvk: Support geometry shaders
|
---|
3079 | - nvk: Basic tessellation shader support
|
---|
3080 | - nvk: Assign locations correctly for arrayed IO
|
---|
3081 | - nvk: Enable multiview with tessellation shader
|
---|
3082 | - nvk: Fix cases where execution mode is specified in the tesc shader.
|
---|
3083 | - nvk: Respect tessellation domain origin state
|
---|
3084 | - nvk: Lower io to temporaries for tessellation evaluation nir
|
---|
3085 | - nvk: Support VkDescriptorSetVariableDescriptorCountLayoutSupport
|
---|
3086 | - nvk: Handle cases of descriptor bindings with variable counts
|
---|
3087 | - nvk: Add nir non-uniform optimization pass
|
---|
3088 | - nvk: Enable descriptor indexing
|
---|
3089 | - nvk: Do not keep redundant info for tessellation domain
|
---|
3090 | - nouveau/codegen: Do not keep redundant info for tessellation domain
|
---|
3091 | - nvk: Enable dynamic line rasterization mode state
|
---|
3092 | - nvk: Fix support for VK_EXT_sample_locations
|
---|
3093 | - nvk: Support dynamic state for enabling sample locations
|
---|
3094 | - nouveau/codegen: Add a 4th optimization level for MemoryOpts
|
---|
3095 |
|
---|
3096 | Gert Wollny (63):
|
---|
3097 |
|
---|
3098 | - r600/sfn: Switch to register intrinsics
|
---|
3099 | - r600/sfn/tests: add simple copy-prop test with register source
|
---|
3100 | - r600/sfn: Allow for larger ALU CF's
|
---|
3101 | - r600/sfn: Handle indirect array load/store dependencies better
|
---|
3102 | - r600/sfn: Increase LDS fetch schedule priority
|
---|
3103 | - r600/sfn: Add peephole optimization to move a dest to the previous op
|
---|
3104 | - r600/sfn: reorder the value factory class member declaration a bit
|
---|
3105 | - r600/sfn: Add some tests for proper register access
|
---|
3106 | - r600/sfn: Print more info if scheduling fails
|
---|
3107 | - r600/sfn: remove debug output leftovers
|
---|
3108 | - r600/sfn: Fix use of multiple IDX with kcache
|
---|
3109 | - r600/sfn: Always check arrays writes before allowing copy propagation
|
---|
3110 | - r600/sfn: set block sizes based on chip class
|
---|
3111 | - r600/sfn: Fix typo with block type
|
---|
3112 | - r600/sfn: override slot count for IfInstr
|
---|
3113 | - r600/sfn: Add method to convert to AluGroup directly
|
---|
3114 | - r600/sfn: Add flags to check whether a group starts CF and can do that
|
---|
3115 | - r600/sfn: make remaining slots a signed value
|
---|
3116 | - r600/sfn: on Cayman loading an index register needs only one slot
|
---|
3117 | - r600/sfn: Split ALU blocks in scheduler to fit into 128 slots
|
---|
3118 | - r600/sfn: rework checks for ALU CF emission
|
---|
3119 | - r600/sfn: Schedule AR uses before possible groups
|
---|
3120 | - r600: Explicitly force new CF in gs copy shader
|
---|
3121 | - r600: Assert when backend wants to create a new ALU CF
|
---|
3122 | - r600: don't check possible size of ALU CF
|
---|
3123 | - r600: don't use sb disasm to disassemble copy shader
|
---|
3124 | - r600: Force CF when emitting a NOP on R600 in gs copy shader
|
---|
3125 | - r600/sfn: Don't try to propagate to vec4 with more than one use
|
---|
3126 | - r600/sfn: Only switch to other CF if no AR uses are pending
|
---|
3127 | - r600/sfn: AR loads should depend on all previous non ALU instructions
|
---|
3128 | - r600/sfn: Renumber shader blocks in scheduler
|
---|
3129 | - r600/sfn: Track whether a register is ALU clause local
|
---|
3130 | - r600/sfn: Use clause local registers in RA
|
---|
3131 | - r600/sfn: Take source uses into account when switching channels
|
---|
3132 | - r600/sfn: take number of dest values into account
|
---|
3133 | - r600: retire SB optimizer
|
---|
3134 | - r600/sfn: work around injecting extra CF's to handle hardware bugs
|
---|
3135 | - r600: use correct cso pointer for fetch shader
|
---|
3136 | - r600/sfn: Make use of four clause local registers
|
---|
3137 | - r600/sfn: drop unused ControlFlowInstr type enum
|
---|
3138 | - r600/sfn: factor out resource as extra class
|
---|
3139 | - r600/sfn: Simplify dependency chain for index loads on EG
|
---|
3140 | - r600: print texture resource index mode separately
|
---|
3141 | - r600/sfn: Make address split pass obligatory
|
---|
3142 | - r600/sfn: rename method resource_base to resource_id
|
---|
3143 | - r600/sfn: Add old address to update_indirect_addr
|
---|
3144 | - r600/sfn: Separate resource and sampler in texture instructions
|
---|
3145 | - r600/sfn: get rid of the method to get the index mode
|
---|
3146 | - r600/sfn: sort the uniforms of the right shader
|
---|
3147 | - r600/sfn: Fix use of scheduled_shader vs shader
|
---|
3148 | - virgl: report MIRROR_CLAMP features better
|
---|
3149 | - ci: Uprev virglrenderer
|
---|
3150 | - copyimage: check requested slice early when cube maps are involved
|
---|
3151 | - mesa: check numlevels and numlayers when creating a texture view
|
---|
3152 | - virgl: Use common clear_texture if host doesn't support the feature
|
---|
3153 | - r600/sfn: don't remove texture sources by using the enum value
|
---|
3154 | - r600: drop egcm_load_index_reg
|
---|
3155 | - r600/sfn: Don't override a chgr pinning during copy propagation
|
---|
3156 | - r600/sfn: When simplifying src vec4 pinnings, also check all uses
|
---|
3157 | - virgl: Fix logic for reporting PIPE_MIRROR_CLAMP
|
---|
3158 | - r600: Add callbacks for get_driver_uuid and get_device_uuid
|
---|
3159 | - r600: Link with libgalliumvl, when enabling rusticl this is needed
|
---|
3160 | - r600/sfn: Fixup component count only if intrinsic has it
|
---|
3161 |
|
---|
3162 | Guilherme Gallo (5):
|
---|
3163 |
|
---|
3164 | - bin/ci: Ensure that all jobs have nodes in DAG
|
---|
3165 | - ci/radeonsi: Update flake list
|
---|
3166 | - ci/freedreno: Add a new flake
|
---|
3167 | - ci/zink: Found some flakes
|
---|
3168 | - ci/anv: Catch some flakes
|
---|
3169 |
|
---|
3170 | Hannes Mann (1):
|
---|
3171 |
|
---|
3172 | - vulkan/wsi/wayland: Fix detection of tearing control protocol
|
---|
3173 |
|
---|
3174 | Hans-Kristian Arntzen (2):
|
---|
3175 |
|
---|
3176 | - wsi/x11: Fix potential deadlock in present ID.
|
---|
3177 | - wsi/x11: Don't allow signal_present_id to rewind.
|
---|
3178 |
|
---|
3179 | Helen Koike (21):
|
---|
3180 |
|
---|
3181 | - ci: re-add EXTRA_LOCAL_PACKAGES to rootfs
|
---|
3182 | - ci: add EXTRA_LOCAL_PACKAGES to apt-get install
|
---|
3183 | - docs/ci: Add docs for EXTRA_LOCAL_PACKAGES
|
---|
3184 | - ci: disable duplicated pipelines triggered by marge
|
---|
3185 | - ci: add --project option to ci_run_n_monitor.py
|
---|
3186 | - ci/android: remove strace output from cuttlefish-runner.sh
|
---|
3187 | - ci: add locked flag to bindgen-cli on x86_64_build.sh
|
---|
3188 | - ci: separate hidden jobs to -inc.yml files
|
---|
3189 | - ci/ci_run_n_monitor: add docs for multiple targets
|
---|
3190 | - ci/ci_run_n_monitor: print stress test results per job
|
---|
3191 | - ci/ci_run_n_monitor: simplify with defaultdict
|
---|
3192 | - ci/ci_run_n_monitor: merge print_job_status_change with print_job_status
|
---|
3193 | - ci/ci_run_n_monitor: make --target mandatory
|
---|
3194 | - ci/ci_run_n_monitor: merge enable_job with retry_job
|
---|
3195 | - ci/ci_run_n_monitor: simplify enable/cancel logic in monitor_pipeline()
|
---|
3196 | - ci/ci_run_n_monitor: allow <user>/<project> in --project
|
---|
3197 | - ci/ci_run_n_monitor: limit repetitions on --stress
|
---|
3198 | - ci/marge_queue: add missing python-dateutils to requirements.txt
|
---|
3199 | - ci/ci_run_n_monitor: keep monitoring if a job is still running
|
---|
3200 | - ci/marge_queue: add pretty_duration()
|
---|
3201 | - ci/ci_run_n_monitor: print job duration time
|
---|
3202 |
|
---|
3203 | Honglei Huang (7):
|
---|
3204 |
|
---|
3205 | - virgl/video: Add support for mpeg12 decoding
|
---|
3206 | - virgl/video: Add support for vc1 decoding
|
---|
3207 | - virgl/video: Add support for jpeg decoding
|
---|
3208 | - virgl/video: Add support for hevc10bit decoding.
|
---|
3209 | - virgl/video: Add more pipe type in virgl formats convert table
|
---|
3210 | - virgl/video: Add jpeg buf start code check
|
---|
3211 | - virgl: Enable vp9 hardware decode
|
---|
3212 |
|
---|
3213 | Hyunjun Ko (3):
|
---|
3214 |
|
---|
3215 | - anv: use ycbcr_info for P010 format
|
---|
3216 | - anv: don't use cmd_buffer after destroyed.
|
---|
3217 | - anv: don't flush_llc on gen9
|
---|
3218 |
|
---|
3219 | Iago Toral Quiroga (100):
|
---|
3220 |
|
---|
3221 | - nir/trivialize: Move decl_reg to the start of the block
|
---|
3222 | - v3dv: stop incrementing UBO indices by one
|
---|
3223 | - nir/lower_robustness: drop skip_ubo_0 option
|
---|
3224 | - v3dv: fix incorrect key setup
|
---|
3225 | - broadcom/compiler: stop asserting on Vulkan environment
|
---|
3226 | - broadcom/compiler: use NIR's lowering for dispatch base
|
---|
3227 | - broadcom/compiler: move uniform offset lowering from compiler to GL driver
|
---|
3228 | - broadcom/compiler: move vulkan's point coord lowering to the driver
|
---|
3229 | - v3dv: don't set lower_wpos_pntc for Vulkan
|
---|
3230 | - broadcom/compiler: always clamp results from logic ops
|
---|
3231 | - broadcom/compiler: drop execution environment from the shader key
|
---|
3232 | - v3dv: drop cpu path for buffer to image copies
|
---|
3233 | - v3dv: remove unused code
|
---|
3234 | - nir/lower_tex: copy backend_flags field when copying a tex instruction
|
---|
3235 | - nir/lower_tex: use a callback to check sampler return size packing
|
---|
3236 | - squash! v3dv,broadcom/compiler: don't abuse sampler index
|
---|
3237 | - v3dv: assert that only tex instructions with sampler state have a sampler src
|
---|
3238 | - v3d: fix texture packing lowering
|
---|
3239 | - v3d,v3dv: use fquantize2f16 lowering in NIR
|
---|
3240 | - v3dv: be more precise in vkGetImageSubresourceLayout
|
---|
3241 | - v3dv: handle pPlaneLayouts in VkImageDrmFormatModifierExplicitCreateInfoEXT
|
---|
3242 | - v3dv: bump up MAX_UNIFORM_BUFFERS to 16
|
---|
3243 | - v3dv: add support for sampling simple 2D linear textures
|
---|
3244 | - v3dv: expand sampling from linear image hack to support multi-planar images
|
---|
3245 | - v3dv: don't assume that bound descriptors have been written
|
---|
3246 | - v3dv: only handle Android Hardware Buffer on Android
|
---|
3247 | - v3dv: we can sample from 1D array too
|
---|
3248 | - broadcom/compiler: add a couple of shader key helpers
|
---|
3249 | - v3d: compute nir sha1 for uncompiled shader state
|
---|
3250 | - v3d: use pre-computed shader sha1 for disk cache
|
---|
3251 | - v3d: fix RAM shader cache
|
---|
3252 | - v3d: get rid of shader_state pointer in v3d_key
|
---|
3253 | - broadcom/simulator: reset CFG7 for compute dispatch in v71
|
---|
3254 | - broadcom/common: retrieve V3D revision number
|
---|
3255 | - broadcom/compiler: update node/temp translation for v71
|
---|
3256 | - broadcom/compiler: implement "reads/writes too soon" checks for v71
|
---|
3257 | - broadcom/compiler: implement read stall check for v71
|
---|
3258 | - broadcom/compiler: add a v3d71_qpu_writes_waddr_explicitly helper
|
---|
3259 | - broadcom/compiler: prevent rf2-3 usage in thread end delay slots for v71
|
---|
3260 | - broadcom/qpu: add new ADD opcodes for FMOV/MOV in v71
|
---|
3261 | - broadcom/qpu: fix packing/unpacking of fmov variants for v71
|
---|
3262 | - broadcom/compiler: make vir_write_rX return false on platforms without accums
|
---|
3263 | - broadcom/compiler: rename vir_writes_rX to vir_writes_rX_implicitly
|
---|
3264 | - broadcom/compiler: only handle accumulator classes if present
|
---|
3265 | - broadcom/compiler: don't assign rf0 to temps across implicit rf0 writes
|
---|
3266 | - broadcom/compiler: CS payload registers have changed in v71
|
---|
3267 | - broadcom/compiler: don't schedule rf0 writes right after ldvary
|
---|
3268 | - broadcom/compiler: allow instruction merges in v71
|
---|
3269 | - broadcom/qpu: add MOV integer packing/unpacking variants
|
---|
3270 | - broadcom/qpu: fail packing on unhandled mul pack/unpack
|
---|
3271 | - broadcom/compiler: generalize check for shaders using pixel center W
|
---|
3272 | - broadcom/compiler: v71 isn't affected by double-rounding of viewport X,Y coords
|
---|
3273 | - broadcom/compiler: update peripheral access restrictions for v71
|
---|
3274 | - broadcom/qpu: add packing for fmov on ADD alu
|
---|
3275 | - broadcom/compiler: handle rf0 flops storage restriction in v71
|
---|
3276 | - broadcom/compiler: enable ldvary pipelining on v71
|
---|
3277 | - broadcom/compiler: try to use ldunif(a) instead of ldunif(a)rf in v71
|
---|
3278 | - broadcom/compiler: don't assign rf0 to temps that conflict with ldvary
|
---|
3279 | - broadcom/compiler: convert mul to add when needed to allow merge
|
---|
3280 | - broadcom/compiler: implement small immediates for v71
|
---|
3281 | - broadcom/compiler: update thread end restrictions for v7.x
|
---|
3282 | - broadcom/compiler: update ldvary thread switch delay slot restriction for v7.x
|
---|
3283 | - broadcom/compiler: lift restriction for branch + msfign after setmsf for v7.x
|
---|
3284 | - broadcom/compiler: start allocating from RF 4 in V7.x
|
---|
3285 | - broadcom/compiler: validate restrictions after TLB Z write
|
---|
3286 | - broadcom/compiler: lift restriction on vpmwt in last instruction for V3D 7.x
|
---|
3287 | - broadcom/compiler: fix up copy propagation for v71
|
---|
3288 | - broadcom/compiler: don't allocate spill base to rf0 in V3D 7.x
|
---|
3289 | - broadcom/compiler: improve allocation for final program instructions
|
---|
3290 | - broadcom/compiler: don't assign registers to unused nodes/temps
|
---|
3291 | - broadcom/compiler: only assign rf0 as last resort in V3D 7.x
|
---|
3292 | - v3dv: expose V3D revision number in device name
|
---|
3293 | - v3dv/device: handle new rpi5 device (bcm2712)
|
---|
3294 | - v3dv: setup render pass color clears for any format bpp in v71
|
---|
3295 | - v3dv: setup TLB clear color for meta operations in v71
|
---|
3296 | - v3dv: fix up texture shader state for v71
|
---|
3297 | - v3dv: handle new texture state transfer functions in v71
|
---|
3298 | - v3dv: implement noop job for v71
|
---|
3299 | - v3dv: handle render pass global clear for v71
|
---|
3300 | - v3dv: GFX-1461 does not affect V3D 7.x
|
---|
3301 | - broadcom/compiler: update thread end restrictions validation for v71
|
---|
3302 | - v3dv: handle early Z/S clears for v71
|
---|
3303 | - v3dv: handle RTs with no color targets in v71
|
---|
3304 | - v3dv: don't convert floating point border colors in v71
|
---|
3305 | - v3dv: handle Z clipping in v71
|
---|
3306 | - v3dv: make v3dv_viewport_compute_xform depend on the V3D version
|
---|
3307 | - v3dv: fix depth clipping when Z scale is too small in V3D 7.x
|
---|
3308 | - v3d/v3dv: fix texture state array stride packing for V3D 7.1.5
|
---|
3309 | - v3d,v3dv: support up to 8 render targets in v7.1+
|
---|
3310 | - v3d,v3dv: don't use max internal bpp for tile sizing in V3D 7.x
|
---|
3311 | - v3d,v3dv: propagate NaNs bits in shader state records are reserved in v7.x
|
---|
3312 | - v3dv: use new texture shader state rb_swap and reverse fields in v3d 7.x
|
---|
3313 | - v3dv: fix color write mask for v3d 7.x
|
---|
3314 | - v3d,v3dv: fix depth bias for v3d 7.x
|
---|
3315 | - v3d,v3dv: fix compute for V3D 7.1.6+
|
---|
3316 | - v3dv: expose fullDrawIndexUint32 in V3D 7.x
|
---|
3317 | - v3dv: expose depthClamp in V3D 7.x
|
---|
3318 | - v3dv: expose scalarBlockLayout on V3D 7.x
|
---|
3319 | - v3dv: fix confusing nomenclature about DRM nodes
|
---|
3320 | - v3d,v3dv: fix MMU error from hardware prefetch after ldunifa
|
---|
3321 |
|
---|
3322 | Ian Douglas Scott (1):
|
---|
3323 |
|
---|
3324 | - egl/wayland: Don't segfault if `create_wl_buffer` returns `NULL`
|
---|
3325 |
|
---|
3326 | Ian Romanick (38):
|
---|
3327 |
|
---|
3328 | - intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
|
---|
3329 | - intel/fs: Constant fold SHL
|
---|
3330 | - intel/fs: Constant fold OR and AND
|
---|
3331 | - util/rb-tree: Return the actual first node from rb_tree_search
|
---|
3332 | - util/rb-tree: Fix typo in comment
|
---|
3333 | - nir/builder: Add nir_extract_i8_imm and nir_extract_u8_imm helpers
|
---|
3334 | - nir/algebraic: Remove redundant pack / unpack lowering patterns
|
---|
3335 | - intel/fs: Completely re-write the combine constants pass
|
---|
3336 | - intel/fs: Combine constants for SEL instructions too
|
---|
3337 | - intel/fs: Combine constants for integer instructions too
|
---|
3338 | - intel/fs: New VGRF packing scheme for constant combining
|
---|
3339 | - intel/compiler: Combine control barriers with identical memory semantics
|
---|
3340 | - intel/compiler: Don't evict for workgroup-scope fences
|
---|
3341 | - glsl/list: Clean up an inappropriate comment
|
---|
3342 | - util/rb-tree: Work around C++'s dislike of offsetof
|
---|
3343 | - util/rb-tree: Inline rb_tree_init
|
---|
3344 | - intel/fs: Don't continue fixed point iteration just because liveout changes
|
---|
3345 | - intel/fs: Don't try to copy propagate into a source again after progress is made
|
---|
3346 | - intel/fs: Make try_constant_propagate and try_copy_propagate file private
|
---|
3347 | - intel/fs: Move src.file checks out of try_constant_propagate and try_copy_propagate
|
---|
3348 | - intel/fs: Don't loop in try_constant_propagate
|
---|
3349 | - intel/fs: Simplify check in can_propagate_from
|
---|
3350 | - intel/fs: Make opt_copy_propagation_local file private
|
---|
3351 | - intel/fs: Encapsulate per-block ACP in a structure
|
---|
3352 | - intel/fs: Use rb_tree to store ACP entries by source
|
---|
3353 | - intel/fs: Use rb_tree to store ACP entries by destination
|
---|
3354 | - intel/fs: Use rb_tree for copy prop dataflow
|
---|
3355 | - intel/fs: Merge copy prop dataflow loops
|
---|
3356 | - intel/compiler/xe2: Update fs_visitor::setup_vs_payload to account for Xe2 reg size
|
---|
3357 | - intel/compiler/xe2: Use SIMD16 for nir_intrinsic_image_size
|
---|
3358 | - intel/compiler/xe2: TXD is lowered to SIMD16 in SIMD32 mode
|
---|
3359 | - nir/rematerialize: Rematerialize ALUs used only by compares with zero
|
---|
3360 | - intel/compiler/xe2: Handle new URB read messages
|
---|
3361 | - intel/compiler/xe2: Handle new URB write messages
|
---|
3362 | - intel/compiler/xe2: Update fs_visitor::emit_urb_writes to not assume SIMD8
|
---|
3363 | - spirv: Track when a shader has a cooperative matrix
|
---|
3364 | - intel/fs: Add DP4A to get_lowered_simd_width
|
---|
3365 | - nir/split_vars: Don't split arrays of cooperative matrix types
|
---|
3366 |
|
---|
3367 | Igor Torrente (4):
|
---|
3368 |
|
---|
3369 | - zink: Fix enumerate devices when running compositor
|
---|
3370 | - zink: Removes `disable_xcb_surface`
|
---|
3371 | - zink: Fix one additional case when running a compositor
|
---|
3372 | - zink: fix for startup crash of weston running on top of zink + venus
|
---|
3373 |
|
---|
3374 | Illia Abernikhin (2):
|
---|
3375 |
|
---|
3376 | - state_tracker: move initialisation of whandle out of the if statement (it was initialised inside the if statement but is also used outside)
|
---|
3377 | - i915: change format in dbg string (uintptr_t is of type unsigned long, but the debug line used the %d format specifier, which expects an int)
|
---|
3378 |
|
---|
3379 | Illia Polishchuk (7):
|
---|
3380 |
|
---|
3381 | - iris: remove NULL check for already dereferenced pointer earlier
|
---|
3382 | - anv: fix potentially overflowing expression in genX
|
---|
3383 | - glx: fix dead code when gc var cannot be null due to earlier check
|
---|
3384 | - state_tracker: fix dereference before null check
|
---|
3385 | - anv, drirc: Add workaround to speed up Cyberpunk 2077 reg allocation
|
---|
3386 | - zink: move find_sampler_var from zink to nir core
|
---|
3387 | - nir: fix invalid sampler search by texture id
|
---|
3388 |
|
---|
3389 | Italo Nicola (24):
|
---|
3390 |
|
---|
3391 | - mesa/main: account for RTT samples when updating framebuffer
|
---|
3392 | - mesa/main: allow readpix/teximage to read from implicitly multisampled fbos
|
---|
3393 | - panfrost/genxml: fix Surface With Stride descriptor alignment
|
---|
3394 | - panfrost/genxml: add Multiplanar Surface descriptor
|
---|
3395 | - panfrost: refactor (un)packing of surface descriptors
|
---|
3396 | - pan/decode: decode Multiplanar Surface descriptors
|
---|
3397 | - panfrost: prepare pan_image_view for multiplanar formats
|
---|
3398 | - panfrost: prepare the driver to support YUYV and variants
|
---|
3399 | - panfrost: advertise support for YUYV and variants
|
---|
3400 | - panfrost: mandate proper alignment requirement depending on format and arch
|
---|
3401 | - panfrost: add PAN_MESA_DEBUG=yuv for debugging yuv sampler
|
---|
3402 | - gallium/st: add non-CSC lowering of I420 as PIPE_FORMAT_R8_G8_B8_420
|
---|
3403 | - gallium/st: add non-CSC lowering of YV12 as PIPE_FORMAT_R8_B8_G8_420
|
---|
3404 | - pan/bi: add support for I420 and YV12 sampling
|
---|
3405 | - gallium/st: lower NV21 to R8_B8G8 instead of G8_B8R8
|
---|
3406 | - panfrost: fix invalid memory access in get_equation_str()
|
---|
3407 | - pan/decode: handle more than one panfrost_device
|
---|
3408 | - panfrost/ci: updated CI expectations
|
---|
3409 | - egl: reenable partial redraw with a warning when using gallium hud
|
---|
3410 | - pan/genxml: add Width/Height fields to v9+ Plane descriptor
|
---|
3411 | - panfrost: rename _needs_multiplanar_descriptor to _is_yuv
|
---|
3412 | - panfrost: prepare v9+ to support YUV sampling
|
---|
3413 | - panfrost: use centered YUV chroma siting
|
---|
3414 | - panfrost: advertise YUV formats for valhall
|
---|
3415 |
|
---|
3416 | Iván Briano (23):
|
---|
3417 |
|
---|
3418 | - anv: ensure CFE_STATE is emitted for ray tracing pipelines
|
---|
3419 | - iris: ensure mesh is disabled on context init
|
---|
3420 | - anv: ensure mesh is disabled on context init
|
---|
3421 | - anv: implement Wa_14019750404
|
---|
3422 | - intel/compiler: call brw_nir_adjust_payload from brw_postprocess_nir
|
---|
3423 | - anv,hasvk: respect provoking vertex setting on geometry shaders
|
---|
3424 | - anv: fix missing 3DSTATE_SBE_CLIP emission
|
---|
3425 | - anv: ensure pipelines have all state
|
---|
3426 | - anv: tell blorp to do mesh stuff only if it's enabled
|
---|
3427 | - blorp: fix hangs with mesh enabled
|
---|
3428 | - anv: use a simpler MUE layout for fast linked libraries
|
---|
3429 | - anv: track what kind of pipeline a fragment shader may be used with
|
---|
3430 | - intel/fs: read viewport and layer from the FS payload
|
---|
3431 | - intel/fs: handle URB setup for fast linked mesh pipelines
|
---|
3432 | - anv: enable VK_EXT_mesh_shader where supported
|
---|
3433 | - intel/fs: use ffsll so we don't explode on 32 bits
|
---|
3434 | - vulkan/runtime: add internal parameter to vk_spirv_to_nir
|
---|
3435 | - nir/lower_int64: respect rounding mode when casting to float
|
---|
3436 | - intel/compiler: round f2f16 correctly for RTNE case
|
---|
3437 | - util: add double_to_float16 helpers
|
---|
3438 | - nir: round f2f16{_rtne/_rtz} correctly for constant expressions
|
---|
3439 | - anv: advertise VK_KHR_global_priority_queue
|
---|
3440 | - anv: use the right vertexOffset on CmdDrawMultiIndexed
|
---|
3441 |
|
---|
3442 | Jani Nikula (1):
|
---|
3443 |
|
---|
3444 | - docs/vulkan: fixup some typos
|
---|
3445 |
|
---|
3446 | Janne Grunau (4):
|
---|
3447 |
|
---|
3448 | - asahi: toggle more barrier bits after transform feedback
|
---|
3449 | - asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
|
---|
3450 | - asahi,agx: Upload constant buffers immediately
|
---|
3451 | - asahi: decode: Fix uint64_t format modifiers in agxdecode_stateful()
|
---|
3452 |
|
---|
3453 | Jesse Natalie (2):
|
---|
3454 |
|
---|
3455 | - nir_lower_mem_access_bit_sizes: Fix write-mask-constrained 3-byte stores as atomics
|
---|
3456 | - d3d12: Fix multidimensional array ordering
|
---|
3457 |
|
---|
3458 | Jianxun Zhang (1):
|
---|
3459 |
|
---|
3460 | - intel/common: Only set op mask on instructions in decoder
|
---|
3461 |
|
---|
3462 | Jonathan Marek (2):
|
---|
3463 |
|
---|
3464 | - freedreno: move redump.h to common code + cleanup
|
---|
3465 | - tu: add a TU_DEBUG=rd option for cmdstream dumping
|
---|
3466 |
|
---|
3467 | Jordan Justen (73):
|
---|
3468 |
|
---|
3469 | - isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
|
---|
3470 | - anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
|
---|
3471 | - genxml/hsw: Add additional MOCS field enumerations
|
---|
3472 | - genxml/chv: Add MEMORY_OBJECT_CONTROL_STATE_CHV to document compared to BDW
|
---|
3473 | - isl/dev: Add uncached MOCS value
|
---|
3474 | - isl: Set MOCS to uncached for MTL stream-out
|
---|
3475 | - intel/isl: Use intel_needs_workaround() for MTL CCS WA
|
---|
3476 | - intel/compiler: Use nir SUBGROUP_INVOCATION for RT TOPOLOGY_ID
|
---|
3477 | - intel/dev: Add LNL platform enum
|
---|
3478 | - intel/dev: Support xe2 device init (for intel_device_info_test)
|
---|
3479 | - intel/tools: Use 'env bash' to find bash executable
|
---|
3480 | - intel/decoder: Fix xml filename when verx10 % 10 is not 0
|
---|
3481 | - intel/decoder: Add intel_spec_load_common()
|
---|
3482 | - intel/decoder: Make intel_spec_load_filename() have separate dir and name strings
|
---|
3483 | - intel/genxml: Align "Texture Coordinate Mode" naming
|
---|
3484 | - intel/genxml: Split some genxml sorting code into a intel_genxml module
|
---|
3485 | - intel/genxml: Convert gen_bits_header to use ElementTree
|
---|
3486 | - intel/genxml: Convert gen_pack_header to use ElementTree
|
---|
3487 | - intel/genxml: Add GenXml class into intel_genxml module
|
---|
3488 | - intel/genxml: Add filter_engines() to GenXml class
|
---|
3489 | - intel/genxml: Move sorting & writing into GenXml class
|
---|
3490 | - intel/genxml: Don't rewrite sorted xml if the contents didn't change
|
---|
3491 | - intel/genxml: Add final newline to output when saving xml
|
---|
3492 | - intel/genxml: Update xml with gen_sort_tags.py output
|
---|
3493 | - intel/dev: Use RPL-U name on RPL-U devices
|
---|
3494 | - intel/dev: Add more RPL PCI IDs
|
---|
3495 | - anvil,hasvk: Rename need_clflush to need_flush
|
---|
3496 | - intel/common: Move intel_clflush.h to intel_mem.h/intel_mem.c
|
---|
3497 | - anvil,hasvk: Replace intel_clflush_range with intel_flush_range
|
---|
3498 | - intel/common: Add intel_flush_range_no_fence
|
---|
3499 | - anvil,hasvk: Use intel_flush_range_no_fence to flush command buffers
|
---|
3500 | - util/u_cpu_detect: Drop unused has_tsc
|
---|
3501 | - util/u_cpu_detect: Detect clflushopt support
|
---|
3502 | - meson: Check for the __builtin_ia32_clflushopt function
|
---|
3503 | - intel/clflush: Add support for clflushopt instruction
|
---|
3504 | - intel/dev/xe: Move placeholder subslice info into XEHP_FEATURES
|
---|
3505 | - intel/genxml: Ignore tail leading/trailing whitespace in node_validator()
|
---|
3506 | - intel/genxml: Fix comparing xml when node counts differ
|
---|
3507 | - intel/dev: Update device string for MTL PCI ID 0x7d55
|
---|
3508 | - intel/genxml: Support importing from another genxml file
|
---|
3509 | - intel/genxml: Add support for excluding items when importing
|
---|
3510 | - intel/genxml: Add all xml files as pack dependencies
|
---|
3511 | - intel/genxml: Add GenXml.optimize_xml_import()
|
---|
3512 | - intel/genxml: Drop assertion to allow for importing
|
---|
3513 | - intel/genxml: Add GenXml.add_xml_imports method
|
---|
3514 | - intel/genxml: Add GenXml.flatten_xml() method
|
---|
3515 | - intel/genxml: Add genxml_import.py script
|
---|
3516 | - intel/decoder: ralloc_steal() values from spec context for fields and enums
|
---|
3517 | - intel/decoder: Implement support for importing genxml
|
---|
3518 | - intel/genxml: Start Xe2 support
|
---|
3519 | - intel/genxml: Auto-import genxml files using genxml_import.py
|
---|
3520 | - intel/common: Add sse2_args for 32-bit build when -Dsse2=false was set
|
---|
3521 | - intel/compiler/fs: Support Xe2 reg size in assign_curb_setup
|
---|
3522 | - intel/compiler: Update opt_split_sends() for Xe2 reg size
|
---|
3523 | - intel/compiler: Update emit_rt_lsc_fence() for Xe2
|
---|
3524 | - intel/compiler: Update lower_trace_ray_logical_send() for Xe2
|
---|
3525 | - intel/compiler: Update ray-tracing intrinsic lowering for Xe2
|
---|
3526 | - intel/compiler: Update RT stack_id access for Xe2
|
---|
3527 | - intel/fs: Update SSBO & shared uniform block loads for Xe2
|
---|
3528 | - intel/genxml: Build with gen20.xml
|
---|
3529 | - intel/isl: Build for Xe2
|
---|
3530 | - iris: Build for Xe2
|
---|
3531 | - anv/blorp: Use anv_genX to set device->blorp.exec
|
---|
3532 | - anv: Disable Ray Tracing on xe2 until our compiler supports Xe2 RT
|
---|
3533 | - anv: Build for Xe2
|
---|
3534 | - anv: Print warning that Xe2 is not supported rather than failing
|
---|
3535 | - intel/compiler: Add enum xe2_lsc_cache_store
|
---|
3536 | - intel/compiler: Use enum xe2_lsc_cache_store on xe2
|
---|
3537 | - intel/compiler: Add enum xe2_lsc_cache_load
|
---|
3538 | - intel/compiler: Use enum xe2_lsc_cache_load on xe2
|
---|
3539 | - anv/batch: Check if batch already has an error in anv_queue_submit_simple_batch()
|
---|
3540 | - anv/batch: Assert that extend_cb is non-NULL if the batch is out of space
|
---|
3541 | - intel/dev: Add 0x56ba-0x56bd DG2 PCI IDs
|
---|
3542 |
|
---|
3543 | Jose Maria Casanova Crespo (2):
|
---|
3544 |
|
---|
3545 | - vc4: mark buffers as initialized at vc4_texture_subdata
|
---|
3546 | - vc4: Fix mask RGBA validation at YUV blit
|
---|
3547 |
|
---|
3548 | José Expósito (3):
|
---|
3549 |
|
---|
3550 | - zink: Fix crash on zink_create_screen error path
|
---|
3551 | - zink: fix dereference before NULL check
|
---|
3552 | - zink: allow software rendering only if selected
|
---|
3553 |
|
---|
3554 | José Roberto de Souza (51):
|
---|
3555 |
|
---|
3556 | - anv: Use workaround framework to Wa_14016118574
|
---|
3557 | - intel/aux_map: Nuke format_enum
|
---|
3558 | - intel/aux_map: Use get_aux_entry() in remove_mapping()
|
---|
3559 | - intel/aux_map: Replace magic number by INTEL_AUX_MAP_ENTRY_VALID_BIT
|
---|
3560 | - intel/aux_map: Rename some variables to improve readability
|
---|
3561 | - intel/aux_map: Mask out bits above index 47 in intel_aux_get_meta_address_mask()
|
---|
3562 | - intel/aux_map: Convert l1_entry_addr_out to canonical
|
---|
3563 | - intel/aux_map: Drop magic sub table size number
|
---|
3564 | - intel/aux_map: Add function and macro to return l2 and l1 table masks
|
---|
3565 | - anv: Add gem_create_userptr() to KMD backend
|
---|
3566 | - anv: Replace handle by anv_bo in the gem_close()
|
---|
3567 | - anv: Add support for userptr in Xe KMD
|
---|
3568 | - intel: Sync xe_drm.h
|
---|
3569 | - intel/dev/xe: Add support for small-bar setups
|
---|
3570 | - anv: Request Xe KMD to place BOs to CPU visible VRAM when required
|
---|
3571 | - iris: Request Xe KMD to place BOs to CPU visible VRAM when required
|
---|
3572 | - iris/xe: Call iris_lost_context_state() when batch engine is replaced
|
---|
3573 | - intel/dev: Port intel_dev_info tool to Xe KMD
|
---|
3574 | - iris: Replace I915_EXEC_FENCE_SIGNAL by IRIS_BATCH_FENCE_SIGNAL in common code
|
---|
3575 | - intel: Move i915_drm.h specific code from common/intel_gem.h to common/i915/intel_gem.h
|
---|
3576 | - intel/common: Move functions inside of C++ ifdef
|
---|
3577 | - intel: Rename intel_gem_add_ext() to intel_i915_gem_add_ext()
|
---|
3578 | - iris: Move i915_gem_set_domain() call to i915 backend
|
---|
3579 | - iris: Move iris_bufmgr_bo_close() to kmd backend
|
---|
3580 | - iris: Add gem_create_userptr() to KMD backend
|
---|
3581 | - iris: Add support for userptr in Xe KMD
|
---|
3582 | - intel/genxml/gen125: Add missing fields in MI_MATH
|
---|
3583 | - iris: Set MI_MATH MOCS field
|
---|
3584 | - anv: Set MI_MATH MOCS field
|
---|
3585 | - intel/tests/mi_builder: Set MI_MATH MOCS field
|
---|
3586 | - intel/genxml/gen125: Set MI_MATH MOCS field as non-zero
|
---|
3587 | - anv: Nuke unused READ_ONCE() from anv_batch_chain.c
|
---|
3588 | - anv: Remove VkAllocationCallbacks parameter from reloc functions
|
---|
3589 | - anv: Return earlier in anv_reloc_list functions
|
---|
3590 | - intel: Sync xe_drm.h and rename engine to exec_queue
|
---|
3591 | - anv: Override vendorID for Hogwarts Legacy
|
---|
3592 | - intel/isl: Remove unknown workaround
|
---|
3593 | - intel/isl: Remove Wa_22011186057
|
---|
3594 | - anv: Update Wa_16014390852 for MTL
|
---|
3595 | - intel: Sync xe_drm.h
|
---|
3596 | - anv: Move i915 specific gem_set_caching to backend
|
---|
3597 | - anv: Move i915 specific code from common anv_gem.c
|
---|
3598 | - anv: Move bo_alloc_flags_to_bo_flags() to backend
|
---|
3599 | - anv: Move i915 handling of imported bos bo_flags
|
---|
3600 | - anv: Remove i915_drm.h include from common code
|
---|
3601 | - iris: Lock bufmgr->lock before calling vma_free() in error path
|
---|
3602 | - iris: Nuke useless flags from iris_fine_fence_new()
|
---|
3603 | - intel: Prepare implementation of Wa_18019816803 and Wa_16013994831 for future platforms
|
---|
3604 | - intel: Sync xe_drm.h
|
---|
3605 | - anv: Switch Xe KMD vm bind to sync
|
---|
3606 | - anv: Add missing ANV_BO_ALLOC_EXTERNAL flags when calling anv_device_import_bo()
|
---|
3607 |
|
---|
3608 | Juan A. Suarez Romero (7):
|
---|
3609 |
|
---|
3610 | - broadcom/ci: update expected results
|
---|
3611 | - vc4/ci: update expected results
|
---|
3612 | - v3d/shim: include new ioctl parameters
|
---|
3613 | - v3dv/ci: update expected list
|
---|
3614 | - broadcom: add performance counters for V3D 7.x
|
---|
3615 | - broadcom/simulator: add per-hw version calls
|
---|
3616 | - v3d/vc4/ci: add new fails/timeout
|
---|
3617 |
|
---|
3618 | Julia Tatz (10):
|
---|
3619 |
|
---|
3620 | - gallium/dri: fix dri2_from_names
|
---|
3621 | - aux/trace: skip multi-line comments in enums2names
|
---|
3622 | - aux/trace: deduplicate enum dump macro work
|
---|
3623 | - aux/trace: move trace_sample_view logic
|
---|
3624 | - aux/trace: fix set_hw_atomic_buffers method name
|
---|
3625 | - aux/trace: add screen video methods
|
---|
3626 | - aux/trace: add context video methods
|
---|
3627 | - aux/trace: wrap video_codec & video_buffer
|
---|
3628 | - aux/trace: unwrap reference frames in picture_desc
|
---|
3629 | - aux/trace: trace video_buffer method return vals
|
---|
3630 |
|
---|
3631 | Julia Zhang (1):
|
---|
3632 |
|
---|
3633 | - radeonsi: modify algorithm of skipping holes of sparse bo
|
---|
3634 |
|
---|
3635 | Julian Hagemeister (1):
|
---|
3636 |
|
---|
3637 | - Gallium: Fix shared memory segment leak
|
---|
3638 |
|
---|
3639 | Juston Li (10):
|
---|
3640 |
|
---|
3641 | - zink: remove venus from renderpass optimizations
|
---|
3642 | - venus: sync protocol for VK_EXT_vertex_input_dynamic_state
|
---|
3643 | - venus: implement VK_EXT_vertex_input_dynamic_state
|
---|
3644 | - venus: set lvp queries as saturate on overflow
|
---|
3645 | - venus: add helper function to get cmd handle
|
---|
3646 | - venus: refactor out common cmd feedback functions
|
---|
- venus: support deferred query feedback recording
- venus: track/recycle appended query feedback cmds
- venus: append query feedback at submission time
- venus: switch to unconditionally deferred query feedback

Kai Wasserbäch (3):

- fix: clover: LLVM 18 renamed/moved CGFT_*, update compat layer
- fix: clover: LLVM 18: s/CodeGenOpt::/CodeGenOptLevel::/
- fix: clover: warning: ignoring return value of ‘int posix_memalign(…)’ [-Wunused-result]

Karmjit Mahil (29):

- pvr: Remove mrt setup from SPM EOT
- pvr: Compile SPM EOT shader
- pvr: Use the SPM EOT on barrier stores
- pvr: Remove some magic numbers and increments from km stream
- pvr: Restructure \`rogue_kmd_stream.xml`
- pvr: Submit PR commands
- pvr: Use the correct size for the unified store allocation
- pvr: Allow query stage for barrier sub cmds
- pvr: Fix occlusion query unaccounted for user fences
- pvr: Fix writing query availability write out
- pvr: Fix packing issue with max_{x,y}_clip
- pvr: Fix csb relocation status assert on \`pvr_csb_finish()`
- pvr: Fix \`for` loop itarator usage
- pvr: Fix dynamic desc offset storage
- pvr: Fix cubemap layer stride
- pvr: Use the render passes' attachments array to setup ISP state
- pvr: Adjust EOT PBE state to account for the iview's base array layer
- pvr: Fix MRT index in PBE state
- pvr: Fix pbe_emit assert
- pvr: Fix OOB access of pbe_{cs,reg}_words
- pvr: Order tile buffer EOT emits to be last
- pvr: Fix subpass sample count on ds attachment only
- pvr: Refactor subpass ds and sample count setup
- pvr: Fix SPM load shader sample rate
- pvr: Fix PPP_SCREEN sizes
- vulkan: Add \`vk_subpass_dependency_is_fb_local()` helper
- tu: Use common \`vk_subpass_dependency_is_fb_local()`
- pvr: Don't merge subpasses on framebuffer-global dependancy
- pvr: Only setup the bgobj to load if we have a load_op

Karol Herbst (213):

- nvc0: initial Ada enablement
- rusticl/mesa: make svm_migrate optional
- llvmpipe: enable system SVM
- nvc0: fix num_gprs for Volta+
- rusticl: fix warnings with newer rustc
- gm107/ir: fix SULDP for loads without a known format
- nv50/ir/nir: fix txq emission on MS textures
- nv50/ir/nir: Fix zero source handling of tex instructions.
- rusticl/kernel: only handle function_temp memory before lowering printf
- meson,ci: bump meson req for rusticl to 1.2
- rusticl/nir: add helper functions we need for a NIR_PASS macro
- rusticl/nir: add a nir_pass macro
- rusticl/nir: use the new nir_pass macro
- rusticl/kernel: rename res to internal_args inside lower_and_optimize_nir_late
- rusticl/kernel: merge lower_and_optimize_nir_pre_inputs and lower_and_optimize_nir_late
- rusticl/kernel: move things around in lower_and_optimize_nir
- rusticl/kernel: get rid of initial function_temp type lowering
- rusticl/kernel: mark can_remove_var as unsafe and document it
- n50/compute: submit initial compute state in nv50_screen_create
- nvk: add vulkan skeleton
- nouveau/winsys: add the new winsys implementation
- nvk: use winsys lib
- nvk: fix nvk_buffer include guards
- nouveau/headers: add script to sync in-tree headers with open-gpu-doc
- nouveau/headers: initial sync of headers
- nvk: implement GetPhysicalDeviceQueueFamilyProperties2 to make the CTS happy
- nvk: advertize memory heaps and types
- nouveau/ws: reorganize a little
- nouveau/ws: dup the fd
- nouveau/ws: add a field for the SM version
- nvk: set nonCoherentAtomSize as the CTS divides with this value
- nouveau/ws: add bo API
- nvk: add basic device memory support
- nouveau/headers: add nvtypes.h
- nouveau/headers: typedef Nv void types
- nouveau/headers: add host classes
- nouveau/ws: add context support
- nouveau/ws: add a cmd buffer
- novueau/bo: refcount it
- novueau/bo: add nouveau_ws_bo_wait
- nvk: allocate a GPU context for each VkDevice
- nvk: add nvk_bo_sync
- nvk: add nvk_CmdPipelineBarrier2 stub
- nvk: impl nvk_CmdCopyBuffer
- nouveau/ws: fix setting push bo domains
- nouveau/ws: PUSH_IMMD only works with 16 bit values
- nouveau/ws: set GPU object class
- nouveau/ws: bind 2D class
- nvk: use fermi class definitions
- nvk: add basic support for images
- nvk: simple format table
- nvk: add support for blits
- nvk: report maxMipLevels as 1
- nvk: optimize blit command buffer gen
- nvk: implement CmdFillBuffer
- nvk: implement CmdUpdateBuffer
- nvk: implement CmdCopyBuffer2
- nvk: advertise VK_KHR_copy_commands2
- nvk: implicitly reset the command buffer
- nouveau/ws: handle 0inc inside nvk_push_val as well
- nvk: reduce pitch even further in CmdFillBuffer
- nvk: support multiple miplevels
- nvk: support array blits over multiple layers
- nvk: tiling prep work for VK_EXT_image_2d_view_of_3d
- nouveau/ws: make sure we don't submit nonsense
- nouveau/ws: assert on broken channel
- nvk/blit: assert that formats are supported
- nouveau/headers: Generate parser functions
- nouveau/ws: initial debugging options for command submissions
- nouveau/ws: depend on generated class header files
- nouveau/ws: get rid of libdrm
- nouveau/ws: use new NVIF interface to query oclasses
- nvk: set deviceName
- nouveau/headers: add path for 3D headers
- nouveau/headers: initial 3D headers import
- nouveau/ws: allocate 3D subchan
- nouveau/ws: allocate copy subchan as well
- nouveau/ws: add API to query if the context was killed
- nouveau/ws: add a bo unmap helper function
- nvk: clean up bo mappings
- nouveau/ws: bound check nouveau_ws_push_append
- nouveau/ws: rework refing push buffer bos
- nouveau/ws: push chaining
- nvk: fix OOB read inside nvk_get_va_format
- nvk: alloc a zero page and use it for vertex runouts
- nvk: fix zero page refing
- nvk: support exporting buffers
- nvk: fix some class version checks
- nvk: properly align shaders pre Turing
- nvk: rework QMD handling to support pre Turing
- nvk: align desc root table
- nvk: Use SET_PIPELINE_PROGRAM pre-Volta
- nvk: properly align slm size
- nvk: use remaps for image copies
- nvk: reduce pitch for FillBuffer
- nvk: bind more subchans in init_context_state
- nvk: support pre Maxwell Texture Headers
- nvk/device: fix order of error handling
- nvk: allocate VAB memory area
- nvk: wire up M2MF for Fermi
- nouveau/mme: add test for BEQ with magic exit offset
- nouveau/mme: add a macro exit helper
- nvk: Add a macro to set MMIO registers via falcons
- nouveau/winsys: fix SM value for Ada
- nvk: fix num_gprs for Volta+
- nvk: replace mp with tpc
- nvk: properly calculate SLM region by taking per arch limits into account
- nouveau: fix max_warps_per_mp_for_sm for builds with asserts disabled
- nvk: enable fp helper invocations loads on more gens
- nv50/ir: use own info struct for sys vals
- nv50/ir: convert system values to gl_system_value
- nouveau/mme: fix OOB access inside while_ine builder test
- nouveau/mme: fix OOB inside tu104 simulator
- clc: use CLANG_RESOURCE_DIR for clang's resource path
- nv50: fix code uploads bigger than 0x10000 bytes
- nouveau: take glsl_type ref unconditionally
- rusticl/kernel: optimize nir between lowering io and explicit types
- nv50: limit max code uploads to 0x8000
- zink: fix source type in load/store scratch
- zink: fix global stores
- zink: update some compute caps
- rusticl: add debug option to sync every event
- rusticl/device: _MAX_CONST_BUFFER0_SIZE is unsigned
- ci: disable a660 jobs
- nir: make workgroup_id 32 bit only
- nir: make num_workgroups 32 bit only
- ac: drop 64 bit handling for cl workgroup intrinsics
- gallivm/nir: drop 64 bit handling for cl workgroup intrinsics
- intel/compiler: drop 64 bit handling for cl workgroup intrinsics
- panfrost: drop 64 bit handling for cl workgroup intrinsics
- rusticl: reduce global_invocation_id_zero_base to 32 bit
- panfrost: drop pan_nir_lower_64bit_intrin
- rusticl/disk_cache: fix stack corruption
- rusticl/query: fix use-after-free, but also fix incorrect usage of unsafe
- rusticl/event: disable profiling for devices without timestamps
- rusticl/queue: properly implement clCreateCommandQueueWithProperties
- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
- rusticl/memory: only specify PIPE_BIND_SHADER_IMAGE where supported
- asahi: fetch available system memory
- asahi: lower hadd
- asahi: handle kernels
- asahi: handle load_workgroup_size
- asahi: handle load_global_invocation_id_zero_base
- asahi: implement get_compute_state_info
- asahi: implement set_global_binding
- asahi: implement clear_buffer
- asahi: gracefully handle allocating linear images
- asahi: handle images in is_format_supported
- rusticl/memory: fallback if allocating linear images fails
- rusticl: enable asahi
- rusticl/mesa: create contexts with PIPE_CONTEXT_NO_LOD_BIAS
- docs/features: cl_khr_3d_image_writes needs driver support
- rusticl/mesa: fix \`set_constant_buffer` when passing an empty buffer
- rusticl/kernel: skip adding global id offsets if not used
- meson/rusticl: add sha1_h
- rusticl/mesa/context: fix clear_sampler_views
- nir: add nir_lower_alu_vec8_16_srcs pass
- zink: lower vec8/16
- rusticl/mesa: create COMPUTE_ONLY contexts
- rusticl: fix clippys bool_to_int_with_if
- rusticl/memory: fix potential use-after-free in clEnqueueSVMMemFill
- nir/load_libclc: fix libclc memory leak
- rusticl/kernel: Fix creation from programs not built for every device
- ci: add half-life 2 freedreno flake
- zink: implement get_compute_state_info
- zink: copy has_variable_shared_mem cs property
- zink: pass entire pipe_grid_info into zink_program_update_compute_pipeline_state
- zink: refactor spec constant handling
- zink: variable shared mem support
- zink: support more nir opcodes
- zink: make spirv_builder_emit_*op compatible with spec constants
- zink: support samplers with unnormalized_coords
- zink: implement remaining pack ops via bitcast
- zink: fix RA textures
- zink: fix load/store scratch offsets
- rusticl/mesa/screen,device: add driver_name
- rusticl: enable zink
- pipe-loader: allow to load multiple zink devices
- rusticl: bump rustc version to 1.66
- rusticl/mesa/nir: mark more methods as mut
- rusticl/mesa/nir: Mark NirShader and NirPrintfInfo as Send and Sync
- rusticl/mesa: mark PipeResource as Send and Sync
- rusticl/mesa: mark PipeTransfer as Send
- rusticl/cl: mark _cl_image_desc as Send and Sync
- rusticl/queue: get rid of pointless Option around our worker thread handle
- rusticl/queue: make it Sync
- rusticl/kernel: get rid of Arcs in KernelDevStateVariant
- rusticl/memory: use get_mut instead of lock in drop
- zink: implement PIPE_COMPUTE_CAP_MAX_COMPUTE_UNITS
- rusticl/api: remove cl_closure macro
- zink: implement load_global_constant
- zink: properly emit PhysicalStorageBufferAddresses cap
- nir/lower_mem_access_bit_sizes: fix invalid shift bit_size
- rusticl/device: restrict 1Dbuffer images for RGB and RGBx
- rusticl/memory: use PIPE_BUFFER for IMAGE1D_BUFFER images
- rusticl/format: disable all sRGB formats
- asahi: flush denorms on exact fmin/fmax
- zink: wrap shared memory blocks in a struct
- zink: properly alias shared memory
- zink: fix zink_destroy_screen for early screen creation fails
- docs/features: remove empty lines confusing mesamatrix
- rusticl/device: restrict image_buffer_size
- rusticl/device: restrict param_max_size further
- rusticl/mem: properly set pipe_image_view::access
- zink: lower fisnormal as it requires the Kernel Cap
- radv: fix buffers in vkGetDescriptorEXT with size not aligned to 4
- rusticl/queue: Only take a weak ref to the last Event
- rusticl/mesa: pass PIPE_BIND_LINEAR in resource_create_texture_from_user
- zink: deallocate global_bindings array
- rusticl/mesa/screen: do not derefence the entire pipe_screen struct
- nvc0: implement PIPE_CAP_TIMER_RESOLUTION
- rusticl/queue: do not send empty lists of event to worker queue
- rusticl/queue: fix implicit flushing of queue dependencies

Kenneth Graunke (21):

- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
- intel/compiler: Fix sparse cube map array coordinate lowering
- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL for DEBUG_OPTIMIZER
- intel/fs: Account for payload GRFs when calculating register pressure
- intel/compiler: Move SCHEDULE_NONE handling into schedule_instructions()
- intel/fs: Index scheduler mode string table by mode enum
- intel/fs: Make helpers for saving/restoring instruction order
- intel/fs: Pick the lowest register pressure schedule when spilling
- intel/fs: Dump IR for pre-RA scheduler modes in DEBUG_OPTIMIZER
- iris: Check prog[] instead of uncompiled[] for BLORP state skipping
- nir: Fix function parameter indentation in nir_opt_barriers.c
- nir: Add an optimization pass to reduce barrier modes
- nir: Reduce the scope of shared memory barriers
- lavapipe: Don't delete control barriers
- virgl, nir_to_tgsi: Add a hack for promoting partial memory barriers
- dxil: Set UAV_FENCE_THREAD_GROUP any time global isn't required
- glsl: Use nir_opt_barrier_modes() to drop unnecessary barriers
- anv: Use nir_opt_barrier_modes() to drop unnecessary barriers
- mesa: Fix zeroing of new ParameterValues array entries when growing
- intel/fs: Fix Xe2 URB read/lowering with per-slot offsets
- anv: Add support for a transfer queue on Alchemist

Kevron Rees (1):

- Force vk vendor for spider-man remastered

Konrad Dybcio (5):

- freedreno: Set magic writes per-GPU, using existing data
- freedreno: Include speedbin fallback in 740 chipid to fix probing
- freedreno: Include speedbin fallback in 730 chipid to fix probing
- freedreno: Include speedbin fallback in 690 chipid to fix probing
- freedreno: Add Adreno 643

Konstantin Seurer (95):

- radv: Stop using the misleading round_up_u* functions
- radv/meta_buffer: Stop setting RADV_META_SAVE_DESCRIPTORS
- radv/meta_buffer: Rename size_minus16 to max_offset
- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
- nir/tests: Refactor boilerplate into a common header
- nir/tests: Use a single binary
- draw: Do not restart the primitive_id at 0
- gallivm: Fix subsampled format sampling under Vulkan
- gallivm: Ignore nir_tex_src_plane
- lavapipe: Remove dummy sampler ycbcr conversion
- lavapipe: Store immutable_samplers as lvp_sampler array
- lavapipe: Fix binding immutable samplers with desc buffers
- lavapipe: Implement samplerYcbcrConversion
- lavapipe: Advertise samplerYcbcrConversion
- llvmpipe: Zero extend vectors in widen_to_simd_width
- vulkan: Add a generated vk_properties struct
- radv: Use common physical device properties
- clang-format: Disable formatting by default
- lavapipe: Use common physical device properties
- nir/from_ssa: Don't insert store_reg instructions before phis
- gallivm: Run nir_convert_to_lcssa before nir_convert_from_ssa
- lavapipe/ci: Remove descriptor_indexing fails
- radv/rt: Rename shader_pc and next_shader
- radv/rt: Rename traversal_shader to traversal_shader_addr
- nir/opt_large_constants: Handle small float arrays
- bin: Update spirv sources
- vulkan: Allow beta extensions for physical device features
- vulkan: Allow beta extensions for physical device properties
- vulkan Add enqueue entrypoint for CmdDispatchGraphAMDX
- nir: Add shader enqueue data structures and handling
- spirv: Update headers and grammer JSON
- spirv: Implement SPV_AMDX_shader_enqueue
- lavapipe: Add lvp_pipeline_type
- lavapipe: Implement exec graph pipelines
- lavapipe: Implement AMDX_shader_enqueue commands
- lavapipe: Advertise AMDX_shader_enqueue
- radv: Add internal_nodes_offset to scratch_layout
- radv: Remove leaf_args::dst_offset
- radv/rt: Remove some dead code
- radv/rt: Do not apply stack_ptr for non-recursive stages
- radv/rt: Add and use radv_build_traversal
- radv/rt: Insert rt_return_amd before lowering shader calls
- radv/rt: Split stage initialization and hashing
- aco: Do not fixup registers if there are no shader calls
- radv: Stop updating the stack_size in insert_rt_case
- lavapipe: Lock around CSO destroys
- vulkan/wsi/x11: Implement capture hotkey using the keymap
- venus: Use the common GetPhysicalDeviceFeatures2 implementation
- nir/lower_shader_calls: Limit the remat chain length
- lavapipe: Avoid lowering shaders twice
- lavapipe: Fix the locking around cso destruction
- aco/validate: Handle p_wqm like p_parallelcopy
- aco: Use bytes() instead of size() in emit_wqm
- aco: Unify demote and demote_if selection
- radv: Only generate debug info if required
- aco/lower_to_cssa: Fix typo
- radv: Don't use the depth image view for depth bias emission
- radv/rt: Store NIR shaders separately
- radv/rt: Add monolithic raygen lowering
- radv/rt: Enable monolithic pipelines
- radv/ci: Document new flake
- vulkan/properties: Handle unsized arrays properly
- radv: Remove dead radix_sort_vk_get_memory_requirements call
- radv/radix_sort: Vendor the radix sort dispatch code
- radv: Perform multiple sorts in parallel
- radv/ci: Improve ray tracing skips
- ac/llvm: Fix typed loads with 16bit formats
- ac/llvm: Use the correct return type for uadd_carry and usub_borrow
- ac/llvm: Use float types for float atomics
- radv: Don't advertise features requiring PS epilogs with LLVM
- radv: Update navi21 llvm fails
- radv/rt: Handle stages without nir properly
- radv: Remove ray tracing shader module identifier skips
- radv/bvh: Treat instances with mask == 0 as inactive
- radv/ray_queries: Skip cull_mask handling if it is FF
- radv/rt: Skip cull_mask handling if it is FF
- aco/spill: Make sure that offset stays in bounds
- nir: Add nir_cf_node_cf_tree_prev
- nir: Add nir_foreach_block_in_cf_node_reverse
- nir: Add nir_rematerialize_deref_in_use_blocks
- nir/lcssa: Fix rematerializing derefs
- nir/deref: Layer rematerialization helpers
- lavapipe/ci: Fix asan expectations
- hasvk: Use the common GetPhysicalDeviceFeatures2 implementation
- vulkan: Remove vk_get_physical_device_core_1_*_feature_ext
- radv/bvh/ploc: Load child bounds from LDS
- radv: Merge the sync_data and header initialization
- radv: Do not sync after radv_update_buffer_cp
- zink: Initialize primitive types to an invalid value
- nir/passthrough_gs: Support edge flags with points
- zink: Enable edge flags with points
- mesa: Fix glBegin/End when LINE_LOOP is not supported
- llvmpipe: Compile a nop texture function for unsupported configurations
- radv/rt: Use nir_shader_instructions_pass for lower_rt_instructions
- radv/sqtt: Fix tracing acceleration structure commands

Lang Yu (5):

- amd/common: add AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 property
- radeonsi: use AMD_CODE_PROPERTY_ENABLE_WAVEFRONT_SIZE32 to determine wave size
- radeonsi: use wave size to determine index stride
- amd/common: add missing stuff for gfx11.5
- amd/radeonsi: add missing stuff for gfx11.5

Leandro Ribeiro (13):

- egl: rewrite outdated comment in _eglFindDevice()
- egl: remove unused parameter from _eglAddDRMDevice()
- egl: simplify _eglAddDRMDevice()
- egl: make explicit that we don't support render nodes for software EGLDevice
- egl: move is_render_node flag to platform_wayland
- loader: rename loader_open_render_node() to loader_open_render_node_platform_device()
- loader: add driver list as parameter in loader_open_render_node_platform_device()
- pipe-loader: add pipe_loader_get_compatible_render_capable_device_fd()
- dri: add queryCompatibleRenderOnlyDeviceFd() to __DRI_MESA extension
- kmsro: try to use only compatible render-capable devices
- loader: add loader_is_device_render_capable()
- egl/drm: get compatible render-only device fd for kms-only device
- egl: error out if we can't find an EGLDevice in _eglFindDevice()

Leo Liu (4):

- radeonsi: add AV1 profile to supported profile
- radeonsi/vcn: fix the incorrect dt_size
- Revert "frontends/va: Also map VAImageBufferType for reading"
- ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG

Lina Versace (14):

- docs: Add row for VK_KHR_maintenance5
- intel/pci_ids: Consistently use lowercase
- venus: Sync protocol for VK_EXT_graphics_pipeline_library
- venus: Erase pViewports and pScissors in fewer cases
- venus: Fix crash when VkGraphicsPipelineCreateInfo::layout is missing
- venus: Fix subpass attachments
- venus: Drop incorrectly-used always-true pipeline vars
- venus: Use VkImageAspectFlags in vn_subpass
- venus: Add enum vn_pipeline_type
- venus: Renames for VkGraphicsPipelineCreateInfo fixes
- venus: Refactor pipeline fixup into two stages
- venus: Do pipeline fixes for VK_EXT_graphics_pipeline_library
- venus: Enable VK_EXT_graphics_pipeline_library behind debug flag
- venus: Fix -Wmaybe-uninitialized

LingMan (22):

- rusticl/memory: fix potential use-after-free in clEnqueueSVMFree
- rusticl: Rename XyzCB aliases to FuncXyzCB
- rusticl: add structs to hold the C callbacks
- rusticl: use CreateContextCB
- rusticl: use DeleteContextCB
- rusticl: use EventCB
- rusticl: use MemCB
- rusticl: use ProgramCB
- rusticl: use SVMFreeCb
- rusticl: Make EventSig take ownership of its environment
- rusticl: add a safe abstraction to execute a DeleteContextCB
- rusticl: add a safe abstraction to execute an EventCB
- rusticl: add a safe abstraction to execute a MemCB
- rusticl: add a safe abstraction to execute an SVMFreeCb
- rusticl: add a safe abstraction to execute a CreateContextCB
- rusticl: add a safe abstraction to execute a ProgramCB
- rusticl/api: drop a few include paths
- rusticl: mark the fields of callback structs private
- rusticl: drop an \`#[allow(dead_code)]` marker
- rusticl/core: don't take a lock while dropping \`Context`
- rusticl: Show an error message if the build is attempted with an outdated bindgen version
- rusticl: Show an error message if the version of bindgen can't be detected

Lionel Landwerlin (169):

- anv: hide exec_flags selection inside the i915 backend
- isl: add a tool to query surface parameters
- intel/fs: fix missing predicate on SEL instruction
- intel/compiler: rework input parameters
- ci/a530: switch a few tests to flakes to unblock CI
- vulkan: bump header register to 1.3.258
- intel/fs: don't try to rebuild sequences of non ssa values
- intel/vec4: fix log_data pointer
- intel/fs: consider UNDEF as non-partial write
- intel/fs: add more UNDEFs around SEND messages
- isl: add ability to store buffer size in unused RENDER_SURFACE_STATE fields
- anv: simplify buffer address+size loads from descriptor buffer
- intel/fs: add support for sparse accesses
- intel/nir: handle image_sparse_load in storage format lowering
- intel/nir: add lower for sparse images & textures
- anv: wire image sparse loads
- blorp: switch blorp_update_clear_color to early return
- blorp: update and move fast clear PIPE_CONTROLs to drivers
- anv: fix 3DSTATE_RASTER::APIMode field setting
- anv: enable EDS3 ConservativeRasterizationMode
- vulkan: skip non required extension structures
- vulkan/runtime: add a layered implementation of vkCmdBindIndexBuffer
- anv: enable INTEL_DEBUG=nofc
- anv: fake non intel vendorID for Death Stranding
- hasvk: fix null descriptor handling with A64 messages
- anv: remove descriptor array bounds checking
- hasvk: remove descriptor array bounds checking
- anv/hasvk: track robustness per pipeline stage
- anv: implement VK_EXT_pipeline_robustness
- intel/fs: track more steps with INTEL_DEBUG=optimizer
- intel/fs: add variable for output of debug backend optimizer
- intel/decoder: constify some input parameters
- blorp: drop programming of 3DSTATE_(MESH|TASK)_SHADER
- anv: emit 3DSTATE_GS only once per pipeline
- intel/decoder: add options to decode surfaces/samplers
- anv: get rid of genX(emit_multisample)
- anv: move genX(rasterization_mode) to gfx8_cmd_buffer.c
- anv: don't try to access dynamic buffers from surface states
- iris: ensure stalling pipe control before fast clear
- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
- intel/compiler: fix dynamic alpha-to-coverage handling
- intel/fs: implement dynamic interpolation mode for dynamic persample shaders
- intel/fs: move lower of non-uniform at_sample barycentric to NIR
- zink+anv: add regression testing with pipeline libraries
- anv: implement vkCmdBindIndexBuffer2KHR
- anv: handle new VkBufferViewUsageCreateInfoKHR
- anv: add vkGetRenderingAreaGranularityKHR()
- anv: implement GetDeviceImageSubresourceLayoutKHR/GetImageSubresourceLayout2KHR
- anv: add maintenance5 A8_UNORM/A1B5G5R5_UNORM support
- anv: deal with new pipeline flags
- anv: enable KHR_maintenance5
- anv: add missing ISL storage usage
- genxml/gfx11: remove Tiled Resource Mode field from HIER_DEPTH_BUFFER
- genxml/gfx12: rename Tiled Resource Mode
- isl: program 3DSTATE_HIER_DEPTH_BUFFER_BODY::TiledMode as documented
- intel/isl: Disallow Yf, Ys and Tile64 for 3D depth/stencil surfaces
- isl: disable Yf/Ys/Tile64 tilings for 1D images
- isl: add a usage flag to request 2D/3D compatible views
- isl: disallow TileYs/Yf on 3D storage images on Gfx9/11
- intel/isl: Add a max_miptail_levels field to isl_tile_info
- isl: make isl_surf_get_uncompressed_surf robust to argument accesses
- isl: add Gfx12/12.5 restriction on 3D surfaces & compression
- isl: disallow miptails on planar formats
- isl: disable miptails on gfx12 with yuv formats
- isl: disable CCS on Ys/Yf
- blorp: allow 3D blits/copies on Ys/Yf/Tile64 tiling
- intel/aux_map: correctly program tiling mode for Ys
- isl: reorder tiling selection
- anv: enable standard Y tiles
- isl/tilememcpy_test: add multiple tile testing
- anv: rename total_batch_size
- anv: reuse cmd_buffer::total_batch_size
- intel/measure: track batch buffer sizes
- intel/nir: rerun lower_tex if it lowers something
- intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL
- hasvk: add state cache invalidation back before fast clears
- blorp: remove unused variable
- anv: remove ReorderMode from pipeline 3DSTATE_GS emission
- anv: change anv_batch_emit_merge to also do packing
- intel/anv: batch stats util
- intel/decoder: implement accumulated prints
- anv: move all dynamic state emission to cmd_buffer_flush_dynamic_state
- anv: rename files to represent their usage
- anv: categorize partial/final pipeline instruction
- anv: split 3DSTATE_TE packing between static & dynamic parts
- anv: split 3DSTATE_VFG emission
- anv: add a flag tracking occlusion query count change
- anv: split pipeline programming into instructions
- vulkan/runtime: add helper to name dirty states
|
---|
4204 | - anv: add new low level emission & dirty state tracking
|
---|
4205 | - anv: remove unused state emission
|
---|
4206 | - anv: split BLEND_STATE packing from BLEND_STATE_POINTERS emit
|
---|
4207 | - docs: update Anv documentation about dynamic state emission
|
---|
4208 | - anv: create individual logical engines on i915 when possible
|
---|
4209 | - anv: Copy/Clear MSAA images over companion RCS while we are on compute
|
---|
4210 | - pps-producer: add ability to select device with DRI_PRIME
|
---|
4211 | - anv: remove aux checking asserts
|
---|
4212 | - anv: bound image usages to the associated queue family
|
---|
- anv: fix 3DSTATE_VFG emission
- anv: emit 3DSTATE_URB_ALLOC_(MESH|TASK) only when mesh shaders are enabled
- anv: ensure mesh pipeline have all pre-rasterization stages disabled
- anv: ensure partially packed instructions are emitted in the pipeline
- anv: fix missing 3DSTATE_SBE_MESH emission
- anv: fix utrace timestamp buffer copies
- anv: add a memcpy compute internal kernel
- anv: add simple shader support without a command buffer
- anv: move simple shaders code to its own object
- anv: move utrace flush out of backends
- anv: enable utrace timestamp buffer copies on compute engine
- intel: don't assume Linux minor dev node
- intel/ds: lock submissions to u_trace_context
- util/u_trace: count number of tracepoints
- intel/ds: track number of tracepoint timestamp copies
- anv/utrace: trace CPU on timestamp buffer readiness
- intel/ds: avoid dropping traces when running out of shared memory
- anv/iris: widen Wa_14015946265 to Gfx11+
- anv: add missing workaround for 3DSTATE_LINE_STIPPLE
- iris: add missing workaround for 3DSTATE_LINE_STIPPLE
- intel/fs: handle ishl in surface/sampler rematerialization
- intel/fs: handle add3 in surface/sampler rematerialization
- intel/fs: switch from SIMD 1 to 8 instructions surface/sampler rematerialization
- anv: fix internal compute copy shader build
- anv: reduce working temporary memory for BVH builds
- anv: move bo_pool allocation flags to init caller
- anv: use buffer pools for BVH build buffers
- intel/ds: track acceleration RT commands
- anv: fix index buffer size programming
- anv: implement INTEL_DEBUG=reemit
- anv: add missing workaround handling in simple shader
- anv: fix a couple of missing input for 3DSTATE_RASTER programming
- anv: flag 3DSTATE_RASTER as dirty after simple shader primitive
- vulkan: bump headers/registry to 1.3.267
- anv: rename primary in container in ExecuteCommands()
- anv: add support for VK_EXT_nested_command_buffer
- anv: simplify push descriptors
- anv: fixup spirv cap for ImageReadWithoutFormat on Gfx12.5
- Revert "intel/fs: limit register flag interaction of FIND_*LIVE_CHANNEL"
- anv: update batch chaining to Gfx9 commands
- anv: workaround Gfx11 with optimized state emission
- u_trace: generate tracepoint index parameter in perfetto callbacks
- u_trace: generate tracepoint name array in perfetto header
- intel/ds: provide names for different events of a timeline's row
- anv: reuse local variable for gfx state
- anv: track render targets & render area changes separately
- anv: don't uninitialize bvh_bo_pool is not initialized
- anv: uninitialize queues before utrace
- anv: move generation shader return instruction to last draw lane
- anv: fix generated draws gl_DrawID with more than 8192 indirect draws
- anv: extract out draw call generation
- anv: identify internal shader in NIR
- anv: avoid MI commands to copy draw indirect count
- anv: move generation batch fields to a sub-struct
- util/glsl2spirv: add ability to pass defines
- anv: factor out host/gpu internal shaders interfaces
- anv: index indirect data buffer with absolute offset
- anv: add ring buffer mode to generated draw optimization
- anv: merge gfx9/11 indirect draw generation shaders
- anv: document the draw indirect optimization ring mode
- anv: fixup 32bit build of internal shaders
- anv: fix uninitialized use of compute initialization batch
- intel/fs: fix dynamic interpolation mode selection
- anv/meson: add missing dependency on the interface header
- anv: fix corner case of mutable descriptor pool creation
- isl: disable MCS compression on R9G9B9E5
- intel/fs: rerun divergence analysis prior to convert_from_ssa
- intel/nir/rt: fix reportIntersection() hitT handling
- anv: fix CC_VIEWPORT pointer dirty after blorp/simple-shaders
- anv: fix dirty state tracking for 3DSTATE_PUSH_CONSTANT_ALLOC
- intel/perf: fix querying of configurations

Louis-Francis Ratté-Boulianne (15):

- panfrost: Fix error in comment
- panfrost: Add methods to determine slice and body alignment
- panfrost: Add method to get size of AFBC subblocks
- panfrost: Precalculate stride and nr of blocks for AFBC layouts
- panfrost: Add panfrost_batch_write_bo
- panfrost: Make panfrost_resource_create_with_modifier public
- panfrost: Split out internal of \`panfrost_launch_grid`
- panfrost: Add infrastructure for internal AFBC compute shaders
- panfrost: Add method to get size of AFBC superblocks valid data
- panfrost: Add support for AFBC packing
- panfrost: Legalize resource when attaching to a batch
- panfrost: Don't force constant modifier after converting
- panfrost: Add debug flag to force packing of AFBC textures on upload
- panfrost: Add some debug utility methods for resources
- panfrost: Add env variable for max AFBC packing ratio

Lucas Stach (33):

- ci/etnaviv: update ci expectation
- etnaviv: move resource seqnos to level
- etnaviv: flush destination before executing blit
- etnaviv: optimize resource copies by skipping clean levels
- etnaviv: add helper to mark resource level as flushed
- etnaviv: add helper to mark resource level as changed
- etnaviv: add helper to transfer resource level age to another
- etnaviv: add helper to get TS validity
- etnaviv: add helper to set TS validity
- etnaviv: move TS meta into etna_resource_level
- etnaviv: add tile status buffer status into TS metadata
- etnaviv: optimize sampler source update
- etnaviv: allow sampler TS even if the resource is flushed
- etnaviv: keep blit destination tile status valid if possible
- etnaviv: optimize render resource update
- etnaviv: optimize transfers when whole resource level is discarded
- etnaviv: split etna_copy_resource_box levels parameter in src/dst
- etnaviv: don't allocate full resource as transfer staging
- etnaviv: check for valid TS as condition to create the staging resource
- etnaviv: reword comment about staging resource usage
- etnaviv: remove huge outdated comment
- etnaviv: move buffer range tracking into the PIPE_MAP_WRITE clause
- etnaviv: remove superfluous braces
- etnaviv: remove always true assert in etna_transfer_unmap
- etnaviv: remove bogus comment about replacing resource storage
- etnaviv: initialize VIVS_GL_BUG_FIXES
- etnaviv: fix read staging buffer leak
- Revert "ci/etnaviv: allow failure on failing test"
- mesa: enable NV_texture_barrier in GLES2+ (again)
- etnaviv: use correct blit box sizes when copying resource
- etnaviv: zero shared TS metadata block
- Revert "etnaviv: use correct blit box sizes when copying resource"
- mesa: add GL_APPLE_sync support

Luigi Santivetti (1):

- pvr: do not claim support for ASTC texture compression

M Henning (31):

- nv50/ir: Drop nir_jump_return handling
- nv50/ir: Remove ArgumentMovesPass
- nv50/ir: Remove Function.stackPtr
- nv50/ir: Remove dead loop from assignSlot
- nv50/ir: Remove SpillSlot
- nvc0: Keep nir directly in nvc0_program
- nv50: Keep nir directly in nv50_program
- nouveau: Delete nv50_ir_from_tgsi.cpp
- nouveau: Drop tgsi support from nv50_ir_prog_info
- nouveau: Drop ConverterCommon::Subroutine
- nouveau: Drop BuildUtil::DataArray
- nouveau: Drop BuildUtil::Location
- nouveau: Delete the nouveau_compiler tool
- nv/codegen: Call nir_shader_gather_info
- nv/codegen: Implement nir_op_fquantize2f16
- nvk: Remove reference to genUserClip
- nv/codegen: Use nir_lower_clip
- nv50_ir_from_nir: Use nir's lower_fpow
- nv/codegen: Delete OP_POW
- nv/codegen: Fix an uninitialized variable warning
- nv/codegen: Delete OP_WRSV
- nv/codegen: Delete OP_EXP, OP_LOG
- nv/codegen: Remove fragCoord variable.
- nv/codegen: Merge from_common into from_nir
- nv/codegen: Remove unused clipVertexOutput var
- nv50_ir_ra: Delete unused functions
- nv/codegen: Delete unused OP_CONSTRAINT
- nv/codegen: Delete periodicMask32
- nv/codegen: Remove Function::buildDefSets
- nv/codegen: Change copy-constructor call to assign
- nv/codegen: Delete copy and assign

Maaz Mombasawala (2):

- svga: Make surfaces shareable at creation.
- svga: Unify gmr and mob surface pool managers

Marcin Ślusarz (16):

- iris: avoid duplicating validation entries
- hasvk: remove dead code & comments related to mesh shading
- anv: drop support for VK_NV_mesh_shader
- intel/compiler: remove NV_mesh_shader support
- intel/compiler: remove redundant code
- anv: drop unused function
- anv: merge cases leading to the same code
- intel/compiler/mesh: compactify MUE layout
- intel/compiler,anv: put some vertex and primitive data in headers
- intel/compiler: load debug mesh compaction options once
- intel/compiler/test: fix crashes when TEST_DEBUG is set
- intel/compiler: add lsc_msg_desc_wcmask
- intel/compiler: add initial support for URB_LOGICAL_SRC_CHANNEL_MASK to lower_urb_write_logical_send_xe2
- intel/compiler/mesh: fix position of output URB handle for xe2
- intel/compiler/mesh: implement IO for xe2
- intel/compiler: mask GS URB handles at thread payload construction

Marek Olšák (125):

- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling."
- glthread: determine global locking once every 64 batches to fix get_time perf
- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
- freedreno,lima,zink: update CI fixes and flakes
- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
- util/u_queue: always enable UTIL_QUEUE_INIT_SCALE_THREADS, remove the flag
- radeonsi: fix a CDNA regression breaking compute
- glthread: sync for VDPAU sync functions
- radeonsi: turn sh_base[PIPE_SHADER_VERTEX] into a constant in emit_draw_packets
- radeonsi: restructure the loop for non-indexed multi draws
- radeonsi: cosmetic changes to radeon_opt_* macros
- radeonsi: handle draw user SGPRs as tracked registers
- radeonsi: update obsolete comments about compiler queues
- radeonsi: remove si_compute.h, move the contents into si_pipe.h
- radeonsi: move si_update/emit_tess_io_layout_state into si_state_shaders.cpp
- radeonsi: move si_emit_spi_map into si_state_shaders.cpp
- radeonsi: move si_emit_rasterizer_prim_state out of si_emit_all_states
- radeonsi: remove splitting IBs that use too much memory
- radeonsi: add padding to si_resource to fix Viewperf2020/catiav5test1 perf
- radeonsi: remove unused check_mem parameter from si_sampler_view_add_buffer
- radeonsi: remove the draw counter with primitive restart from the HUD
- radeonsi: always inline si_prefetch_shaders
- radeonsi: specialize si_draw_rectangle using a C++ template
- radeonsi: add index parameter into si_atom::emit
- radeonsi: split direct pm4 emission from si_pm4_emit
- radeonsi: move code around si_pm4_emit_state into si_pm4_emit_state
- radeonsi: merge pm4 state and atom emit loops into one
- radeonsi: add a simple version of si_pm4_emit_state for non-shader states
- radeonsi: handle deferred cache flushes as a state (si_atom)
- radeonsi: remove render condition logic from si_draw by reordering atoms
- radeonsi: abort when failing to upload descriptors instead of skipping draws
- radeonsi: rename shader_pointers state -> gfx_shader_pointers
- radeonsi: merge si_upload_*_descriptors into si_emit_*_shader_pointers
- radeonsi: convert si_gfx_resources_add_all_to_bo_list to a state atom
- radeonsi/ci: update gfx11 failures
- radeonsi: move GE_CNTL emission from si_draw into si_emit_vgt_pipeline_state
- radeonsi: use num_patches_per_workgroup directly in si_get_ia_multi_vgt_param
- radeonsi: enable shader culling by default because it helps Viewperf
- radeonsi: rewrite how occlusion query precision is determined for performance
- radeonsi: set PIPE_CONTEXT_LOSE_CONTEXT_ON_RESET on aux_context explicitly
- radeon_winsys: move allow_context_lost from cs_create to ctx_create
- winsys/amdgpu: rework how SW reset status is generated and reported
- radeon_winsys: add a ctx_set_sw_reset_status callback
- radeonsi: don't abort for descriptor failures, let the winsys handle it
- radeonsi: don't use threadID.yz/blockID.yz for copy_image if those are always 0
- radeonsi: don't use threadID.yz/blockID.yz for compute_blit if they're always 0
- nir: fix constant evaluation of fddx/fddy sourcing Inf & NaN constant
- nir/algebraic: collapse ALU opcodes sourcing NaN
- ac/gpu_info: add the /dev/dri/ filename into radeon_info
- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info"
- ac: implement AMD_FORCE_FAMILY properly, remove SI_FORCE_FAMILY
- ac: document ac_shader_args::gs_vtx_offset
- ac: minor updates to packet documentation and definitions
- ac: change offsets of DMA_DATA dwords to prevent reg offset conflicts
- ac: improve the IB parser
- ac: update gfx11 shadowed register tables
- ac: add a standalone IB parser program
- ac/surface: trivial non-functional changes
- ac/surface: add radeon_surf::u::gfx9::uses_custom_pitch
- radeonsi: allow setting any index in radeon_set_sh_reg_idx
- radeonsi: rename uses_subgroup_info to uses_tg_size
- radeonsi: improve the heuristic when to use Wave32 for compute shaders
- radeonsi: simplify/merge emit_shader_ngg functions
- radeonsi: don't pass gl_Layer to PS for blit shaders
- radeonsi/gfx11: pass attribute ring addr via SGPR instead of memory for blits
- radeonsi: fix templated si_draw_rectangle callback for Navi14
- nir: replace undef only used by ALU opcodes with 0 or NaN
- nir: remove nir_op_unpack_64 handling from nir_opt_undef
- ac/llvm: don't convert undef to 0 because nir_opt_undef does it now
- meson: use llvm-config instead of cmake to fix linking errors with meson 1.2.1
- gallivm: fix build with LLVM 18
- amd/llvm: fix build with LLVM 18
- radeonsi: fix compute-only contexts
- ac/llvm: replace removed amdgcn.ldexp for LLVM 18
- ac/perfcounter: remove a bogus assert to fix an assertion failure on gfx11
- ac/llvm: set !fpmath 3.0 for llvm.sqrt
- ac/gpu_info: don't align IBs to the GL2 cache line size
- ac/llvm: fix flat PS input corruption
- amd: rename GFX110x to NAVI31-33
- ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments
- ac/gpu_info: pad IBs according to ib_size_alignment
- winsys/amdgpu: pad gfx and compute IBs with a single NOP packet
- Revert "radeonsi: specialize si_draw_rectangle using a C++ template"
- radeonsi/ci: update navi10 results
- gallium/util: fix GALLIUM_TESTS=1 by using cso_set_vertex_buffers_and_elements
- gallium/util: add more tests for compute-only contexts
- radeonsi: add another aux context for uploading shaders
- radeonsi: upload shaders via a staging buffer so as not to map VRAM directly
- ac/surface: don't require exact pitch for gfx6-8 tiled imports
- Revert "ac/gpu_info: override ib_size_alignment for VCN_DEC and JPEG"
- Revert "radv/amdgpu: fix alignment of command buffers"
- Revert "radv: fix alignment of DGC command buffers"
- Revert "winsys/amdgpu: pad gfx and compute IBs with a single NOP packet"
- Revert "ac/gpu_info: pad IBs according to ib_size_alignment"
- Revert "ac/gpu_info: replace ib_alignment with per-IP IB base and size alignments"
- nir: sort variables by location in nir_lower_io_passes to work around a bug
- nir: recompute IO bases after DCE in nir_lower_io_passes
- nir: add dual-slot input information into load_input intrinsics
- nir: take dual slot input info into account when computing IO driver locations
- nir: gather dual slot input information
- nir: expose reusable linking helpers for cloning uniform loads
- nir: handle nir_var_mem_ubo in nir_clone_uniform_variable
- ac/gpu_info: split ib_alignment as ip[type].ib_alignment
- ac/gpu_info: move ib_pad_dw_mask into ip[]
- ac/gpu_info: drop the hack unifying all IB alignments
- ac/gpu_info: conservatively decrease IB alignment and padding to 256B
- ac/gpu_info: set gfx and compute IB padding to only 8 dwords
- winsys/amdgpu: properly pad the IB in amdgpu_submit_gfx_nop
- winsys/amdgpu: correctly pad noop IBs for RADEON_NOOP=1
- winsys/amdgpu: pad gfx and compute IBs with only 1 NOP
- ac/gpu_info: don't allow register shadowing with SR-IOV due to bad performance
- radeonsi: disable register shadowing without SR-IOV to fix bad performance
- winsys/amdgpu: don't send CP_GFX_SHADOW chunk if shadow address is not set
- radeonsi/ci: update gfx1100 results
- nir: split FLOAT_CONTROLS_SIGNED_ZERO_INF_NAN_PRESERVE_FP* flags
- nir/algebraic: use only signed_zero_preserve_* for addition by 0 patterns, etc.
- mesa: don't pass Infs to the shader via gl_Fog.scale
- radeonsi/ci: update the runner for new build scripts
- radeonsi/ci: enable GTF tests in the runner
- radeonsi/ci: enable GLES CTS in the runner
- radeonsi/ci: update failures and flakes
- amd/common: update DCC for gfx11.5
- radeonsi: initialize perfetto in the right place
- radeonsi/gfx11: don't set OREO_MODE to fix rare corruption
- nir: fix gathering TESS_LEVEL_INNER/OUTER usage with lowered IO

Marek Vasut (1):

- etnaviv: Fully replicate back stencil config

Mark Collins (10):

- tu/a7xx: Adapt r3d blits for A7xx
- freedreno/rnn: Remove %n usage in fprintf
- freedreno: Only add drm/computerator when system_has_kms_drm
- freedreno/decode: Support building replay for multiple KMDs
- freedreno+meson: Add lua+libarchive+libxml from Meson WrapDB
- meson: Warn about side-effects from DRM for FD KMDs
- meson: Update libarchive to v3.7.2-2
- freedreno/common: Add max_sets property to A6xxGPUInfo
- tu: Support higher descriptor set count for A7XX
- tu,util/driconf: Add option to not reserve descriptor set

Mark Janes (1):

- intel: allow reduced memory usage for INTEL_MEASURE

Martin Roukala (né Peres) (22):

- radv/ci: drop the auto-reboot-on-hang for vkcts-navi10
- radv/ci: use the default kernel on vkcts-navi10
- zink/ci: automatically reboot when hitting a kernel BUG on vangogh
- zink/ci: document more flakes seen on vangogh
- radv/ci: move vkcts-navi10 testing to KWS
- radv/ci: add more tests to the navi10 vkcts flake list
- radv/ci: increase the parallelism of the vkcts-navi21 job
- radv/ci: add more tests to the navi21 vkcts flake list
- radv/ci/vkcts-navi21: catch all the line_stipple_(enable|params) flakes
- radv/ci/vkcts-navi21: document more flakes
- radv/ci/vkcts-navi10: catch all the line-related flakes
- radv/ci: update the vkcts gfx1100 flake/fail lists
- radv/ci: add a manual job to run vkcts on navi31
- radv/ci: add a manual job for vkd3d-proton on navi31
- ci/vkcts-vangogh: mark dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* as flake
- ci/vkcts-navi21: mark more of the RT handles checks as flakes
- ci: make B2C_JOB_VOLUME_EXCLUSIONS to all .b2c-test jobs
- zink/ci: remove 19 tests from the zink-radv-polaris10-fails list
- ci/b2c: switch containers to a back-up ahead of valve-infra renaming
- zink/ci: remove 42 tests from the zink-radv-polaris10-fails list
- radv/ci: tighten the vkcts-navi21 timeouts
- zink/ci: tighten the zink-radv-vangogh timeouts

Martin Stransky (1):

- llvmpipe: fix UAF in lp_scene_is_resource_referenced.

Mary (6):

- nouveau/mme: Add initial Fermi definition
- nouveau/mme: Add Fermi builder
- nouveau/mme: Add Fermi simulator
- nouveau/mme: Add Fermi hardware tests
- agx: Move nir_lower_fragcolor out of agx_preprocess_nir
- agx: Ensure to lower 1D image load/store to 2D

Mary Guillemard (4):

- nir: Add NVIDIA-specific geometry shader opcodes
- venus: skip bind sparse info when checking for feedback query
- zink: Check for VK_EXT_extended_dynamic_state3 before setting A2C
- venus: Do not submit batch manually when no feedback is required

Matt Coster (21):

- pvr: Pad rogue_regarray_cache_key union members to avoid UB
- pvr: Clean up extension tables
- pvr: Refactor pvr_GetPhysicalDeviceProperties2()
- docs: Fixup imagination/pvr extension support
- pvr: Add VK_KHR_get_display_properties2
- pvr: Add VK_KHR_get_memory_requirements2
- pvr: Add VK_KHR_get_surface_capabilities2
- pvr: Print VkStructureType name on pvr_debug_ignored_stype()
- pvr: Add VK_KHR_copy_commands2
- pvr: Don't override commands copied to new buffer when extending cs
- pvr: Do not require TA_STATE_HEADER.pres_ispctl_dbsc for {db,sc}enable
- pvr: Zero tail of cs buffers after linking when dumping cs
- pvr: Cleanup comments in pvr_physical_device_get_supported_*()
- pvr: Don't rely on GNU void pointer arithmetic
- pvr: Force compile error on GNU void pointer arithmetic
- pvr: Switch to common pipeline cache implementation
- pvr: Use vk_sampler base
- pvr: Clean up & fix sampler border color support
- pvr: Don't pass pvr_physical_device when only device info is needed
- pvr: Minor refactor of pvr_device.c
- pvr: Use common physical device properties

Matt Turner (10):

- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput"
- intel: Rearrange for next commit
- intel: Consider with_intel_clc in with_any_intel
- intel: Only build blorp if drivers are enabled
- intel: Only build ds if drivers are enabled
- intel: Only build perf if drivers or tools are enabled
- intel: Allow using intel_clc from the system
- intel: Limit Intel Vulkan RT to x86_64
- r600: Add missing dep on git_sha1.h
- util: Include stdint.h in libdrm.h

Mauro Rossi (7):

- nouveau/ws: fix building error in nouveau_ws_push_dump()
- vulkan/meta: fix gnu-empty-initializer build error
- nouveau/mme: fix print inst for case MME_FERMI_OP_MERGE
- anv/android: remove numFds check
- hasvk/android: remove numFds check
- Android.mk: filter out cflags to build with Android 14 bundled clang
- Android.mk: disable android-libbacktrace to build with Android 14

Mike Blumenkrantz (293):

- ci: bump VVL to 1.3.257
- zink: set pipeline dynamic state count after all dynamic states are set
- zink: set feedback attachments on batch init
- zink: be even dumber about buffer refs when replacing storage
- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
- zink: fix the fix for separate shader program refcounting
- kopper: handle pixmap creation failure more gracefully
- glxsw: check geometry of drawables on creation
- kopper: move pixmap param for drawable creation to info struct
- glx/dri3: split out modifier check
- glx/sw: check for modifier support in the kopper path
- kopper: pass modifier availability to drawable creation
- kopper: determine modifier support per-drawable
- zink: don't clobber descriptor mode on multiple screen creation
- nir: fix slot calculations for compact variables with location_frac
- lavapipe: use the component offset directly for xfb
- nir: add a helper for calculating variable slots
- radv: bump max xfb output to 128
- ir3: bump max xfb output to 128
- gallium: bump PIPE_MAX_SO_OUTPUTS to 128
- zink: add feedback loop exts to optimal profile
- glsl: only explicitly check GS components in PSIZ injection with output variables
- lavapipe: statically allocate fb attachment array
- lavapipe: zero fb attachment array at rp start
- lavapipe: don't check geometry for fb attachments
- lavapipe: be slightly more permissive for bad apps (and cts) with dynrender
- lavapipe: VK_EXT_host_image_copy
- zink: better handle separate shader dsl creation when no bindings exist
- zink: force image barriers after dmabuf import
- ci: bump VVL to 1.3.261
- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
- zink: unset line stipple ds3 state flags when stipple not available
- nir/lower_io_to_scalar: fix 64bit io splitting
- nir/linking_helpers: force type matching in does_varying_match
- nir/print: print location names for (some) tess slots
- nir/print: always group variables by type when printing
- zink: add batch refs for transient images
- zink: fix zs resolve attachment indexing
- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
- zink: don't append msrtss to dynamic render if not supported
- zink: set msrtss depth resolve mode when enabled
- zink: hook up VK_KHR_workgroup_memory_explicit_layout
- zink: propagate have_workgroup_memory_explicit_layout to ntv
- zink: use SPV_KHR_workgroup_memory_explicit_layout when available
- zink: add more locking for pipeline cache
- zink: add VK_PIPELINE_CACHE_CREATE_EXTERNALLY_SYNCHRONIZED_BIT_EXT
- aux/trace: fix winsys handle dumping
- zink: generated tcs is on the tes, not the vs
- zink: apply ZINK_DEBUG=noopt to linked separate shaders
- gallivm: handle A8_UNORM image stores
- llvmpipe: enable A8_UNORM for shader images
- llvmpipe: export PIPE_CAP_IMAGE_LOAD_FORMATTED
- lavapipe: GetRenderingAreaGranularityKHR
- llvmpipe: block weird uses of subsampled formats in buffers
- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
- lavapipe: fix BindVertexBuffers2 buffer size handling
- lavapipe: fix resolves where src image has a layer offset
- lavapipe: block yuv formats from getting blit feature flags
|
---|
4702 | - lavapipe: BindIndexBuffer2
|
---|
4703 | - lavapipe: GetDeviceImageSubresourceLayoutKHR
|
---|
4704 | - lavapipe: VK_REMAINING_ARRAY_LAYERS for copy ops
|
---|
4705 | - lavapipe: maintenance5
|
---|
4706 | - zink: fix xfb buffer array sizing to use buffer limit, not output
|
---|
4707 | - zink: move ZINK_DEBUG=nir printing to just before compile
|
---|
4708 | - draw: fix so debug offset printing
|
---|
4709 | - zink: reindex ssa defs before dumping debug shaders
|
---|
4710 | - lavapipe: zero-init pipe_sampler_state
|
---|
4711 | - zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
|
---|
4712 | - zink: fix big tcs output io
|
---|
4713 | - zink: don't try to replace separate shader prog in noopt mode
|
---|
4714 | - zink: pre-convert mode in fixup_io_locations
|
---|
4715 | - zink: add a special separate shader i/o mode for legacy variables
|
---|
4716 | - nir: minor fixes for io_to_scalar
|
---|
4717 | - nir/lower_io: add a new doubles-only 64bit lowering option
|
---|
4718 | - nir: add a filter cb to lower_io_to_scalar
|
---|
4719 | - d3d10umd: use cso_context to set vertex buffers and elements
|
---|
4720 | - virgl: move virgl_vertex_elements_state to header
|
---|
4721 | - virgl: fix some indentation
|
---|
4722 | - nouveau: calloc vertex csos
|
---|
4723 | - gallium: move vertex stride to CSO
|
---|
4724 | - zink: fix null config screen creation
|
---|
4725 | - zink: fix crash in lower_pv_mode_gs_store
|
---|
4726 | - u/draw: skip zero-sized indirect draws
|
---|
4727 | - lavapipe: handle VkPipelineCreateFlagBits2KHR
|
---|
4728 | - lavapipe: handle VkBufferUsageFlags2KHR
|
---|
4729 | - zink: ci updates
|
---|
4730 | - zink: track start/stop of a couple query types
|
---|
4731 | - zink: require EDS1 for CWE usage
|
---|
4732 | - zink: unset primgen suspended flag when ending a primgen query
|
---|
4733 | - zink: rework rast-discard for primgen queries
|
---|
4734 | - zink: rip out some awkward parts of the old non-cwe path
|
---|
4735 | - zink: drop CWE requirement for renderpass tracking with primgen queries
|
---|
4736 | - nir/zink: fix gs emulation xfb_info sizing
|
---|
4737 | - zink: move fragcolor lowering further along the compile process
|
---|
4738 | - zink: add a mode param to find_var_with_location_frac
|
---|
4739 | - zink: use lowered io (kinda) for i/o vars
|
---|
4740 | - zink: stop lowering indirect derefs
|
---|
4741 | - ntt: handle interp intrinsics as derefs
|
---|
4742 | - zink: delete split_blocks pass
|
---|
4743 | - zink: delete lower_64bit_vertex_attribs pass
|
---|
4744 | - zink: fix clip/cull dist xfb inlining
|
---|
4745 | - zink: delete all the extra gross xfb handling
|
---|
4746 | - zink: stop using pipe_stream_output
|
---|
4747 | - zink: remove pipe_stream_output from function params
|
---|
4748 | - zink: ci updates
|
---|
4749 | - aux/trace: print bindless handles as pointers
|
---|
4750 | - zink: remove unused param from create_ici
|
---|
4751 | - zink: split create_ici to init and eval
|
---|
4752 | - zink: add maintenance extensions to profile
|
---|
4753 | - zink: use maintenance5
|
---|
4754 | - zink: use real A8_UNORM when possible
|
---|
4755 | - vk/graphics: fix CWE handling with DS3
|
---|
4756 | - Revert "vk/wsi/x11: handle geometry updating more asynchronously"
|
---|
4757 | - r600: store the mask of buffers used by a vertex state
|
---|
4758 | - r600: better tracking for vertex buffer emission
|
---|
4759 | - zink: wait on async fence during ctx program removal
|
---|
4760 | - zink: handle patch variable locations for separate shaders better
|
---|
4761 | - zink: don't start multiple cache jobs for the same program
|
---|
4762 | - zink: use the "set" optimal key for prog last_variant_hash for consistency
|
---|
4763 | - zink: sanitize optimal keys
|
---|
4764 | - zink: copy some cs shader properties to the program struct
|
---|
4765 | - zink: handle global atomic intrinsics
|
---|
4766 | - zink: use Aligned with global load/store ops
|
---|
4767 | - zink: fix rewrite_read_as_0 filtering
|
---|
4768 | - rusticl: fixes for zink shader images
|
---|
4769 | - zink: pass KERNEL shaders through successfully
|
---|
4770 | - zink: add a618 flake
|
---|
4771 | - zink: break out ds3 state resetting
|
---|
4772 | - zink: be consistent with ds3 state resetting for blits
|
---|
4773 | - zink: fix optimal_keys warning message
|
---|
4774 | - zink: force-reset unordered flags for buffer barriers on non-matching batch access
|
---|
4775 | - zink: reset unordered flags for image barriers on non-matching batch access
|
---|
4776 | - zink: make image barrier init functions void return
|
---|
4777 | - zink: simplify some image barrier conditionals
|
---|
4778 | - zink: remove sync TODO
|
---|
4779 | - zink: add lavapipe flake
|
---|
4780 | - ci: disable nouveau shaderdb
|
---|
4781 | - egl/dri3: only set driver_name if not already set
|
---|
4782 | - egl: call dri3_x11_connect() for zink
|
---|
4783 | - egl: bind dri2_set_WL_bind_wayland_display for zink when necessary
|
---|
4784 | - zink: be more precise about flagging rp changes around unordered u_blitter
|
---|
4785 | - zink: don't block reordering during ref updates in unordered blits
|
---|
4786 | - lavapipe: update vbo indices before propagating stride
|
---|
4787 | - lavapipe: fix pipeline stride propagation
|
---|
4788 | - zink: fix linear modifier dmabuf imports
|
---|
4789 | - zink: polaris ci updates
|
---|
4790 | - aux/tc: handle stride mismatch during rp-optimized subdata
|
---|
4791 | - zink: always add a per-prog ref for gpl libs
|
---|
4792 | - zink: use a pointer to simplify submit struct mechanics
|
---|
4793 | - zink: make zink_resource_image_barrier2_init public
|
---|
4794 | - zink: add a third submitinfo (unused for now)
|
---|
4795 | - zink: make submitinfo handling easier to manage with enum
|
---|
4796 | - zink: add another submitinfo for fd semaphore waits
|
---|
4797 | - zink: add a screen cache for fd semaphores
|
---|
4798 | - zink: add a util for getting cached fd semaphores
|
---|
4799 | - zink: hook up cached fd semaphore usage for batch signal/waits
|
---|
4800 | - zink: handle implicit sync for dmabufs
|
---|
4801 | - zink: handle multi-plane implicit sync
|
---|
4802 | - zink: ci updates
|
---|
4803 | - zink: set is_xfb=false for all i/o variables
|
---|
4804 | - zink: reorder bindless io lowering
|
---|
4805 | - zink: fix typing on bindless io lowering
|
---|
4806 | - zink: delete some bindless io lowering code
|
---|
4807 | - zink: use nir_io_semantics::num_slots for indirect var creation
|
---|
4808 | - zink: simplify an arrayed io check during variable creation
|
---|
4809 | - zink: use explicit stride from types instead of copying old_var stride
|
---|
4810 | - zink: use MAX_PATCH_VERTICES directly for arrayed io var sizing
|
---|
4811 | - zink: use explicit sizing for builtins when creating variables
|
---|
4812 | - zink: create new vars without copying existing ones
|
---|
4813 | - zink: add a new linker pass to handle mismatched i/o components
|
---|
4814 | - zink: use right function to get src_type in eliminate_io_wrmasks
|
---|
4815 | - zink: re-rework i/o variable handling to make having variables entirely optional
|
---|
4816 | - ci: bump VVL to 1.3.263
|
---|
4817 | - zink: simplify redundant is_buffer check
|
---|
4818 | - zink: use VkFormatProperties3
|
---|
4819 | - lavapipe: handle VkHostImageCopyDevicePerformanceQueryEXT
|
---|
4820 | - lavapipe: don't advertise UNDEFINED layout for HIC
|
---|
4821 | - zink: hook up VK_EXT_host_image_copy
|
---|
4822 | - zink: move mem type detection up in file
|
---|
4823 | - zink: disable HIC without resizable BAR
|
---|
4824 | - zink: add a fixup method for extra driver props
|
---|
4825 | - zink: fix some off-by-one indentation
|
---|
4826 | - zink: use some return codes for check_ici errors
|
---|
4827 | - zink: check/use suboptimal HIC during ici init
|
---|
4828 | - zink: use HIC for image subdata when possible
|
---|
4829 | - zink: slightly refactor psiz deletion during linking
|
---|
4830 | - zink: delete all psiz=1.0 stores if maintenance5 is present
|
---|
4831 | - nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
|
---|
4832 | - zink: add ZINK_DEBUG=quiet
|
---|
4833 | - zink: imply ZINK_DEBUG=quiet if ZINK_DEBUG=optimal_keys is set on turnip
|
---|
4834 | - zink: set optimal_keys for turnip jobs
|
---|
4835 | - aux/tc: fix staging buffer sizing for texture_subdata
|
---|
4836 | - aux/tc: fix address calc for segmented texture subdata
|
---|
4837 | - zink: ci updates
|
---|
4838 | - lavapipe: KHR_map_memory2
|
---|
4839 | - zink: slightly refactor pipeline compile selection
|
---|
4840 | - zink: add a flag for combined pipeline compile for doing FAIL_ON_PIPELINE_COMPILE_REQUIRED
|
---|
4841 | - zink: remove an intermediate variable in pipeline compile selection
|
---|
4842 | - zink: use FAIL_ON_PIPELINE_COMPILE_REQUIRED for GPL path
|
---|
4843 | - zink: pass a stage mask to pipeline create functions
|
---|
4844 | - glsl: check for xfb setting xfb info
|
---|
4845 | - zink: don't warn about missing scalarBlockLayout on v3dv
|
---|
4846 | - aux/tc: fix renderpass tracking fb state clobber scenario
|
---|
4847 | - vk/enum2str: add more max enum vendors
|
---|
4848 | - aux/tc: fix rp info handling around tc_sync calls
|
---|
4849 | - aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata
|
---|
4850 | - zink: flag db maps as unsynchronized
|
---|
4851 | - lavapipe: clamp cache uuid size
|
---|
4852 | - lavapipe: EXT_load_store_op_none
|
---|
4853 | - tu: handle unused color attachments without crashing
|
---|
4854 | - zink: use much bigger dummy surfaces
|
---|
4855 | - zink: propagate rp_tc_info_updated across unordered blits
|
---|
4856 | - zink: use null attachments for null attachments with dynamic render
|
---|
4857 | - egl/swrast: expose EXT_swap_buffers_with_damage and EXT_present_opaque
|
---|
4858 | - egl/wayland: split out wl drm extension init
|
---|
4859 | - egl/wayland: use more registry listeners to better handle device init
|
---|
4860 | - egl/wayland: enable WL_bind_wayland_display for zink
|
---|
4861 | - zink: delete injected pointsize during shader creation
|
---|
4862 | - zink: require maintenance5 for shobj
|
---|
4863 | - zink: delete a non-maintenance5 workaround for shobj use
|
---|
4864 | - lavapipe: set separate_shaders for shader objects
|
---|
4865 | - zink: set workgroup_memory_explicit_layout for shader validation
|
---|
4866 | - zink: add a ZINK_DEBUG=validation alias
|
---|
4867 | - zink: fix semaphore signal ordering
|
---|
4868 | - zink: move swapchain fence to swapchain object
|
---|
4869 | - zink: avoid UAF on wayland async present with to-be-retired swapchain
|
---|
4870 | - zink: always trace_screen_unwrap in acquire
|
---|
4871 | - lavapipe: fix variable descriptor count support handling
|
---|
4872 | - lavapipe: always set independent blend
|
---|
4873 | - lavapipe: more vertex stride fixups
|
---|
4874 | - lavapipe: set default viewport and scissor count for cmdbufs
|
---|
4875 | - lavapipe: set default min sample shading to 1
|
---|
4876 | - glx: XFree visual info
|
---|
4877 | - radv: fix external handle type queries for dmabuf/fd
|
---|
4878 | - zink: fix crashing in image rebinds
|
---|
4879 | - zink: move push descriptor disable to driver workarounds
|
---|
4880 | - zink: move v3dv scalarBlockLayout workaround
|
---|
4881 | - zink: fix end-of-batch barrier pipeline stages
|
---|
4882 | - zink: guarantee egl syncobj lifetime
|
---|
4883 | - aux/trace: dump enum names for map usage
|
---|
4884 | - gallium: add PIPE_MAP_NONE
|
---|
4885 | - Revert "egl/wayland: Add image loader extension for swrast"
|
---|
4886 | - egl/wayland: don't block in swrast when updating buffers for zink
|
---|
4887 | - egl/wayland: return sooner from swrast_update_buffers() if zink
|
---|
4888 | - zink: don't check submit count for unflushed usage
|
---|
4889 | - egl: don't set ForceSoftware for all zink loading
|
---|
4890 | - zink: error at handle export on missing EXT_image_drm_format_modifier
|
---|
4891 | - gbm: delete some zink handling
|
---|
4892 | - zink: apply ZINK_DEBUG=quiet to all missing feature warnings
|
---|
4893 | - zink: set ZINK_DEBUG=quiet for polaris jobs
|
---|
4894 | - lavapipe: don't block begin/end cmdbuf pipeline barriers
|
---|
4895 | - ci: add a630 trace flakes
|
---|
4896 | - zink: shrink vectors during optimization
|
---|
4897 | - zink: always clamp shader stage in descriptor handling
|
---|
4898 | - zink: add set_global_binding
|
---|
4899 | - zink: eliminate samplers from no-sampler CL texops
|
---|
4900 | - zink: add some checks to determine whether queue is init on screen destroy
|
---|
4901 | - zink: don't destroy any simple_mtx_t objects during screen destroy
|
---|
4902 | - zink: don't destroy uninitialized disk cache thread
|
---|
4903 | - zink: reorder glsl_type_singleton_init_or_ref call
|
---|
4904 | - zink: use screen destructor for creation fails
|
---|
4905 | - zink: fix readback_present locking
|
---|
4906 | - zink: add automatic swapchain readback using heuristics
|
---|
4907 | - lavapipe: VK_EXT_nested_command_buffer
|
---|
4908 | - zink: ignore unacquired swapchain images during end-of-frame flush
|
---|
4909 | - nir/lower_fragcolor: preserve location_frac
|
---|
4910 | - zink: update pointer for GPL pipeline cache entry formats
|
---|
4911 | - zink: fix legacy depth texture rewriting for single component reads
|
---|
4912 | - egl: unify dri2_egl_display creation
|
---|
4913 | - egl: init dri3 version info during screen creation
|
---|
4914 | - egl/glx: don't load non-sw zink without dri3 support
|
---|
4915 | - egl: add automatic zink fallback loading between hw and sw drivers
|
---|
4916 | - glx: add automatic zink fallback loading between hw and sw drivers
|
---|
4917 | - ci: don't set GALLIUM_DRIVER for zink
|
---|
4918 | - egl/wayland: only add more registry listeners for hardware devices
|
---|
4919 | - zink: only increment image_rebind_counter on image export if binds exist
|
---|
4920 | - zink: check for sampler view existence during zink_rebind_all_images()
|
---|
4921 | - zink: use weston for anv ci
|
---|
4922 | - zink: blow up broken xservers more reliably
|
---|
4923 | - zink: delete some dead modifier handling
|
---|
4924 | - ci: skip implicit modifier piglits for zink
|
---|
4925 | - zink: don't block large vram allocations
|
---|
4926 | - zink: add copy box locking
|
---|
4927 | - zink: emit SpvCapabilitySampleRateShading with SampleId
|
---|
4928 | - zink: always set VK_EXTERNAL_MEMORY_HANDLE_TYPE_HOST_ALLOCATION_BIT_EXT for usermem
|
---|
4929 | - zink: clamp resolve extents to src/dst geometry
|
---|
4930 | - zink: only emit xfb execution mode for last vertex stage
|
---|
4931 | - aux/u_transfer_helper: set rendertarget bind for msaa staging resource
|
---|
4932 | - zink: unset explicit_xfb_buffer for non-xfb shaders
|
---|
4933 | - mesa/st/texture: match width+height for texture downloads of cube textures
|
---|
4934 | - zink: add more locking for compute pipelines
|
---|
4935 | - radv: correctly return oom from the device when failing to create a cs
|
---|
4936 | - zink: check for cbuf0 writes before setting A2C
|
---|
4937 |
|
---|
4938 | Mohamed Ahmed (19):
|
---|
4939 |
|
---|
4940 | - vulkan/util: Support 10-bit and 12-bit color formats in ycbcr_info in vk_format.c
|
---|
4941 | - vulkan/util: Support VK_EXT_ycbcr_2plane_444_formats color formats in vk_format.c
|
---|
4942 | - vulkan/util: Use ycbcr_info for multiplane helpers in vk_format.c
|
---|
4943 | - nvk: implement vkGetDeviceImageMemoryRequirementsKHR()
|
---|
4944 | - nvk: add stub for vkGetDeviceImageSparseMemoryRequirementsKHR()
|
---|
4945 | - nvk: implement vkGetDeviceBufferMemoryRequirementsKHR()
|
---|
4946 | - nvk: advertise VK_KHR_maintenance4
|
---|
4947 | - nvk: advertise DemoteToHelperInvocation
|
---|
4948 | - nvk: Enable multiplane images and image views
|
---|
4949 | - nouveau/nvk: Add YCbCr sampler NIR lowering pass
|
---|
4950 | - nouveau/nvk: Support multi-plane descriptors in nvk_nir_lower_descriptors.c
|
---|
4951 | - nouveau/nvk: Create helper function for sampler creation
|
---|
4952 | - nouveau/nvk: Add multiple sampler planes for CONVERSION_SEPARATE_RECONSTRUCTION_FILTER_BIT
|
---|
4953 | - nouveau/nvk: Enable VK_KHR_sampler_ycbcr
|
---|
4954 | - util/format: Add G8B8_G8R8_422_UNORM and B8G8_R8G8_422_UNORM formats
|
---|
4955 | - vulkan/format: Translate G8B8G8R8_422_UNORM and B8G8R8G8_422_UNORM properly
|
---|
4956 | - nvk: Enable SEPARATE_RECONSTRUCTION_FILTER_BIT for multi-planar formats only
|
---|
4957 | - nvk: Enable MIDPOINT_CHROMA_SAMPLES_BIT for multi-planar formats only
|
---|
4958 | - nil: Add support for G8B8_G8R8_UNORM and B8G8_R8G8_UNORM
|
---|
4959 |
|
---|
4960 | Nanley Chery (33):
|
---|
4961 |
|
---|
4962 | - iris: Remap DRM_FORMAT_MOD_INVALID more often during import
|
---|
4963 | - anv: Don't support ASTC images with modifiers
|
---|
4964 | - intel: Add and use isl_drm_modifier_get_plane_count
|
---|
4965 | - anv: Handle explicit surface layout of DG2_RC_CCS
|
---|
4966 | - anv: Reduce accesses of isl_mod_info->aux_usage
|
---|
4967 | - iris: Reduce accesses of mod_info->aux_usage
|
---|
4968 | - crocus: Delete modifier with aux code
|
---|
4969 | - hasvk: Delete modifier with aux code
|
---|
4970 | - iris: Swap stencil and modifier aux assignment order
|
---|
4971 | - intel: Describe modifier compression with booleans
|
---|
4972 | - intel/isl: Move the Tile4 modifier score case down
|
---|
4973 | - intel/isl: Add a score for DG2_RC_CCS
|
---|
4974 | - intel/blorp: Ambiguate after CCS resolves on gfx7-8
|
---|
4975 | - iris: Reorder render_aux_usage parameters
|
---|
4976 | - iris: Pass the render format to prepare_render
|
---|
4977 | - iris: Create BLORP surfaces after resource preparation
|
---|
4978 | - iris: Handle clear color compatibility in prepare_render
|
---|
4979 | - iris: Sample more texture view fast-clears on gfx11+
|
---|
4980 | - iris: Fix aux usage tracking in prepare_render
|
---|
4981 | - iris: Fix iris_copy_region calls involving FCV_CCS_E
|
---|
4982 | - iris: Drop get_copy_region_aux_settings
|
---|
4983 | - iris: Inline iris_can_sample_mcs_with_clear
|
---|
4984 | - anv: Initialize the clear color more often for FCV
|
---|
4985 | - intel: Return a bool from intel_aux_map_add_mapping
|
---|
4986 | - anv: Move scope of CCS binding determination
|
---|
4987 | - anv: Allocate space for aux-map CCS in image bindings
|
---|
4988 | - anv: Wrap aux surface image binding queries
|
---|
4989 | - anv: Refactor CCS disabling at image bind time
|
---|
4990 | - anv: Place images into the aux-map when safe to do so
|
---|
4991 | - anv: Loosen anv_bo_allows_aux_map
|
---|
4992 | - anv: Meet CCS alignment reqs with dedicated allocs
|
---|
4993 | - anv: Delete implicit CCS code
|
---|
4994 | - intel/isl: Add scores for GEN12_RC_CCS and MTL_RC_CCS
|
---|
4995 |
|
---|
4996 | Neal Gompa (1):
|
---|
4997 |
|
---|
4998 | - asahi: Fix 32-bit x86 build with correct data type for overflow error message
|
---|
4999 |
|
---|
5000 | Neha Bhende (1):
|
---|
5001 |
|
---|
5002 | - ntt: lower indirect tesslevels in ntt
|
---|
5003 |
|
---|
5004 | Paul Gofman (2):
|
---|
5005 |
|
---|
5006 | - driconf: add a workaround for Captain Lycop: Invasion of the Heters
|
---|
5007 | - driconf: add a workaround for Rainbow Six Extraction
|
---|
5008 |
|
---|
5009 | Paulo Zanoni (15):
|
---|
5010 |
|
---|
5011 | - anv: rename the vm_bind vfuncs
|
---|
5012 | - anv: add a new vm_bind vfunc
|
---|
5013 | - anv/xe: make vm_binds async
|
---|
5014 | - anv/xe: return failure in case waiting for the vm_bind syncobj fails
|
---|
5015 | - anv: remove misleading comment about batch_len
|
---|
5016 | - iris: assert bufmgr->bo_deps_lock is held
|
---|
5017 | - iris: avoid stack overflow in iris_bo_wait_syncobj()
|
---|
5018 | - iris: assert(bo->deps) after realloc()
|
---|
5019 | - intel/isl: add ISL_SURF_USAGE_SPARSE_BIT
|
---|
5020 | - intel/isl: simplify the check for maximum surface size
|
---|
5021 | - anv/sparse: add the initial code for Sparse Resources
|
---|
5022 | - anv/sparse: get ready to issue a single vm_bind ioctl per non-opaque bind
|
---|
5023 | - anv/sparse: add INTEL_DEBUG=sparse
|
---|
5024 | - anv: enable sparse resources by default
|
---|
5025 | - vulkan: fix potential memory leak in create_rect_list_pipeline()
|
---|
5026 |
|
---|
5027 | Pavel Ondračka (44):
|
---|
5028 |
|
---|
5029 | - r300: update RV370 failures
|
---|
5030 | - r300: check for index overflow when translating from TGSI
|
---|
5031 | - r300: source register index is always unsigned
|
---|
5032 | - r300: bump the RC_MAX_INDEX_BITS
|
---|
5033 | - r300: normal instruction can't have presubtract op
|
---|
5034 | - r300: add a helper for checking number of temporary sources
|
---|
5035 | - r300: cycles estimate for shader-db
|
---|
5036 | - r300: fix cycles calculation
|
---|
5037 | - r300: don't abort on flow control when using draw for vs
|
---|
5038 | - r300: add dEQP baseline for RV370 with forced swtcl
|
---|
5039 | - r300: copy ntt to r300 compiler
|
---|
5040 | - r300: add lower_sqrt to nir option
|
---|
5041 | - r300: remove unused intrinsics in ntr
|
---|
5042 | - r300: remove irrelevant opcodes in ntr
|
---|
5043 | - r300: remove unused integer support in ntr
|
---|
5044 | - r300: remove ntr_tgsi_usage_mask
|
---|
5045 | - r300: remove more unused 64-bit pieces from ntr
|
---|
5046 | - r300: simplify vectorization rules
|
---|
5047 | - r300: remove more ntr unused helpers
|
---|
5048 | - r300: remove the unneeded ntr_lower_vec_to_reg callback
|
---|
5049 | - r300: remove unneeded 64bit and atomic lowering passes
|
---|
5050 | - r300: remove unused ntr default settings
|
---|
5051 | - r300: remove ntr default options
|
---|
5052 | - r300: simplify ntr_emit_load_ubo
|
---|
5053 | - r300: simplify ntr_emit_load_input
|
---|
5054 | - r300: remove some virglrenderer specifics from ntr
|
---|
5055 | - r300: simplify ntr_setup_uniforms
|
---|
5056 | - r300: simplify ntr_output_decl
|
---|
5057 | - r300: simplify ntr_try_store_in_tgsi_output
|
---|
5058 | - r300: remove some unsupported texture opcodes
|
---|
5059 | - r300: remove unused barrier code from ntr
|
---|
5060 | - r300: simplify ntr_get_gl_varying_semantic
|
---|
5061 | - r300: remove the ntr main optimization loop
|
---|
5062 | - r300: reorder for easier presubtract 1-x pattern recognition
|
---|
5063 | - r300: exit early if presubtract is not supported
|
---|
5064 | - r300: implement bias presubtract
|
---|
5065 | - r300: convert x * 2 into x + x for presubtract
|
---|
5066 | - r300: move power of two multipliers down
|
---|
5067 | - r300: there is no limitation on presubtract source file
|
---|
5068 | - r300: use w channel for scalar opcodes if possible
|
---|
5069 | - r300: reduce number of iterations for vertex shader loops
|
---|
5070 | - r300: enable nir_move_vec_src_uses_to_dest
|
---|
5071 | - nir/move_vec_src_uses_to_dest: skip reuse if vec is used only once in store_output
|
---|
5072 | - nir/move_vec_src_uses_to_dest: allow to skip reuse of constant sources
|
---|
5073 |
|
---|
5074 | Philipp Zabel (1):
|
---|
5075 |
|
---|
5076 | - etnaviv: fix segfault after compile failure
|
---|
5077 |
|
---|
5078 | Pierre-Eric Pelloux-Prayer (18):
|
---|
5079 |
|
---|
5080 | - radeonsi/sdma: use multiple commands if required
|
---|
5081 | - radv/sdma: use multiple commands if required
|
---|
5082 | - radv/sdma: use correct limits for gfx10.3
|
---|
5083 | - glx: drop the 'libGL' log prefix
|
---|
5084 | - loader: refactor DRI_PRIME handling code
|
---|
5085 | - loader: extend DRI_PRIME to support =N
|
---|
5086 | - loader: add DRI_PRIME_DEBUG env var
|
---|
5087 | - device_select_layer: support DRI_PRIME=n
|
---|
5088 | - docs: update DRI_PRIME documentation
|
---|
5089 | - device_select: add shortcut for MESA_VK_DEVICE_SELECT_FORCE_DEFAULT_DEVICE
|
---|
5090 | - st/mesa: check renderbuffer before using it
|
---|
5091 | - radeonsi: emit framebuffer state after allocating cmask
|
---|
5092 | - amd/common: update addrlib for gfx11.5
|
---|
5093 | - amd/common: add registers for gfx11.5
|
---|
5094 | - ac/nir: extract must_wait_attr_ring helper
|
---|
5095 | - amd, radeonsi: Add code to enable gfx11.5
|
---|
5096 | - mesa: restore call to _mesa_set_varying_vp_inputs from set_vertex_processing_mode
|
---|
5097 | - radeonsi: check sctx->tess_rings is valid before using it
|
---|
5098 |
|
---|
5099 | Piotr Kocia (2):
|
---|
5100 |
|
---|
5101 | - nir: Remove dead nir_const_value variables
|
---|
5102 | - glsl: ir_function_param_visitor::visit_enter always true condition
|
---|
5103 |
|
---|
5104 | Qiang Yu (77):
|
---|
5105 |
|
---|
5106 | - aco,radv: replace tess_input_vertices shader info param
|
---|
5107 | - radeonsi: aco does not pass LS outputs to HS by arg
|
---|
5108 | - radeonsi: extract si_get_prev_stage_nir_shader to be shared with aco
|
---|
5109 | - radeonsi: init aco shader info for merged LS/HS
|
---|
5110 | - radeonsi: simplify si_build_wrapper_function
|
---|
5111 | - radeonsi: move vertex shader vb desc input sgpr args to last
|
---|
5112 | - radeonsi: remove param type check in wrapper function
|
---|
5113 | - radeonsi: refine si_llvm_ls_build_end
|
---|
5114 | - radeonsi: refine si_llvm_es_build_end
|
---|
5115 | - radeonsi: aco compile support merged mono shader
|
---|
5116 | - radeonsi: calculate lds size for merged shaders
|
---|
5117 | - radeonsi: enable aco compile for mono merged LS/HS
|
---|
5118 | - radeonsi: enable aco compile for mono merged ES/GS
|
---|
5119 | - aco: extract aco_compile_shader_part from aco_compile_ps_epilog
|
---|
5120 | - aco: add p_end_with_regs pseudo instruction
|
---|
5121 | - aco: move jump to epilog out of ic_merged_wave_info
|
---|
5122 | - aco: add tcs end regs for epilog usage
|
---|
5123 | - aco: allow tcs with epilog to keep nir store output instruction
|
---|
5124 | - aco: add pending_lds_access option for insert waitcnt
|
---|
5125 | - aco: add tcs epilog generation for radeonsi
|
---|
5126 | - aco: don't emit s_endpgm for tcs with epilog
|
---|
5127 | - aco: skip scratch init when no scratch arg provide
|
---|
5128 | - aco,radeonsi: save const addr to symbol
|
---|
5129 | - ac/nir/tess: move tess factor output out of control flow
|
---|
5130 | - aco: use semantic location as io temp index
|
---|
5131 | - radeonsi: add exec_size to shader binary
|
---|
5132 | - radeonsi: support upload multi part shader binary
|
---|
5133 | - radeonsi: share si_get_tcs_out_patch_stride with aco
|
---|
5134 | - radeonsi: fill part mode tcs aco shader info
|
---|
5135 | - radeonsi: extract si_llvm_build_shader_part
|
---|
5136 | - radeonsi: remove separate_prolog arg from prolog/epilog build
|
---|
5137 | - radeonsi: add si_get_tcs_epilog_args
|
---|
5138 | - radeonsi: change si_fill_aco_options args
|
---|
5139 | - radeonsi: add si_aco_build_shader_part
|
---|
5140 | - radeonsi: part mode standalone tcs support aco compile
|
---|
5141 | - radeonsi: remove unused arg of get_tcs_tes_buffer_address
|
---|
5142 | - aco: simplify setup_tcs_info
|
---|
5143 | - aco: pass sw_stage when setup_isel_context
|
---|
5144 | - aco: prepare fix_ls_vgpr_init_bug to be used by gl vs prolog
|
---|
5145 | - aco: add vs prolog instruction selection for radeonsi
|
---|
5146 | - aco: add aco compile interface for radeonsi vs prolog
|
---|
5147 | - aco: do not fix_exports when program is prolog
|
---|
5148 | - radeonsi: fill aco_shader_info->is_monolithic
|
---|
5149 | - radeonsi: remove is_monolithic from vs prolog key
|
---|
5150 | - radeonsi: extract si_get_vs_prolog_args to be shared with aco
|
---|
5151 | - radeonsi: fix aco options has_ls_vgpr_init_bug setup
|
---|
5152 | - radeonsi: add vs prolog aco build
|
---|
5153 | - radeonsi: set vs has prolog aco shader info
|
---|
5154 | - radeonsi: enable aco compile for part mode standalone vs
|
---|
5155 | - aco,radv,radeonsi: rename is_monolithic to merged_shader_compiled_separately
|
---|
5156 | - ac,radeonsi: move ps arg pos_fixed_pt to ac_shader_args
|
---|
5157 | - aco: do not eliminate final exec write when p_end_with_regs block
|
---|
5158 | - aco: remove p_end_with_regs from needs_exact()
|
---|
5159 | - aco: add ps prolog generation for radeonsi
|
---|
5160 | - aco: handle ps outputs from radeonsi
|
---|
5161 | - aco: add create_fs_end_for_epilog for radeonsi
|
---|
5162 | - aco,radv: remove unused ps epilog info fields
|
---|
5163 | - aco,radv: rename ps epilog info inputs to colors
|
---|
5164 | - aco: simplify export_fs_mrt_color
|
---|
5165 | - aco,radv: add radeonsi spec ps epilog code
|
---|
5166 | - aco: compact ps epilog color export for radeonsi
|
---|
5167 | - aco,radv,radeonsi: pass spi ps input ena and addr
|
---|
5168 | - aco: do not fix_exports when program has epilog
|
---|
5169 | - aco: fix assertion fail when program contains empty block
|
---|
5170 | - aco: create exit block for p_end_with_regs to branch to
|
---|
5171 | - aco: wait memory ops done before go to next shader part
|
---|
5172 | - radeonsi: reduce sgpr count for scratch_offset when aco
|
---|
5173 | - radeonsi: init spi_ps_input_addr for part mode ps
|
---|
5174 | - radeonsi: extract si_prolog_get_internal_binding_slot
|
---|
5175 | - radeonsi: extract si_get_ps_prolog_args to be shared with aco
|
---|
5176 | - ac,radeonsi: remove unused ps prolog key fields
|
---|
5177 | - radeonsi: add ps prolog shader part build
|
---|
5178 | - radeonsi: extract si_get_ps_epilog_args to be shared with aco
|
---|
5179 | - radeonsi: fill aco shader info for ps part
|
---|
5180 | - radeonsi: add ps epilog shader part build
|
---|
5181 | - radeonsi: enable aco compile for part mode ps
|
---|
5182 | - radeonsi: disable disk cache when use aco
|
---|
5183 |
|
---|
5184 | Rebecca Mckeever (32):
|
---|
5185 |
|
---|
5186 | - vulkan/runtime: Add helper functions for VK_EXT_host_image_copy
|
---|
5187 | - nouveau/codegen: Support nir_intrinsic_load_workgroup_id_zero_base
|
---|
5188 | - nouveau/codegen: Set lower_device_index_to_zero
|
---|
5189 | - nvk: Convert system values for gl_PointCoord and PointCoord into inputs
|
---|
5190 | - nvk: Add base_group to root descriptor table
|
---|
5191 | - nvk: Lower base_workgroup_id
|
---|
5192 | - nvk: Implement nvk_CmdDispatchBase and delete nvk_CmdDispatch
|
---|
5193 | - nvk: Advertise KHR_device_group
|
---|
5194 | - nvk: Add VK_FORMAT_B4G4R4A4_UNORM_PACK16 format to nil_format_info table
|
---|
5195 | - nvk: Add A4B4G4R4 formats to nil_format_info table
|
---|
5196 | - nvk: Advertise EXT_4444_formats
|
---|
5197 | - nvk: Enable shadow sampling
|
---|
5198 | - nvk: Implement VK_EXT_non_seamless_cube_map
|
---|
5199 | - nouveau/nil: Add macros for ufixed
|
---|
5200 | - nvk: Implement VK_EXT_image_view_min_lod
|
---|
5201 | - nvk: Update mutable descriptor struct type
|
---|
5202 | - nvk: Replace asserts with conditional that sets type_list = NULL
|
---|
5203 | - nvk: Implement nvk_GetDescriptorSetLayoutSupport
|
---|
5204 | - nvk: Enable VK_KHR_maintenance3
|
---|
5205 | - nvk: Advertise VK_EXT_mutable_descriptor_type
|
---|
5206 | - nvk: Set image index to zero for NULL nvk_buffer_view
|
---|
5207 | - nvk: Advertise VK_EXT_image_robustness
|
---|
5208 | - nvk: Advertise VK_EXT_robustness2
|
---|
5209 | - nvk: Add view_index to root descriptor table
|
---|
5210 | - nvk: Lower nir_intrinsic_load_view_index
|
---|
5211 | - nvk: Add draw support for multiview
|
---|
5212 | - nvk: Add query support for multiview
|
---|
5213 | - nvk: Add input attachments support for multiview
|
---|
5214 | - nvk: Advertise VK_KHR_multiview
|
---|
5215 | - nvk: Load view_mask to shadow scratch in nvk_CmdBeginRendering
|
---|
5216 | - nvk: Combine CLEAR_VIEWS and CLEAR_LAYERS MME macros
|
---|
5217 | - nvk: Move code inside view mask loops to a helper function
|
---|
5218 |
|
---|
5219 | Rhys Perry (89):
|
---|
5220 |
|
---|
5221 | - ac/llvm: fix AC_TM_CHECK_IR
|
---|
5222 | - radv: fix radv_get_ballot_bit_size with CS
|
---|
5223 | - ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
|
---|
5224 | - ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
|
---|
5225 | - radv: add conformant_trunc_coord to cache UUID
|
---|
5226 | - radv: don't unset TRUNC_COORD if conformant_trunc_coord=true
|
---|
5227 | - ac/nir: always round cube array layers
|
---|
5228 | - nir/unsigned_upper_bound: fix phi(bcsel)
|
---|
5229 | - nir/tests: add test for unsigned_upper_bound with loop header phis
|
---|
5230 | - nir/opt_dead_cf: remove nodes after a jump earlier
|
---|
5231 | - nir/tests: add nir_opt_dead_cf_test.jump_before_constant_if
|
---|
5232 | - aco: insert s_nop before VGPR deallocation
|
---|
5233 | - nir/lower_shader_calls: vectorize stack access for all shaders
|
---|
5234 | - radv: workaround WWZ exporting index=1 through location=1
|
---|
5235 | - radv: correctly skip MRT output NaN fixup for meta shaders
|
---|
5236 | - radv: don't set vertex_attribute_strides on GFX8+
|
---|
5237 | - radv/ci: skip some mesh shader tests on GFX1100
|
---|
5238 | - aco: summarize register demand after handling branches
|
---|
5239 | - aco: don't create sendmsg(dealloc_vgprs) if scratch is used
|
---|
5240 | - radv: disable 64-bit color attachments
|
---|
5241 | - radv: fix 128bpp comp-to-single clears
|
---|
5242 | - radv: support 128bpp comp-to-single with all colors
|
---|
5243 | - radv/gfx11: re-enable 0001/1110 clear values
|
---|
5244 | - nir/lower_shader_calls: fix align_offset
|
---|
5245 | - nir/opt_load_store_vectorize: support scratch access
|
---|
5246 | - radv: vectorize RT stack access
|
---|
5247 | - radv: vectorize scratch access
|
---|
5248 | - aco: fix p_bpermute_gfx6 with input at non-zero byte
|
---|
5249 | - aco: fix p_bpermute_gfx6's exec save/restore with wave32
|
---|
5250 | - aco: clarify bpermute pseudo opcode names
|
---|
5251 | - aco: add adjust_bpermute_dst helper
|
---|
5252 | - aco/spill: skip p_branch in process_block
|
---|
5253 | - aco/spill: add all live-in to merge block spill candidates
|
---|
5254 | - nir/lower_system_values: change num_workgroups to uint32_t
|
---|
5255 | - radv: optimize mesh workgroup ID using ts_mesh_dispatch_dimensions
|
---|
5256 | - radv: use shortcut_1d_workgroup_id
|
---|
5257 | - aco: remove fast path in insert_exec_mask's process_instructions
|
---|
5258 | - aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
|
---|
5259 | - aco: check logical_phi_info at p_logical_end when eliminating exec writes
|
---|
5260 | - aco: remove unused p_logical_end check when optimizing branching sequence
|
---|
5261 | - radv: disable mesh dispatch XYZ_DIM when possible
|
---|
5262 | - nir/deref: remove rematerialize_deref_in_block cache
|
---|
5263 | - aco: reset prefetch in the correct block after removing the exit
|
---|
5264 | - aco/waitcnt: replace wait_cnt::\*_cnt with booleans
|
---|
5265 | - aco/waitcnt: add print helpers
|
---|
5266 | - nir/lower_int64: fix find_lsb(0)
|
---|
5267 | - nir/algebraic: optimize u2u32(a >> 32)
|
---|
5268 | - aco/optimizer_postRA: don't combine DPP across exec on GFX8/9
|
---|
5269 | - aco: don't combine DPP into v_cmpx
|
---|
5270 | - aco: disable zero offset optimization for strict WQM coords
|
---|
5271 | - nir/constant_folding: remove zero texel offset
|
---|
5272 | - aco: remove zero offset optimization
|
---|
5273 | - aco: shrink DPP8_instruction
|
---|
5274 | - aco: add fetch_inactive field to DPP instructions
|
---|
5275 | - nir: add fetch inactive index to quad_swizzle_amd/masked_swizzle_amd
|
---|
5276 | - aco: disable FI for quad/masked swizzle
|
---|
5277 | - aco: fix LdsDirectVMEMHazard WaW with the wrong waitcnt
|
---|
5278 | - aco: only mitigate VcmpxExecWARHazard when necessary
|
---|
5279 | - aco: fix s_setreg hazards
|
---|
5280 | - aco: consider exec_hi in reads_exec()
|
---|
5281 | - aco: resolve all possible hazards at the end of shader parts
|
---|
5282 | - aco/tests: test that hazards are resolved at the end of shader parts
|
---|
5283 | - radv: skip zero-sized memcpy
|
---|
5284 | - ac/nir: fix out-of-bounds access in ac_nir_export_position
|
---|
5285 | - radv: fix signed integer overflow
|
---|
5286 | - Revert "radv: pre-init surface info"
|
---|
5287 | - nir: improve ms_cross_invocation_output_access with local_invocation_id
|
---|
5288 | - aco,nir: add export_row_amd intrinsic
|
---|
5289 | - ac/nir: add row parameter to helpers
|
---|
5290 | - ac/nir: remove dead code
|
---|
5291 | - ac/nir: refactor mesh vertex/primitive export
|
---|
5292 | - ac/nir: implement mesh shader gs_fast_launch=2
|
---|
5293 | - ac/nir: optimize mesh shader local_invocation_index
|
---|
5294 | - radv: implement mesh shader gs_fast_launch=2
|
---|
5295 | - ac/nir: add emit_ms_outputs helper
|
---|
5296 | - ac/nir,radv: pass workgroup size to ac_nir_lower_ngg_ms
|
---|
5297 | - ac/nir: implement mesh shader multi-row export
|
---|
5298 | - radv: implement mesh shader multi-row export
|
---|
5299 | - radv: enable mesh shader gs_fast_launch=2 and multi-row export
|
---|
5300 | - nir/serialize: fix signed integer overflow
|
---|
5301 | - nir/lower_shader_calls: skip zero-sized qsort
|
---|
5302 | - util: skip zero-sized SHA1Update
|
---|
5303 | - radv: call lower_array_deref_of_vec before lower_io_arrays_to_elements
|
---|
5304 | - radv: skip radv_remove_varyings for mesh shaders
|
---|
5305 | - radv: disable gs_fast_launch=2 by default
|
---|
5306 | - docs: fix RADV_THREAD_TRACE_CACHE_COUNTERS default
|
---|
5307 | - radv: add radv_disable_trunc_coord option
|
---|
5308 | - radv: enable radv_disable_trunc_coord for vkd3d-proton/DXVK
|
---|
5309 | - ac/nir: fix partial mesh shader output writes on GFX11
|
---|
5310 |
|
---|
5311 | Rob Clark (60):
|
---|
5312 |
|
---|
5313 | - freedreno: move virtgpu msm_proto.h to common
|
---|
5314 | - freedreno/drm/virtio: Remove unused header
|
---|
5315 | - tu/msm: staticify a couple things
|
---|
5316 | - tu/knl: Remove some random const'ness
|
---|
5317 | - drm-uapi: Update virtgpu header
|
---|
5318 | - freedreno: Update virtgpu proto
|
---|
5319 | - freedreno/drm/virtio: Use global_faults
|
---|
5320 | - tu: close submitqueues before device_finish()
|
---|
5321 | - tu/drm: Factor out shared helpers
|
---|
5322 | - tu/drm: Add missing error path cleanup
|
---|
5323 | - tu/drm: Split out helper for iova alloc
|
---|
5324 | - tu: Add virtgpu support
|
---|
5325 | - util: Decouple disk cache from EGL_ANDROID_blob_cache
|
---|
5326 | - docs: Followup to !24636
|
---|
5327 | - tu: Workaround bionic _SC_LEVEL1_DCACHE_LINESIZE
|
---|
5328 | - ir3+tu: Simplify ir3_find_sysval_regid callers
|
---|
5329 | - freedreno/a6xx: Drop unused screen args
|
---|
5330 | - freedreno/a6xx: Re-work fd6_emit_shader
|
---|
5331 | - freedreno/a6xx: Re-write the function-of-doom
|
---|
5332 | - freedreno: Implement ATI_meminfo
|
---|
5333 | - freedreno/a6xx: ARB_post_depth_coverage
|
---|
5334 | - freedreno/a6xx: ARB_sample_locations
|
---|
5335 | - freedreno/a6xx: ARB_texture_filter_minmax
|
---|
5336 | - freedreno/a6xx: EXT_demote_to_helper_invocation
|
---|
5337 | - freedreno/a6xx: EXT_shader_image_load_formatted
|
---|
5338 | - freedreno/a6xx: EXT_depth_bounds_test
|
---|
5339 | - freedreno/a6xx: Use pipe_blit_info::sample0_only
|
---|
5340 | - freedreno/a6xx: Handle PIPE_BIND_BLENDABLE
|
---|
5341 | - freedreno/a6xx: ARB_shader_viewport_layer_array
|
---|
5342 | - tu: Fix heap size
|
---|
5343 | - freedreno: Fix crash with debug msgs enabled
|
---|
5344 | - freedreno/layout: Handle 565/etc MSAA special case
|
---|
5345 | - freedreno/decode: Fix printing chip-id
|
---|
5346 | - freedreno/a6xx: Add L8_SRGB
|
---|
5347 | - freedreno: Add reformatting commits to .git-blame-ignore-revs
|
---|
5348 | - freedreno/fence: Hold a strong ref to batch
|
---|
5349 | - freedreno/decode: Lookup device info
|
---|
5350 | - freedreno/decode: Use info->chip to decode
|
---|
5351 | - freedreno/decode: Remove gpu_id
|
---|
5352 | - freedreno: Indentation fix
|
---|
5353 | - freedreno: Use explicit QCOM_TILED3 modifier
|
---|
5354 | - freedreno/a6xx: Remove dummy packet for globals
|
---|
5355 | - freedreno: Fix streamout offset_buf dirtiness
|
---|
5356 | - freedreno: Fix user const buffer dirtiness
|
---|
5357 | - freedreno/batch: Move query_buf allocation
|
---|
5358 | - freedreno: Add private-BO tracking
|
---|
5359 | - freedreno: Add missing indirect_draw_count tracking
|
---|
5360 | - freedreno: Move/add some attach_bo()
|
---|
5361 | - freedreno: Add attach-bo debugging
|
---|
5362 | - freedreno: Rework supported-modifiers handling
|
---|
5363 | - mesa: Introduce MESA_texture_const_bandwidth
|
---|
5364 | - mesa: Implement MESA_texture_const_bandwidth
|
---|
5365 | - freedreno: Add PIPE_CAP_HAS_CONST_BW support
|
---|
5366 | - panfrost: Add PIPE_CAP_HAS_CONST_BW support
|
---|
5367 | - iris: Add PIPE_CAP_HAS_CONST_BW support
|
---|
5368 | - radeonsi: Add PIPE_CAP_HAS_CONST_BW support
|
---|
5369 | - tu/msm: Fix timeline semaphore support
|
---|
5370 | - tu/virtio: Fix timeline semaphore support
|
---|
5371 | - freedreno/drm: Fix race in zombie import
|
---|
5372 | - freedreno: Always attach bo to submit
|
---|
5373 |
|
---|
5374 | Robert Foss (9):
|
---|
5375 |
|
---|
5376 | - egl: Expose access to DeviceList
|
---|
5377 | - egl: Rename _eglRefreshDeviceList() to _eglDeviceRefreshList()
|
---|
5378 | - egl: Refresh DeviceList during eglInitialize()
|
---|
5379 | - egl/surfaceless: Use EGL DeviceList instead of drmGetDevices2()
|
---|
5380 | - egl/android: Use EGL DeviceList instead of drmGetDevices2()
|
---|
5381 | - egl: Rename _eglAddDevice() to _eglFindDevice()
|
---|
5382 | - egl: Rename _eglAddDevice() to _eglFindDevice()
|
---|
5383 | - egl: Fix attrib_list[0] == EGL_NONE check
|
---|
5384 | - egl: Always set _EGLDisplay->Device during eglGetPlatformDisplay()
|
---|
5385 |
|
---|
5386 | Robert Mader (6):
|
---|
5387 |
|
---|
5388 | - egl/wayland: wait for compositor to release shm buffers
|
---|
5389 | - iris: Support parameter queries for main planes
|
---|
5390 | - util: Add new helpers for pipe resources
|
---|
5391 | - panfrost: Support parameter queries for main planes
|
---|
5392 | - vc4/resource: Support offset query for multi-planar planes
|
---|
5393 | - v3d/resource: Support offset query for multi-planar planes
|
---|
5394 |
|
---|
5395 | Rohan Garg (33):
|
---|
5396 |
|
---|
5397 | - iris: migrate WA 14013910100 to use the WA framework
|
---|
5398 | - iris: migrate WA 14016118574 to use the WA framework
|
---|
5399 | - iris: fix iris for WA 16013000631
|
---|
5400 | - intel/perf: add perf query support for Intel Raptorlake
|
---|
5401 | - intel/genxml: set a default value for "Pixel Position Offset Enable" in genxml
|
---|
5402 | - anv: use the WA infrastructure where possible when generating state
|
---|
5403 | - anv: use the correct GFX_VERx10 macro for WA
|
---|
5404 | - anv,iris: program the maximum number of threads on compute queue init
|
---|
5405 | - anv: drop CFE state validation checks
|
---|
5406 | - iris: track reset signalling instead of replacing the context
|
---|
5407 | - iris: allow for an unsynchronized device reset query
|
---|
5408 | - anv: partially revert 2e8b1f6d
|
---|
5409 | - anv: emitting 3DSTATE_PRIMITIVE_REPLICATION is required on Gen12+
|
---|
5410 | - anv: use the pre defined _3DPRIMITIVE_DIRECT macro
|
---|
5411 | - anv: drop dead ifdef
|
---|
5412 | - iris: use the correct WA macros and lineage numbers
|
---|
5413 | - anv: use the lineage number for WA
|
---|
5414 | - crocus: add a __gen_get_batch_address declaration
|
---|
5415 | - crocus: fix GFX_VERx10 macro
|
---|
5416 | - blorp: drop undefined macro
|
---|
5417 | - iris: migrate preemption streamout wa to WA infra
|
---|
5418 | - intel/genxml: update PIPE_CONTROL instruction for dg2
|
---|
5419 | - anv: define clear color locally within can_fast_clear_color_att
|
---|
5420 | - intel/compiler: Adjust CS payload registers for new register width on Xe2+
|
---|
5421 | - intel/compiler: Adjust fence message lengths for new register width on Xe2+
|
---|
5422 | - intel/compiler: Adjust barrier emission for Xe2+
|
---|
5423 | - intel/genxml: fix 3DSTATE_3D_MODE length to align with BSpec
|
---|
5424 | - anv: ensure that FCV_CCS_E fast clears are properly tracked
|
---|
5425 | - anv: enable FCV for Gen12.5
|
---|
5426 | - anv: fix debug string for PC flush
|
---|
5427 | - anv: cleanup includes
|
---|
5428 | - anv: turn off non zero fast clears for CCS_E
|
---|
5429 | - anv: selectively enable FCV optimization for DG2
|
---|
5430 |
|
---|
5431 | Roland Scheidegger (1):
|
---|
5432 |
|
---|
5433 | - lavapipe: further limit accurate_a0 hack
|
---|
5434 |
|
---|
5435 | Roman Stratiienko (22):
|
---|
5436 |
|
---|
5437 | - egl: android: Remove legacy name-based shared buffers support
|
---|
5438 | - util: Add NONNULL macro
|
---|
5439 | - android: Introduce the Android buffer info abstraction
|
---|
5440 | - android: Fix num_planes assignment in u_gralloc_fallback
|
---|
5441 | - v3dv/android: Use u_gralloc code
|
---|
5442 | - v3dv/android: Enable shared presentable image support
|
---|
5443 | - v3dv: Migrate to vk_device_memory
|
---|
5444 | - v3dv/android: Skip swapchain binding
|
---|
5445 | - v3dv: Rely on the internal tiled flag instead of the common vk structure
|
---|
5446 | - v3dv/android: Add a helper function to support explicit layouts
|
---|
5447 | - v3dv/android: Rework Android native buffer importing logic
|
---|
5448 | - v3dv: Use format stored in vk_image and vk_image_view after init
|
---|
5449 | - v3dv: Split v3dv_image_init to use layout setting logic separately
|
---|
5450 | - v3dv/android: Add AHardwareBuffer support
|
---|
5451 | - v3dv: Enable VK API v1.2 for Android
|
---|
5452 | - panvk: Add Android ICD loader entry point
|
---|
5453 | - u_gralloc: Remove inline modifiers from the functions
|
---|
5454 | - u_gralloc: Remove usage of NONNULL macro
|
---|
5455 | - Revert "util: Add NONNULL macro"
|
---|
5456 | - u_gralloc: Add a function that returns gralloc type
|
---|
5457 | - dri: Remove __driDriverExtensions leftovers
|
---|
5458 | - v3d: Don't implicitly clear the content of the imported buffer
|
---|
5459 |
|
---|
5460 | Ruijing Dong (2):
|
---|
5461 |
|
---|
5462 | - frontends/va: checking va version for av1enc support
|
---|
5463 | - radeonsi/vcn: change max_poc to fixed value for hevc encoder.
|
---|
5464 |
|
---|
5465 | Ryan Neph (1):
|
---|
5466 |
|
---|
5467 | - vulkan/android: add missed STACK_ARRAY_FINISH()
|
---|
5468 |
|
---|
5469 | Sagar Ghuge (34):
|
---|
5470 |
|
---|
5471 | - intel/compiler: Look at 2 register worth of data instead of 4
|
---|
5472 | - isl: Disable MCS compression just on ACM platform
|
---|
5473 | - intel: Add env variable to add break point on/before draw
|
---|
5474 | - anv: Add GPU breakpoint before/after specific draw call
|
---|
5475 | - iris: Add GPU breakpoint before/after draw call
|
---|
5476 | - blorp: Implement blorp hooks to emit breakpoint
|
---|
5477 | - docs: Add INTEL_DEBUG_BKP_BEFORE/AFTER_DRAW_COUNT
|
---|
5478 | - intel/isl: Enable INTEL_DEBUG=noccs/nohiz in ISL helpers
|
---|
5479 | - anv,hasvk: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
|
---|
5480 | - iris,crocus: drop unnecessary DEBUG_NO_CCS/NO_HIZ checks
|
---|
5481 | - blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
|
---|
5482 | - anv: Add helper to create companion RCS command buffer
|
---|
5483 | - anv: Split out End/Destroy/Reset cmd buffer code into helper
|
---|
5484 | - anv: Handle companion RCS in end/destroy/reset code path
|
---|
5485 | - intel: Add helper to create/destroy i915 VM
|
---|
5486 | - intel: Pass virtual memory address space ID while creating context
|
---|
5487 | - anv: Create companion RCS engine
|
---|
5488 | - anv: Move compute specific bits under compute queue init
|
---|
5489 | - anv: Execute RCS init batch on companion RCS context/engine
|
---|
5490 | - anv: Setup companion RCS command buffer submission
|
---|
5491 | - anv: Execute an empty batch to sync main and companion RCS batch
|
---|
5492 | - anv: Add secondary companion RCS cmd buffer to primary
|
---|
5493 | - anv: Skip layout transition on the compute queue
|
---|
5494 | - anv: Extract batch print code to anv_print_batch helper
|
---|
5495 | - iris: Enable always flush cache with DEBUG_STALL option
|
---|
5496 | - intel/genxml: Add STATE_COMPUTE_MODE instruction
|
---|
5497 | - anv: Program and emit STATE_COMPUTE_MODE
|
---|
5498 | - anv: Enable barrier handling on video engines
|
---|
5499 | - isl: Use 16-bit instead of 8-bits for surface format info fields
|
---|
5500 | - anv: Handle end of pipe with MI_FLUSH_DW on transfer queue
|
---|
5501 | - anv: Enable transfer queue only on ACM+ platforms
|
---|
5502 | - blorp: Use the correct miptail start LOD for surfaces
|
---|
5503 | - anv: Write timestamp using MI_FLUSH_DW on blitter
|
---|
5504 | - anv: Flush data cache while clearing depth using HIZ_CCS_WT
|
---|
5505 |
|
---|
5506 | Saleemkhan Jamadar (1):
|
---|
5507 |
|
---|
5508 | - radeonsi/vcn: set jpeg reg version for gfx 1150
|
---|
5509 |
|
---|
5510 | Samuel Holland (3):
|
---|
5511 |
|
---|
5512 | - Android.mk: Allow building only Vulkan drivers
|
---|
5513 | - Android.mk: Explicitly enable/disable LLVM support
|
---|
5514 | - Android.mk: Only link LLVM for radeonsi, not amd_vk
|
---|
5515 |
|
---|
5516 | Samuel Pitoiset (299):
|
---|
5517 |
|
---|
5518 | - radv: remove support for VK_INDIRECT_COMMANDS_TOKEN_TYPE_STATE_FLAGS_NV
|
---|
5519 | - radv: make radv_get_pa_su_sc_mode_cntl() static
|
---|
5520 | - zink/ci: update list of expected failures for NAVI10
|
---|
5521 | - radv: stop using a pipeline for emitting VGT_VERTEX_REUSE_BLOCK_CNTL
|
---|
5522 | - radv: remove unused param in radv_pipeline_emit_vgt_gs_out()
|
---|
5523 | - radv: pass a shaders array for computing ia_multi_vgt_param
|
---|
5524 | - radv: bind the pre-compiled PS epilog to the cmdbuf state
|
---|
5525 | - radv: stop using an array of binaries when compiling a compute shader
|
---|
5526 | - radv: add radv_compile_cs() to compile a compute shader
|
---|
5527 | - radv: remove the pipeline dependency for creating a GS copy shader
|
---|
5528 | - radv: add a helper to compute the ESGS itemsize
|
---|
5529 | - radv: use the number of GS linked inputs to compute the ESGS itemsize
|
---|
5530 | - radv: determine ES info for VS/TES with GS earlier
|
---|
5531 | - radv: determine as_ls earlier by using the next stage
|
---|
5532 | - radv: simplify getting next VS stage for VS prologs
|
---|
5533 | - radv: use next_stage for determining the stage to lower NGG
|
---|
5534 | - radv/amdgpu: fix dumping CS with the chained IBs path
|
---|
5535 | - radv/amdgpu: rename old_ib to ib in radv_amdgpu_winsys_cs_dump()
|
---|
5536 | - radv: pass submit info to radv_check_gpu_hangs()
|
---|
5537 | - radv: initialize stage/next_stage earlier
|
---|
5538 | - radv: set next_stage to MESA_SHADER_NONE if there is no FS
|
---|
5539 | - radv: rework considering force VRS without relying on graphics pipeline
|
---|
5540 | - radv: stop passing radv_graphics_pipeline to radv_fill_shader_info()
|
---|
5541 | - radv: move removing all varyings when the FS is a noop
|
---|
5542 | - radv: rename graphics pipeline linking helpers
|
---|
5543 | - radv: simplify lowering NGG GS intrinsics
|
---|
5544 | - radv: rework determining the NGG stage without a graphics pipeline
|
---|
5545 | - radv: cleanup pipeline compute emit helpers
|
---|
5546 | - radv: rename radv_pipeline_stage to radv_shader_stage
|
---|
5547 | - radv: rename NGG query state to be more generic
|
---|
5548 | - radv: declare the shader query user SGPR for emulating GS counters
|
---|
5549 | - radv: enable pipelinestat query emulation for legacy GS
|
---|
5550 | - radv: simplify the NGG vs legacy pipelinestat query path
|
---|
5551 | - radv: rename RADV_SHADER_QUERY_PIPELINE_STAT_OFFSET
|
---|
5552 | - radv: implement nir_intrinsic_atomic_add_gs_invocation_count_amd
|
---|
5553 | - radv: emulate GEOMETRY_SHADER_INVOCATIONS query on RDNA1-2
|
---|
5554 | - radv: track whether inputs/outputs are linked per shader stage
|
---|
5555 | - radv: add support for VS/TES as ES without shaders IO linking
|
---|
5556 | - radv: use next_stage to determine if the layer should be exported
|
---|
5557 | - radv: use next stage to determine if primID/clip dist should be exported
|
---|
5558 | - radv: compute the legacy GS info earlier
|
---|
5559 | - radv: stop copying some NIR info fields from TES to TCS
|
---|
5560 | - radv: stop lowering patch vertices for TES
|
---|
5561 | - radv: do not always copy the number of tess patches to TES
|
---|
5562 | - radv: initialize tcs.tes_{patch}_inputs_read to a default value
|
---|
5563 | - radv: prevent linking TCS<->TES when TES is NULL
|
---|
5564 | - radv: use a packed user SGPR for the TES state
|
---|
5565 | - radv: stop checking if patch control points is dynamic everywhere
|
---|
5566 | - radv: copy the number of TCS vertices out to TES shader info
|
---|
5567 | - radv: add support for dynamic TCS vertices out for TES
|
---|
5568 | - radv: remove radv_shader_info::tes::num_linked_patch_inputs
|
---|
5569 | - amd,radeonsi: move si_shader_io_get_unique_index_patch() to common code
|
---|
5570 | - radv: allow to use fixed IO locations for VS<->TCS<->TES without linking
|
---|
5571 | - aco: add aco_shader_info::tcs::has_epilog
|
---|
5572 | - aco: add infra for compiling TCS epilogs
|
---|
5573 | - radv,aco: move has_epilog to radv_shader_info
|
---|
5574 | - radv: assume a TCS needs an epilog unless it's linked with a TES
|
---|
5575 | - radv: do not write tess factors in main TCS when it has an epilog
|
---|
5576 | - radv: track if TES reads tess factors differently
|
---|
5577 | - radv: declare new argument for the TCS epilog PC
|
---|
5578 | - radv: add radv_tcs_epilog_key
|
---|
5579 | - radv: add infra for creating TCS epilogs
|
---|
5580 | - radv: add support for a TCS epilogs cache in the device
|
---|
5581 | - radv: add support for emitting TCS epilogs in cmdbuf
|
---|
5582 | - radv: remove unnecessary check in radv_pipeline_nir_to_asm()
|
---|
5583 | - radv: stop passing a graphics pipeline to radv_pipeline_nir_to_asm()
|
---|
5584 | - radv: inline radv_pipeline_get_nir() in radv_graphics_pipeline_compile()
|
---|
5585 | - radv: add a struct for the retained shaders and GPL
|
---|
5586 | - radv: add radv_graphics_shaders_compile() to compile graphics shaders
|
---|
5587 | - radv: remove redundant check in radv_cmd_buffer_after_draw()
|
---|
5588 | - radv: track if patch control points is dynamic from the cmdbuf state
|
---|
5589 | - radv: re-emit binning state if the framebuffer is dirty
|
---|
5590 | - radv: track if vertex binding stride is dynamic from the cmdbuf state
|
---|
5591 | - vulkan: bump header register to 1.3.261
|
---|
5592 | - vulkan/runtime: add common implementation for GetImageSubresourceLayout()
|
---|
5593 | - vulkan/format: add VK_FORMAT_{A8_UNORM,A1B5G5R5_UNORM_PACK16}_KHR
|
---|
5594 | - radv: use the RT prolog scratch size directly for tracing rays
|
---|
5595 | - radv: add a helper to get the maximum number of scratch waves per shader
|
---|
5596 | - radv: update the number of scratch waves for RT prolog at bind time
|
---|
5597 | - radv: update cmdbuf scratch size info when shaders are bound
|
---|
5598 | - vulkan: add init/finish helpers for vk_buffer_view
|
---|
5599 | - radv: use vk_buffer_view
|
---|
5600 | - radv: use vk_sampler
|
---|
5601 | - radv: use common vkCmdBegin/EndQuery wrappers
|
---|
5602 | - radv: use vk_query
|
---|
5603 | - zink: fix setting VkShaderCreateInfoEXT::nextStage
|
---|
5604 | - radv/rt: fix capture/replay support
|
---|
5605 | - vulkan/render_pass: add common vkGetRenderingAreaGranularityKHR()
|
---|
5606 | - radv: implement vkCmdBindIndexBuffer2KHR()
|
---|
5607 | - radv: allow VK_WHOLE_SIZE for pSizes in vkCmdBindVertexBuffers2()
|
---|
5608 | - radv/rmv: remove unused pipeline create flags when logging pipelines
|
---|
5609 | - radv: store pipeline create flags to radv_pipeline::create_flags
|
---|
5610 | - radv: add support for VkPipelineCreateFlags2CreateInfoKHR
|
---|
5611 | - radv: add support for VkBufferUsageFlags2CreateInfoKHR
|
---|
5612 | - radv: allow VK_REMAINING_ARRAY_LAYERS with VkImageSubresourceLayers
|
---|
5613 | - radv: implement radv_Get{Device}ImageSubresourceLayout2KHR()
|
---|
5614 | - radv: advertise VK_KHR_maintenance5
|
---|
5615 | - radv: remove useless NULL for pipeline layout during shader info pass
|
---|
5616 | - radv: introduce radv_shader_layout for per-stage descriptor layout
|
---|
5617 | - radv: stop passing redundant stage to radv_shader_stage_init()
|
---|
5618 | - radv: re-introduce radv_pipeline_stage_init()
|
---|
5619 | - radv: add support for loading the LSHS vertex stride from a SGPR
|
---|
5620 | - radv: use the number of VS outputs for computing the tessellation info
|
---|
5621 | - vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
|
---|
5622 | - radv: reduce TCS_OFFCHIP_LAYOUT_NUM_PATCHES to 6-bits
|
---|
5623 | - radv: add missing comment about TCS_OFFCHIP_LAYOUT_LSHS_VERTEX_STRIDE
|
---|
5624 | - radv: fix emitting TCS epilogs for GFX6-9
|
---|
5625 | - radv: remove radv_cmd_buffer::cached_vertex_formats
|
---|
5626 | - radv: remove unused param from radv_pipeline_init_multisample_state()
|
---|
5627 | - radv: simplify declaring VS specific input SGPRs
|
---|
5628 | - radv: stop copying if VS or TES uses the InvocationID built-in
|
---|
5629 | - Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings"
|
---|
5630 | - Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build"
|
---|
5631 | - radv/amdgpu: allow to execute external IBs on the compute queue
|
---|
5632 | - radv/amdgpu: add support for submitting external IBs with the chained path
|
---|
5633 | - zink/ci: update list of expected failures for NAVI10
|
---|
5634 | - radv: use the maximum possible workgroup size for TCS epilogs
|
---|
5635 | - radv: stop declaring the scratch offset argument for TCS epilogs
|
---|
5636 | - radv: declare shader arguments for TCS epilogs
|
---|
5637 | - radv: add tcs_out_patch_fits_subgroup to radv_tcs_epilog_key
|
---|
5638 | - aco: fix jumping from main TCS to epilog on GFX9+
|
---|
5639 | - aco: adjust TCS epilogs for RADV
|
---|
5640 | - aco: allow SGPRs operands with p_jump_to_epilog
|
---|
5641 | - aco: implement create_tcs_jump_to_epilog()
|
---|
5642 | - radv: track the pipeline bind point for indirect commands layout
|
---|
5643 | - radv: prepare radv_get_sequence_size() for DGC compute
|
---|
5644 | - radv: prepare radv_prepare_dgc() for DGC compute
|
---|
5645 | - radv: implement NV_device_generated_commands_compute
|
---|
5646 | - radv: allow DGC on the compute queue
|
---|
5647 | - radv: advertise NV_device_generated_commands_compute
|
---|
5648 | - aco: rework printing shader stages
|
---|
5649 | - radv: fix the per-patch data offset when TES isn't linked with TCS
|
---|
5650 | - radv: stop declaring unused SGPR arguments for PS epilogs
|
---|
5651 | - radv: add radv_shader_info::is_monolithic
|
---|
5652 | - radv: use info->uses_view_index directly when declaring shader arguments
|
---|
5653 | - radv: do not inline push constants for non-monolithic shaders
|
---|
5654 | - radv: force indirect descriptor sets for non-monolithic shaders
|
---|
5655 | - radv: always declare some arguments for non-monolithic VS/TCS shaders
|
---|
5656 | - radv: add a new shader argument for non-monolithic shaders PC
|
---|
5657 | - ac: allow to mark shader arguments as preserved
|
---|
5658 | - radv: preserve shader arguments for non-monolithic VS/TCS on GFX9+
|
---|
5659 | - aco: disable shared VGPRs for non-monolithic shaders on GFX9+
|
---|
5660 | - aco: ensure to initialize exec manually for VS as LS on GFX9+
|
---|
5661 | - aco: add support for compiling VS+TCS separately on GFX9+
- radv: always declare some arguments for non-monolithic {VS,TES}/GS shaders
- radv: preserve shader arguments for non-monolithic {VS,TES}/GS on GFX9+
- aco: ensure to initialize exec manually for non-monolithic {VS,TES}/GS on GFX9+
- aco: add support for compiling {VS,TES}+GS separately on GFX9+
- radv,aco: remove unused clip/cull distances variables
- radv: rename tcs_shader to tcs in radv_emit_tcs_epilog_state()
- radv: small cleanups in radv_emit_patch_control_points()
- radv: fix emitting TCS epilogs if TES and GS are linked on GFX9+
- radv: remove the pipeline dependency for emitting VGT_GS_MODE
- aco: fix emitting TCS epilogs end on GFX9+
- radv: re-order IO slot layout for stages that aren't linked
- amd/ci: update list of failures/flakes for glcts-vangogh-valve
- ci: uprev vkd3d-proton
- ci: uprev Fossilize
- ci: add comment explaining which image tags to update for Fossilize
- radv: preserve shader argument for separate compilation of NGG shaders
- aco: flag blocks with long-jump as export_end for separate compilation
- aco: adjust fix_exports() for VS/TES as NGG and non-monolithic shaders
- aco: allow separate compilation of NGG shaders
- zink/ci: add zink-radv-polaris10-valve
- radv/ci: re-enable vkcts-polaris10-valve
- radv: fix capturing indirect dispatches with SQTT
- radv/ci: re-enable vkd3d-polaris10-valve
- ci: do not fail vkd3d-proton job when the expectations match
- radv/amdgpu: fix executing secondaries without IB2
- radv/amdgpu: do not copy the original chain link for IBs
- radv: avoid emitting SQTT markers for DGC calls
- radv: add support for DGC with SQTT
- zink/ci: merge GLCTS testing with GLESx for RADV
- zink/ci: merge piglit testing with deqp-runner for RADV
- radv: fix interactions with primitives generated queries and pipeline stats
- radv: skip DGC calls when the indirect sequence count is zero with a predicate
- radv: avoid emitting THREAD_TRACE_MARKER for predicated draws/dispatches
- radv: adjust next stage for VS prologs and merged shaders compiled separately
- radv: adjust emitted prolog regs for merged shaders compiled separately
- radv: do not use pre-compiled prologs when VS is compiled separately
- radv: remove useless PIPELINE_CREATE_2_LIBRARY_BIT check for retained shaders
- radv: fix enabling DGCC
- radv: fix emitting SQTT userdata when CAM is needed
- radv: fix capturing RGP on RDNA3 with more than one Shader Engine
- zink/ci: update list of expected failures for POLARIS10/NAVI10
- radv: set THREAD_TRACE_TOKEN_MASK.BOP_EVENTS_TOKEN_INCLUDE on GFX10.3+
- radv: disable unsupported hw shader stages for RGP on GFX11+
- radv: fix instruction timing on GFX11
- ac/rgp: use correct API stage string for mesh/task shaders
- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
- radv: emit relocation for mesh/task shaders
- issue_templates/Bug Report: fix outdated URL for GFXReconstruct
- ac,radv,radeonsi: rework SPM counters configuration and share it
- ac/perfcounter: add new SQ_WGP block for GFX11+
- ac/spm: add SPM counters configuration for GFX11
- radv: enable the PKT3 CAM bit for some SPM register writes
- radv,radeonsi: use AC_SPM_SEGMENT_TYPE_xxx instead of magic values
- ac/spm: remove useless SPM block setting for GFX9 and older GPUs
- ac/spm: add SPM block definition for GFX10-GFX10.3
- ac/gpu_info: init num_cu_per_sh from the kernel
- ac/perfcounter: set the number of instances of GL1C to 4
- ac/perfcounter: compute the number of global instances of TCP,SQ,GL1C and GL2C
- ac/spm: fix checking if the counter instance is valid
- ac/spm: rework how segment muxsel RAM are filled
- ac/spm: initialize and set instance mapping for counters
- radv: reserve more CS space in SQTT/SPM paths
- ac/spm: use block flags to initialize instance mapping
- ac/spm: select correct segment type for per-SE blocks
- radv,radeonsi: make sure to emit GRBM_GFX_INDEX before SQ select registers
- ac/spm: fix number of instances of GL2C
- ac,radv,radeonsi: prepare support for multi-instance SPM SQ counters
- ac,radv,radeonsi: prepare support for multi-instance SPM generic counters
- ac/spm: move the counter instance to ac_spm_counter_create_info
- ac/spm: enable support for multi-instance counters
- radv: fix checking if RGP is enabled with others tracing tools
- radv: fix missing ISA with RGP and GPL
- ac/perfcounter: add SG_WQP group for GFX11
- ac/perfcounter: add GFX11 groups
- drirc: remove Path of Exile workarounds
- radv: remove drirc workarounds for Path Of Exile
- radv: remove absolute_depth_bias workaround
- ac/gpu_info: define AMD_MAX_WGP
- ac/spm: add new segment types for GFX11
- ac/spm: add support for GFX11
- radv: add SPM support for GFX11
- radv: enable cache counters for RGP on GFX11
- ci: update to vulkan-cts-1.3.6.3
- radv/ci: skip dEQP-VK.robustness.* on Vangogh due to weird GPU hangs
- nir: rename atomic_add_gs_invocation_count_amd to make it more generic
- ac/nir: add lowering for mesh shader queries
- ac/nir: add lowering for task shader queries
- radv: add GDS counters offset for mesh/task queries
- radv: adjust lowering of intrinsic queries for mesh/task shaders
- radv: enable lowering of mesh/task shader queries when enabled
- radv: declare shader_query_state for mesh/task shaders
- radv: stop skip emitting CB states when there is no color attachment
- radv: re-enable DCC with mipmaps on GFX11
- radv: fix COMPUTE_SHADER_INVOCATIONS query on compute queue
- radv: emit missing PA_{SC,SU}_LINE_STIPPLE_xxx regs in gfx preamble
- radv: fix alignment of DGC command buffers
- radv/ci: update list of expected failures on PITCAIRN
- radv/ci: update list of flakes for NAVI10/VEGA10
- radv/amdgpu: fix alignment of command buffers
- radv: enable DCC for MSAA images on GFX11
- zink/ci: update list of expectations for zink-anv-tgl
- zink/ci: bump zink-anv-tgl-full timeout to 1h45m
- radv/ci: rename GFX1100 lists to NAVI31
- radv: fix emulated geometry shader primitives/invocations queries
- radv/ci: remove duplicate skipped tests for RAVEN/STONEY
- radv/ci: exclude dEQP-VK.texture.explicit_lod.2d.sizes.128x128_* for all jobs
- radv: fix synchronization with emulated GS primitives/invocations queries
- radv/ci: remove no longer existing test for VANGOGH
- radv/ci: cleanup list of expected failures for NAVI10/NAVI21/VEGA10
- radv: always write the sample positions when a new descriptor BO is created
- radv: fill the scratch BO in radv_fill_shader_rings()
- radv: fix gang submissions with chaining
- radv: fix re-emitting streamout descriptors for NGG streamout
- radv: fix IB alignment
- zink: use warn_missing_feature for missing modifier support
- radv: fix destroying GDS/OA BOs
- radv: allocate only 1 GDS OA counter for gfx10 NGG streamout
- ac/nir: only consider overflow for valid feedback buffers
- radv/ci: update list of expected failures on RAVEN
- radv/ci: update list of flakes for VANGOGH
- radv/ci: update list of flakes for STONEY
- radv: disable primitive restart for non-indexed draws on GFX11
- radv: enable radv_disable_aniso_single_level=true for Zink too
- amd/llvm,aco,radv: implement NGG streamout with GDS_STRMOUT registers on GFX11
- radv: mark GDS as needed for XFB queries with NGG streamout on GFX11
- radv: skip GDS allocation for NGG streamout on GFX11
- zink/ci: remove expected failures that are skipped for RADV
- ci: update CTS to vulkan-cts-1.3.7.0
- ci: bump the number of tests per group from 500 to 5000 for Vulkan drivers
- ci: bump DEQP_FRACTION for some jobs
- radv: set ENABLE_PING_PONG_BIN_ORDER for GFX11.5
- radv: initialize video decoder for GFX11.5
- ac/gpu_info: query the maximum number of IBs per submit from the kernel
- Revert "radv: fix finding shaders by PC"
- radv: fix missing predicate bit for WRITE_DATA helper
- ac/gpu_info: fix querying the maximum number of IBs per ring
- radv: remove outdated RADV_DEBUG=vmfaults support
- amd: update amdgpu_drm.h
- amd: add has_gpuvm_fault_query
- radv/amdgpu: add support quering the last GPUVM fault
- radv: query and report the last GPUVM fault with RADV_DEBUG=hang
- radv: report the last GPUVM fault when a device lost is detected
- ac/gpu_info: remove bogus assertion about number of COMPUTE/SDMA queues
- radv: fix a synchronization issue with primitives generated query on RDNA1-2
- radv: bind the non-dynamic graphics state from the pipeline unconditionally
- radv: fix compute shader invocations query on compute queue on GFX6
- radv: emit COMPUTE_PIPELINESTAT_ENABLE for CS invocations on ACE
- nir: fix inserting the break instruction for partial loop unrolling
- radv: fix registering queues for RGP with compute only
- radv: set radv_zero_vram=true for Unreal Engine 4/5
- radv: fix a descriptor leak with debug names and host base descriptor set
- radv: add a missing async compute workaround for Tonga/Iceland
- radv: disable TC-compatible HTILE on Tonga and Iceland
- radv: set radv_invariant_geom=true for War Thunder
- radv: do not set OREO_MODE to fix rare corruption on GFX11

Saroj Kumar (4):

- radeonsi: Add perfetto support in radeonsi
- radeonsi: Add u_trace init code in radeonsi
- radeonsi: Add tracepoints in radeonsi driver
- radeonsi: fixes compilaton error when perfetto is disabled

Sathishkumar S (2):

- radeonsi/vcn: support variable number of bs_bufs
- radeonsi/vcn: num bs_bufs must be proportional to num jpeg engines

Semjon Kravtsenko (1):

- glx: Assign unique serial number to GLXBadFBConfig error

Seppo Yli-Olli (1):

- zink: Fix SyntaxWarning in zink_extensions script

Sergi Blanch Torne (7):

- Introduce ci-kdl builder and launcher.
- Integrate ci-kdl in the building process and launch process.
- ci: disable Collabora's LAVA lab for maintance
- Revert "ci: disable Collabora's LAVA lab for maintance"
- Revert "ci: disable Collabora's LAVA lab for maintance"
- ci: disable Collabora's LAVA lab for maintance
- Revert "ci: disable Collabora's LAVA lab for maintance"

Sid Pranjale (1):

- nvk: Enable VK_EXT_load_store_op_none

Sil Vilerino (20):

- util: Blake3 - Identify arm64ec as aarch64 instead of x64
- d3d12: Fix Map/Unmap of YUV resources
- d3d12: Fix H264 interlaced decode
- d3d12: Video Decode - Remove unnecessary copy for texture array case
- util/vl_vlc: Use UINT64_MAX instead of ~0UL with MSVC compiler
- d3d12: Extend video screen AV1 encode tile support checking
- aux/tc: Add ASSERTED to unreferenced release build variable
- d3d12: Video - Relax ID3D12VideoDevice QI version for decode, process
- frontends/va: Add profile param when querying PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
- d3d12: Upgrade to D3D12 Agility SDK 1.611 Video interface
- d3d12: Fixes AV1 tx_mode_support reporting and unsupported tx_mode overriding
- d3d12: Video Decode - Wait for GPU completion before destroying decoder in-flight objects
- d3d12: Do not destroy codec when destroying video buffer
- d3d12: AV1 encode - Add lower resolution fallback check for uniform tile support
- d3d12: AV1 encode - add fallback for app passing unsupported pic_params.InterpolationFilter
- d3d12: AV1 Encode - Fix VAConfigAttribEncMaxRefFrames reporting
- frontend/va: Add support for VAConfigAttribEncMaxTileRows/Cols
- d3d12: Add support for PIPE_VIDEO_CAP_ENC_MAX_TILE_ROWS/COLS
- d3d12: Allocate d3d12_video_buffer with higher alignment for compatibility
- d3d12: d3d12_video_buffer_create_impl - Fix resource importing

Simon Ser (7):

- wayland: enable use of wayland-protocols as a subproject
- vulkan/wsi/wayland: add support for IMMEDIATE
- vulkan/wsi/wayland: fix unset present_mode
- radv/winsys: check amdgpu_create_bo_from_user_mem() for EINVAL
- egl: extract EGLDevice setup in dedicated function
- egl: move dri2_setup_device() after dri2_setup_extensions()
- egl: ensure a render node is passed to _eglFindDevice()

Simon Zeni (1):

- nouveau/winsys: use mmap instead of mmap64 in nouveau_bo

SoroushIMG (1):

- pvr: fix mipmap size calculation for bc formats

Sviatoslav Peleshko (9):

- dri: Use RGB internal formats for RGBX formats
- intel/isl: Don't over-allocate CLEAR_COLOR size to use whole cache line
- anv: Do fast clear color initialization more delicately
- zink: Change zink_vertex_elements_hw_state::b.strides to VkDeviceSize
- intel/fs: Check if the whole ubo load range is in the push const range
- zink: Store zink_vertex_elements_hw_state::b.strides by binding id
- intel/fs: Fix "packed word exception" condition for register regioning
- intel/eu/validate: Validate "packed word exception" stricter
- nir/loop_analyze: Fix inverted condition handling in iterations calculation

Sylvain Munaut (9):

- egl/dri2: Add a couple of missing mutex release in error path
- mesa: Enable ARB_texture_border_clamp in GL Core
- include: Fix the PFN declarations to be pointers as they should
- glx: Add missing MesaGLInteropGLXFlushObjects
- glx: Export the MESA GL Interop functions through glXGetProcAddress
- egl: Export the MESA GL Interop functions through eglGetProcAddress
- glx: Remove MESA_depth_float_bit from enum
- glx: Advertise GLX_MESA_gl_interop extension if support present
- egl: Advertise EGL_MESA_gl_interop extension if support present

Tapani Pälli (34):

- intel/blorp: add a new flag to communicate PSS sync need
- anv: implement required PSS sync for Wa_18019816803
- iris: implement required PSS sync for Wa_18019816803
- vulkan/runtime: change assert to match specification needs
- anv: remove assert, size is asserted in the runtime
- anv: refactor batch_set_preemption to use batch_emit_pipe_control
- anv: implement a dummy depth flush for Wa_14016712196
- iris: implement a dummy depth flush for Wa_14016712196
- mesa: fix some TexParameter and SamplerParameter cases
- mesa: remove GL_UNSIGNED_BYTE as supported for snorm reads
- ci: add a fix for KHR-GLES3.packed_pixels.*snorm tests
- anv: implement Wa_14018912822
- iris: implement Wa_14018912822
- driconf: use lower_depth_range_rate for The Spirit and The Mouse
- mesa: disable snorm readpix clamping with EXT_render_snorm
- iris: modify Wa_14014414195 to use intel_needs_workaround
- mesa: some cleanups for texparam extension checks
- iris: avoid issues with undefined clip distance
- crocus: avoid issues with undefined clip distance
- anv: refactor to fix pipe control debugging
- anv: fix a leak of fp64_nir shader
- iris: use intel_needs_workaround for Wa_14014414195 part 2
- iris: correct dst alpha blend factor in Wa_14018912822
- iris/anv: move Wa_14018912822 as a drirc workaround
- iris: flush data cache when flushing HDC on GFX < 12
- anv: HDC flush is available only for GFX_VER 12+
- iris: HDC flush is available only for GFX_VER 12+
- intel/genxml: remove HDC from gen11.xml, it is not available
- mesa/st: ignore StencilSampling if stencil not part of the format
- intel/dev: expand existing fix for all gfx12 with small EU count
- egl: fix leaking drmDevicePtr in _eglFindDevice
- iris: add data cache flush for pre hiz op
- anv/drirc: add option to disable FCV optimization
- drirc: Set limit_trig_input_range option for Valheim

Tatsuyuki Ishi (8):

- radv/amdgpu: Remove unused bo_list variable from cs_submit.
- radv/winsys: Remove unused struct radv_winsys_bo_list.
- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
- radv: Fix IB size for RADV_DEBUG=hang.
- radv: Fix dumping vertex descriptors with RADV_DEBUG=hang.
- radv/amdgpu: Use rwlock to protect access to virtual BOs.
- zink: Fix missing sparse buffer bind synchronization.
- zink: Fix waiting for texture commit semaphores.

Thomas H.P. Andersen (65):

- tgsi: remove unused tgsi_shader_info.num_tokens
- tgsi: remove unused tgsi_shader_info.array_max
- tgsi: remove unused tgsi_shader_info.num_memory_instructions
- tgsi: remove unused tgsi_shader_info.colors_read
- tgsi: remove unused tgsi_shader_info.colors_written
- tgsi: remove unused tgsi_shader_info.reads_position
- tgsi: remove unused tgsi_shader_info.reads_samplemask
- svga: remove unused struct field
- tgsi: remove unused tgsi_shader_info.reads_tess_factors
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info.uses_drawid
- tgsi: remove unused tgsi_shader_info fields
- tgsi: remove unused tgsi_shader_info.uses_subgroup_info
- tgsi: remove unused tgsi_shader_info.writes_primid
- tgsi: remove unused tgsi_shader_info.uses_doubles
- tgsi: remove unused tgsi_shader_info.uses_derivatives
- tgsi: remove unused tgsi_shader_info.uses_bindless_samplers
- tgsi: remove unused tgsi_shader_info.uses_bindless_images
- tgsi: remove unused tgsi_shader_info.clipdist_writemask
- tgsi: remove unused tgsi_shader_info.culldist_writemask
- tgsi: remove unused tgsi_shader_info.images_load
- tgsi: remove unused tgsi_shader_info.images_store
- tgsi: remove unused tgsi_shader_info.images_atomic
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_load
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_store
- tgsi: remove unused tgsi_shader_info.uses_bindless_buffer_atomic
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_load
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_store
- tgsi: remove unused tgsi_shader_info.uses_bindless_image_atomic
- tgsi: remove unused tgsi_shader_info.indirect_files_read
- tgsi: remove unused tgsi_shader_info.indirect_files_written
- tgsi: remove unused tgsi_shader_info.const_buffers_indirect
- tgsi: remove unused tgsi_shader_info.max_depth
- tgsi: drop two unused functions
- nvk: use common physical device enumeration
- nvk: fix implicit-fallthrough warnings with clang
- nvk: delete commented code
- nvk: fix mem leaks
- nvk: use common descriptor set layout code
- nvk: use common pipeline layout code
- nvk: advertise KHR_shader_non_semantic_info
- nvk: advertise KHR_image_format_list
- nvk: advertise EXT_private_data
- nvk: advertise KHR_sampler_mirror_clamp_to_edge
- nvk: KHR_descriptor_update_template
- nvk: CmdPushDescriptorSetWithTemplateKHR
- nvk: drop dead assignment
- nvk: drop dead assignment
- nvk: fix initialization override
- nvk: sort extensions
- nvk: advertize KHR_relaxed_block_layout
- nvk: add check for VK_IMAGE_CREATE_2D_VIEW_COMPATIBLE_BIT_EXT
- nvk: advertise EXT_image_2d_view_of_3d
- nvk: fix maxPushDescriptors
- nvk: call correct macro to clear views
- nouveau/mme: use fermi enum in fermi builder
- nvk: add warning on non-nouveau drm driver
- nvk: Implement VK_KHR_draw_indirect_count on Turing+
- nvk: set device info before use in nvk_get_device_extensions
- nvk: simplify code by using new helpers
- nvk: remove duplicated device features
- nvk: EXT_conditional_rendering
- nvk: advertise VK_EXT_tooling_info
- nvk: set optimization level to 3

Thong Thai (3):

- radeonsi: enable vcn encoder rgb input support
- Update radeon_vcn_enc.c
- frontends/va/config: report max width and height for encoding/decoding

Timothy Arceri (27):

- glsl: fix validation of ES vertex attribs
- nir/opt_copy_prop_vars: don't clone copies if branch empty
- nir/opt_copy_prop_vars: speedup cloning of copy tables
- nir/opt_copy_prop_vars: remove var hash entry on kill alias
- nir/opt_copy_prop_vars: skip cloning of copies arrays until needed
- nir/opt_copy_prop_vars: drop reuse of dynamic arrays
- glsl: fix spirv sso validation
- glsl: mark structs containing images as bindless
- util: add radeonsi workaround for Nowhere Patrol
- glsl: fix out params in glsl to nir
- glsl_to_nir: add more unhandled function types
- nir: replace use of nir_src_copy()
- nir: remove unused nir_src_copy()
- nir: remove unused param from nir_alu_src_copy()
- glsl: remove field from gl_shader_program
- glsl: move get_varying_type() declaration earlier
- glsl: add nir version of validate_first_and_last_interface_explicit_locations()
- glsl: switch to nir validate_first_and_last_interface_explicit_locations()
- glsl: remove unused validate_first_and_last_interface_explicit_locations()
- nir: fix typo in comment
- nir: copy explicit_invariant flag to nir vars
- glsl: move interpolation_string() to linker_util
- glsl: move is_gl_identifier() to linker_util
- nir: add used field to nir variables
- glsl: implement cross_validate_outputs_to_inputs() in nir linker
- glsl: switch to nir linkers cross_validate_outputs_to_inputs()
- glsl: remove now unused varying linker code

Timur Kristóf (39):

- aco: Fix subgroup_id intrinsic on GFX10.3+.
- ac/nir: Simplify arg unpacking when shift is zero.
- ac/nir: Add new pass to lower intrinsics to shader args.
- radv: Move radv_select_hw_stage to radv_shader_info.
- radv: Use ac_nir_lower_intrinsics_to_args.
- radeonsi: Move si_select_hw_stage to si_shader_info.
- radeonsi: Use ac_nir_lower_intrinsics_to_args.
- aco: Remove subgroup_id and num_subgroups intrinsics.
- ac/llvm: Remove subgroup_id and num_subgroups intrinsics.
- aco: Refactor select_program to smaller functions.
- nir/opt_dead_cf: Remove if branches with undef condition.
- ac/nir: Add done arg to ac_nir_export_position.
- ac/nir: Slightly refactor how pos0 exports are added when missing.
- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
- ac/nir/ngg: Refactor mesh shader primitive export.
- ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
- ac/nir/ngg: Extract nogs_export_vertex_params function.
- ac/gpu_info: Add some SDMA related information.
- ac: Clarify SDMA opcode defines.
- ac: Add amd_ip_type argument to ac_parse_ib and ac_parse_ib_chunk.
- ac: Rename ac_do_parse_ib to parse_pkt3_ib.
- ac: Print IP type for IBs.
- ac: Add rudimentary implementation of printing SDMA IBs.
- radv: Rename SDMA file to radv_sdma.c
- radv: Use const device argument in radv_sdma_copy_buffer.
- radv: Use const on vi_alpha_is_on_msb arguments.
- radv: Only call si_cp_dma_wait_for_idle on GFX and ACE queues.
- radv: Move radv_cp_wait_mem to radv_cs.h and add queue family argument.
- radv: Refactor WRITE_DATA helper function.
- radv: Use new WRITE_DATA helper in more places.
- radv: Add queue family argument to some functions.
- radv: Wait for bottom of pipe in ACE gang wait postamble.
- radv: Simplify gang CS and semaphore initialization.
- radv: Allow gang submit use cases other than task shaders.
- radv: Slightly refactor gang semaphore functions.
- radv: Add gang follower semaphore functions.
- radv: Support SDMA in radv_cs_write_data_head.
- radv: Support SDMA in radv_cp_wait_mem.
- radv: Support SDMA in si_cs_emit_write_event_eop.

Vignesh Raman (4):

- ci: add Vignesh Raman into restricted traces access list
- Do explicit cast to suppress clang warnings
- ci: enforce -Wimplicit-const-int-float-conversion for clang
- ci: Uprev crosvm

Vinson Lee (8):

- nvk: Fix assert
- lavapipe: Fix struct initialization
- intel/decoder: Fix memory leak on error path
- nv50: Remove unused value
- vk/wsi/x11: Remove dead code
- freedreno/replay: Fix implicit-function-declaration error
- anv: Fix transfer type assert
- broadcom/qpu: Remove duplicate variable opcode

Vitaliy Triang3l Kuzmin (3):

- r600/asm: Fix AR force_add_cf setting if a clause is not open
- r600/asm: Make sure MOVA and SET_CF_IDX are in the same clause
- r600: Replace R600_BIG_ENDIAN with UTIL_ARCH_BIG_ENDIAN

Vlad Schiller (15):

- pvr: Implement VK_EXT_tooling_info
- pvr: Add 'info' PVR_DEBUG flag
- pvr: Implement VK_KHR_format_feature_flags2
- pvr: Remove PVR_WINSYS_BO_FLAG_ZERO_ON_ALLOC flag
- pvr: Add VK_KHR_driver_properties
- pvr: Use correct index when writing query availability data
- pvr: Enable VK_EXT_scalar_block_layout
- pvr: Enable KHR_image_format_list
- pvr: Enable VK_KHR_uniform_buffer_standard_layout
- pvr: Implement VK_KHR_external_fence
- pvr: Implement VK_KHR_external_semaphore
- pvr: Enable VK_KHR_bind_memory2 extension
- pvr: Implement VK_EXT_texel_buffer_alignment
- pvr: Implement VK_EXT_host_query_reset
- pvr: Fix VK_EXT_texel_buffer_alignment

WinLinux1028 (1):

- radeonsi: prefix function with si\_ to prevent name collision

Xaver Hugl (1):

- vulkan wsi: add support for PresentOptionAsyncMayTear

Yiwei Zhang (46):

- venus: handle query feedback creation failure
- venus: ensure consistency of query overflow behavior
- venus: add a missing barrier before copying query feedback
- venus: refactor query feedback cmd record
- venus: reduce to use 4K mem suballoc align on platforms known to fit
- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
- lvp: avoid reading immutable sampler from desc write info
- ci/venus: update venus-lavapipe expectations
- venus: fix a cmd builder render_pass state leak across reset
- venus: fix cmd state leak across implicit reset
- venus: log and doc the broken query feedback in suspended render pass
- venus: move transient storage from cmd to pool
- venus: remove redundant fb tracking from cmd builder
- venus: use tracked queue_family_index from the cmd pool
- venus: cleanup vn_cmd_begin_render_pass usage
- venus: add helpers to track subpass view mask
- venus: avoid redundant tracking of render pass
- venus: refactor more cmd states into cmd builder
- venus: use in_render_pass to skip present_src counting
- ci/venus: remove fixed tests that no longer run
- ci/venus: reenable pipeline cts
- venus: suppress a false logging
- venus: add no_sparse debug option to disable sparse resource support
6186 | - venus: set deviceMemoryReport feature
|
---|
6187 | - venus: expose at least one cached memory type
|
---|
6188 | - venus: expose KHR_external_fence/sempahore_fd extensions
|
---|
6189 | - venus: fix a device memory report leak
|
---|
6190 | - vulkan: remove a dup entry from vk_image_usage_to_ahb_usage
|
---|
6191 | - vulkan/android: improve vkQueueSignalReleaseImageANDROID
|
---|
6192 | - vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
|
---|
6193 | - vulkan/android: drop vk_buffer dependency from common AHB impl
|
---|
6194 | - venus: use common vk_queue object
|
---|
6195 | - venus: use common ANB implementation
|
---|
6196 | - venus: use more common vk_queue related implementations
|
---|
6197 | - venus: drop device, family, index, flags tracking from vn_queue
|
---|
6198 | - venus: fix re-export of imported classic 3d resources
|
---|
6199 | - venus: remove redundant bo roundtrip and add more docs
|
---|
6200 | - venus: track VkPhysicalDeviceMemoryProperties instead
|
---|
6201 | - venus: refactor vn_device_memory to prepare for async alloc
|
---|
6202 | - venus: make device memory alloc async
|
---|
6203 | - venus: enable Vulkan 1.3 for Android 13 and above
|
---|
6204 | - zink: sync queue access for vkQueueWaitIdle
|
---|
6205 | - venus: properly expose KHR_external_fence/sempahore_fd
|
---|
6206 | - ci/venus: mark more flaky tests after recent cts uprev
|
---|
6207 | - venus: fix query feedback batch leak and race upon submission
|
---|
- zink: apply can_do_invalid_linear_modifier to Venus

Yogesh Mohan Marimuthu (12):

- gallium: remove start_slot parameter from pipe_context::set_vertex_buffers
- ac/surface: add astc block size to bpe_to_format() function
- util: move ASTCLutHolder from mesa/main to util
- vulkan/formats,zink: move vk_format_from_pipe_format() function
- vulkan/runtime: add compute astc decoder helper functions
- vulkan: add 3D texture support for compute astc decoder
- radv: integrate meta astc compute decoder to radv
- radeonsi: add more documentation for dpbb debug env variable
- docs: remove document for unused variable dfsm from AMD_DEBUG
- radeonsi: correct old comment in si_emit_framebuffer_state()
- radeonsi: In gfx6_init_gfx_preamble_state() use gfx_level only from sctx
- radeonsi: add radeonsi to GL_RENDERER string

Yonggang Luo (43):

- lima: Convert to use nir_foreach_function_impl when possible
- freedreno: Switch to use nir_foreach_function_impl in tu_shader.cc
- zink: Convert to use nir_foreach_function_impl when possible
- lavapipe: Convert to use nir_foreach_function_impl
- lavapipe: fixes indent of function lvp_inline_uniforms
- microsoft/compiler: convert to use nir_foreach_function_with_impl in function emit_module
- microsoft/clc/compiler: Convert to use nir_foreach_function_impl when possible
- radeonsi: Convert to use nir_foreach_function_impl
- ac: Switch to use nir_foreach_function_impl in function analyze_shader_before_culling
- util: Move pipe_swizzle from p_defines.h to u_formats.h
- util: Move PIPE_MASK_* from p_defines.h to u_formats.h
- util: Move pipe_color_union from p_defines.h into u_formats.h
- util: Move u_pack_color.h and dbughelp.h into src/util from src/gallium/auxiliary/util/
- util: Remove include "pipe/\*.h" in src/util/* files
- util: Move only gallium used u_debug_refcnt.* and u_debug_describe.* into src/gallium/auxiliary/util/
- util/meson: Getting mesa util core to be self contained
- pvr: decouple vulkan driver and compiler from gallium
- freedreno: decouple compiler and vulkan driver from gallium
- glx: decouple from gallium
- meson: Remove arm_neon_workaround
- nouveau/drm-shim: Decouple from gallium
- ac/radv: decouple radv vulkan driver and compiler from gallium
- etnaviv: decouple drm from gallium
- asahi: decouple layout from gallium
- compiler: Move WRITEMASK_* from prog_instruction.h into shader_enums.h
- intel/blorp: Use float directly to avoid #include "mesa/main/format_utils.h"
- intel/blorp: brw_sampler_prog_key_data::swizzles is only and should only accessed in crocus
- intel/brw: Define and use BRW_SWIZZLE_* instead of SWIZZLE_*
- crocus: #include "program/prog_instruction.h" for SWIZZLE_*
- intel/compiler,intel/blorp,intel/vulkan: decouple vulkan driver and compiler from gallium
- util/treewide: Use alignas(x) instead __attribute__((aligned(x)))
- v3dv: Use alignas(8) over 64 bit atomic value
- svga: use alignas over struct MKSGuestStatInfoEntry
- radv: Fixes mingw linkage error undefined reference to \`radv_GetCalibratedTimestampsEXT'
- v3d: Use DIV_ROUND_UP instead div_round_up
- freedreno: Use shared DIV_ROUND_UP instead div_round_up
- sfn: Use 4 instead of ATOMIC_COUNTER_SIZE
- intel/brw: use 4 instead of MAX_VERTEX_STREAMS to avoid #include "mesa/main/config.h"
- d3d12: replace use of MAX_VERTEX_STREAMS with PIPE_MAX_VERTEX_STREAMS
- compiler: use 4 instead ATOMIC_COUNTER_SIZE in glsl_types.h to avoid #include "mesa/main/config.h"
- compiler/glsl: Move glsl_print_type from glsl_types.* to ir_print_visitor.cpp
- util: Deduplicate macros between u_math.h and macros.h
- nvk: Should use alignment instead of align

Yusuf Khan (4):

- nouveau/ws: remove the drm.h header
- nvk: implement GetDeviceMemoryCommitment
- nvk: support GetImageSparseMemoryRequirements2
- nvk: expose KHR_driver_properties

Zhang Ning (1):

- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"

antonino (14):

- virgl: add ci flake
- freedreno: add ci flake
- zink: remove unused indices from \`nir_load_push_constant` calls
- zink/nir: add a zink specific intrinsic for push constants
- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle"
- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion"
- vulkan: Extend vkGet/SetPrivateDataEXT handling to all platforms
- vulkan: Extend vkGet/SetPrivateDataEXT handling to VkSurface
- vulkan: Handle vkSetDebugUtilsObjectNameEXT on WSI objects
- zink: store bindless var when creating it to avoid creating it again
- nir: fix several crashes in \`nir_lower_tex`
- nir: don't take the derivative of the array index in \`nir_lower_tex`
- vulkan: use instance allocator for \`object_name` in some objects

cheyang (1):

- isaspec: fix isaspec build error in aosp

georgeouzou (1):

- nvk: Support VK_EXT_line_rasterization

jazzfool (1):

- zink: Hash only first 32 bits of zink_gfx_pipeline_state with full DS3

lorn10 (1):

- docs: Update Clover's env variable documentation

norablackcat (2):

- spirv/nir_to_spirv: add expect assume op codes
- rusticl: add cl_khr_expect_assume

timmac-qmc (1):

- glsl: fix potential crash with DisableUniformArrayResize

twisted89 (1):

- util/driconf: add workarounds for the Chronicles of Riddick

wangra (1):

- tu/kgsl: Fix bitfield of DITHER_MODE_MRT6

xurui (1):

- glx: There is no need to psc++