VirtualBox

source: vbox/trunk/src/VBox/Additions/3D/mesa/mesa-24.0.2/docs/relnotes/23.2.1.rst

最後變更 在這個檔案是 103996,由 vboxsync 提交於 11 月 前

Additions/3D/mesa: export mesa-24.0.2 to OSE. bugref:10606

檔案大小: 263.7 KB
 
1Mesa 23.2.1 Release Notes / 2023-09-28
2======================================
3
4Mesa 23.2.1 is a new development release. People who are concerned
5with stability and reliability should stick with a previous release or
6wait for Mesa 23.2.2.
7
8Mesa 23.2.1 is an unusual first stable release due to the accidentl tagging of
923.2.0 durring the rc cycle.
10
11Mesa 23.2.1 implements the OpenGL 4.6 API, but the version reported by
12glGetString(GL_VERSION) or glGetIntegerv(GL_MAJOR_VERSION) /
13glGetIntegerv(GL_MINOR_VERSION) depends on the particular driver being used.
14Some drivers don't support all the features required in OpenGL 4.6. OpenGL
154.6 is **only** available if requested at context creation.
16Compatibility contexts may report a lower version depending on each driver.
17
18Mesa 23.2.1 implements the Vulkan 1.3 API, but the version reported by
19the apiVersion property of the VkPhysicalDeviceProperties struct
20depends on the particular driver being used.
21
22SHA256 checksum
23---------------
24
25::
26
27 64de0616fc2d801f929ab1ac2a4f16b3e2783c4309a724c8a259b20df8bbc1cc mesa-23.2.1.tar.xz
28
29
30
31New features
32------------
33
34- VK_EXT_attachment_feedback_loop_dynamic_state on RADV
35
36- extendedDynamicState3SampleLocationsEnable on RADV
37
38- VK_EXT_dynamic_rendering_unused_attachments on RADV
39
40- VK_EXT_mesh_shader on lavapipe
41
42- OpenGL 3.1 on Asahi
43
44- OpenGL ES 3.0 on Asahi
45
46- VK_KHR_fragment_shader_barycentric on RADV/GFX10.3+
47
48- VK_KHR_ray_tracing_pipeline on RADV/GFX10.3+
49
50- VK_EXT_depth_bias_control on RADV
51
52- VK_EXT_fragment_shader_interlock on RADV/GFX9+
53
54- VK_EXT_pipeline_robustness on RADV
55
56
57
58Bug fixes
59---------
60
61- intel: State cache invalidation after BLORP binding table setup ought to be unnecessary on ICL.
62- RadeonSI: glClear() causes clear texture for some frames on RX580
63- shader_test causing a crash in compiler
64- Crash in st_ReadPixels
65- [ANV] [DG2/A770] The Spirit and The Mouse, miscellaneous issues with Mesa Git
66- Penumbra: Overture hangs on new game loading screen
67- radv: Regression from 266b2cfe5bf3feda16747c50c1638fb5a0426958
68- h264 encoding picture showed randomly repeated frames.
69- [Google][Rex][anv] GLES dEQP test fails in anv when run via ANGLE-on-Venus on ChromeOS ARCVM.
70- VAAPI on VCN: bad stream may crash whole gfx system
71- aco: Assertion when compiling CP2077 shader
72- [RADV] Dead by Daylight memory leak (shader-related?) on 23.1.6
73- gpu hang on DG2 when running KHR-GLES31.core.texture_cube_map_array.image_op_tess*
74- KHR-GLES31.core.texture_cube_map_array.image_op_tessellation_evaluation_sh fail on GFX12+
75- wsi: deadlocks when DISPLAY is changed
76- VAAPI: AMDGPU crash on RX 6900 XT on corrupted video
77- [RADV] red and pink tinted shadows in Overwatch 2 on 7900 XTX
78- blorp regression on dg2
79- radv: commit 81641b01555faa4dd1dfc7de2513ad8d63e77ab7 leaded to artifacts in Quake II RTX
80- [radv] Colors are distorted in Cyberpunk 2077 with ray tracing enabled
81- Forza Horizon 5 stuttering since mesa 23.1.4 / 9b008673 revert as a FIX
82- glCopyTexSubImage2D is very slow on Intel
83- NVE4 (GeForce 710) fails to get vdpau in mesa git
84- nouveau prevents hardware acceleration with Chromium (Wayland)
85- Corrupt text rendering in Blender
86- DRI2 gallium frontend is using bad format type
87- Incorrect vlVaCreateBuffer/vlVaMapBuffer behavior for buffer type VAEncCodedBufferType in Gallium
88- ci: do not download perfetto on-fly in build jobs
89- Shared Memory Leak With Qt OpenGL Applications
90- OpenGL, SIGSEGV when program pipeline objects has separated vertex shader progam and separated fragment shader progam with in/out
91- 975a8ecc881873744d851ab0ef45ad7698eaa0ef "frontends/va: use resources instead of views" cause radeonsi can't play video.
92- Rusticl,radeonsi: ac_rtld error(2): too much LDS
93- aco, radv Rage 2 menu corruption - bisected
94- radv, aco: World War Z character texture regression on 7900xtx
95- lavapipe/llvmpipe: regressions since descriptor rewrite
96- Building llvmpipe with LP_USE_TEXTURE_CACHE set fails since 23.2.0-rc1: error C2039: dynamic_state is not member of lp_build_sampler_soa in lp_tex_sample.c
97- [anv] Death Stranding crashes
98- Can no longer build Clover without llvmspirvlib
99- Baldurs Gate 3 (DX11) - Graphical corruption on RDNA3 (ACO regression)
100- intel: Deathloop and other DX12 games fail assert(validated) with invalid SEL instruction
101- gpu hangs on dg2 with mesh shading enabled on vkcts
102- GTF-GL46.gtf21.GL.build.CorrectFull_vert regressed on intel platforms
103- radeonsi: Deadlock when creating a new GL context in parallel with linking a shader on another GL context
104- robustness2 raygen tests intermittently fail in Intel Mesa CI
105- glthread: huge performance regression
106- DirectX games do not launch on Intel HD Graphics 4000 (IVB GT2) [bisected]
107- [Vega 64] Newer Mesa-git revisions past 283be8ac3b8610a77b28ebe9e44b946b979f0381 crash the system when accessing hardware accelerated apps
108- Docs: Imagination driver not have documentation in https://docs.mesa3d.org/
109- Unigine Heaven broken on Navi 21 since https://gitlab.freedesktop.org/mesa/mesa/-/merge_requests/22846
110- [bisected] amdgpu graphics acceleration causing system crashes on 22f3bcfb5a33 or later
111- anv: GPU hangs on MSAA tests with Angle
112- AMD OpenGL texture corruption and crashing regression in java app
113- The filenumber part of the #line preprocessor directive is ignored for multiline error messages
114- r300: channel merging missed case for mad + mov
115- radv: incorrect RTE rounding in corner cases
116- Confidential issue #4103
117- r600 regression
118- clc: compiler_test gets built even if unit tests are disabled via -Dbuild-tests=false
119- x11 swapchain fails to check for DRI3 PixmapFromBuffers error
120- msys2: build fails with error: conflicting types for '_glapi_add_dispatch'
121- [bisected][anv] newly enabled test (dEQP-VK.api.info.format_properties.g8_b8r8_2plane_420_unorm) failing
122- deqp gles3.1 gpu hangs on DG2 A380 when running zink
123- [amd/drm-shim] build issue on ppc64
124- radeonsi: bogus advertisement for encode/decode support for 10 bit h264 video
125- radeon: Basphemous graphical glitch
126- aztec ruins gl benchmark slow to compile shaders on intel
127- anv: false cacheline flushing or insufficient buffer alignment on at least ADL
128- macOS build error timespec
129- intel: Borderlands 2 misrendering with ZINK with OpTerminateInvocation instruction
130- gallium: Error path in st_create_context_priv leads to segfault
131- [Vulkan][TGL] vkCmdCopyQueryPoolResults failed to write buffer after compute shader write with overlap
132- r600: GPU hang on The Long Dark on R600/R700
133- Add iris and crocus to features.txt
134- r600: Segfault on glxgears and almost every OpenGL applications on RV770 (regression)
135- R9 280 - Broken font rendering in Godot Engine (GLES2) - Radeonsi
136- radeonsi gcn1 regression
137- ANV crashes on init on 32 bit builds
138- eglCreateImageKHR should throw a error when called with anything but EGL_NO_CONTEXT
139- virgl: Stack overflow in virgl_bind_sampler_states on hosts with more than 32 samplers
140- [TGL] regression with r11_g11_b10 formats when running through virgl
141- anv: incorrect vkGetPhysicalDeviceImageFormatProperties2KHR success
142- r300: reconstruct ARR in shaders from wined3d
143- ci: enable pre-merge testing for Zink/RADV
144- rusticl: segmentation fault when enabling for llvmpipe and zink with the driver set to zink
145- unify load_ubo_dxil and load_ubo_vec4
146- Move \`lower_loads_and_stores_to_dxil` bit-size lowering logic to \`nir_lower_mem_access_bit_sizes`
147- Intel drivers fail to link with -Dxmlconfig=disabled
148- mesa: Remove dynamic dispatch stubs
149- radv: regression UE5.2 nanite d3d12 vertex explosion
150- [ANV/DG2] nvpro-samples/vk_raytracing_tutorial_KHR/ray_tracing_reflections crash
151- validate_intrinsic_instr: Assertion \`dst' failed.
152- anv: fails to build on aarch64
153- radv: crash/freeze/assert with raytracing and Elden Ring 1.09
154- Elden Ring freeze when summoning cooperator with Raytracing
155- gc_alloc_size returns unaligned allocations
156- Rusticl OpenCL: panicked at 'called \`Option::unwrap()` on a \`None` value' && void SPIRVKernelParser::applyDecoration(uint32_t, const spv_parsed_instruction_t*): Assertion \`c.first == id' failed.
157- Using a \`NULL` pointer as \`bitmap` in \`glBitmap` leads to \`GL_OUT_OF_MEMORY` while creating display-list ("glNewList -> glBitmap")
158- Bitwise and with constant 31 removed on width argument to BitFieldSExtract, causing incorrect result on RADV ACO
159- Rusticl OpenCL: Simple SYCL / DPC++ program hangs indefinitely at rusticl::api::event::wait_for_events()
160- radeonsi: Metro Last Light Redux graphical glitches
161- radv: VK_KHR_fragment_shader_barycentric support
162- freedreno/a6xx: assert(state->view_rsc_seqno[i] == seqno) failed with sway
163- radv: Trackmania 2020 crashing on mesa-git
164- radv crashes when using vertex format VK_FORMAT_B10G11R11_UFLOAT_PACK32
165- changes in commit e4b6a0a82457b3ef40c5857412e20bc344ff302c leads to GPU hang
166- radv,aco: In the game Rise of the Tomb Raider on RDNA 3 GPUs appeared artifacts after commit 290c3d360e5a6f5226c062d6a9267629adb1060e
167- CI: Linux CI jobs naming
168- docs.mesa3d.org contrast is bad in dark mode
169- iris now requires Linux v5.3
170- Wolfenstein II: The New Colossus vsync off crash
171- Surfaceless mode ES2.0 number of vertices limitation
172- freedreno: firefox crashes on video playback
173- radv: unaligned vertex input regression
174- rusticl build error: error[E0308]: mismatched types on some archs
175- GetInternalformativ with GL_TEXTURE_2D and GL_CLEAR_TEXTURE incrrectly returns GL_INVALID_ENUM
176- radeonsi: texturing is broken on R9.270x since eaf98b14220d8cbc186d67a929254acc3e7de41a
177- aco: KHR-GL46.shader_image_load_store.basic-allTargets-atomic asserts on Bonaire
178- Firefox / VA-API / H.264 decoding artifacts on AMD RX 6600 / Fedora 37
179- AMD/RX 6600 - VA-API video output is corrupted if decoded surfaces are exported by vaExportSurfaceHandle and then quickly returned to ffmpeg/va-api decoder and reused
180- Pixel Game Maker MV - Elfin Force
181- Anvil - Vulkan CTS tests fail if has_context_isolation set to false
182- anv: binding table pool leak or overly cached
183- [regression/bisected] Bone wireframes are no longer rendered correctly in Blender with RadeonSi/Vega
184- [nine, radeonsi] Texture missing in Heroes of Might and Magic 5
185- [REGRESSION] Crash in \`loader_dri3_wait_gl()` due to \`dri3_front_buffer(draw) == NULL`
186- r600: Visual glitches on The Long Dark with the NIR backend
187- some piglit tests seg-fault if -Dgles1=disabled is set
188- anv: Tom Clancy's Rainbow Six Siege [DX11: Image Corruption(FIXED)/Vulkan: crash on lauch]
189- glSpecializeShaderARB works on SPIR-V compiled with shader compilers ca. 2021, but fails for SPIR-V generated with current compilers
190- ANV: Vulkan driver regression in clearing Depth/Stencil
191- radv: Sample rate shading broken in AC:Valhalla
192- FTBFS: gallivm: src/gallium/auxiliary/gallivm/lp_bld_init.c:45:10: fatal error: llvm-c/Transforms/Scalar.h: No such file or directory (Legacy Pass Manager removed in LLVM 17)
193- intel: workaround mechanism initialized before device revision (stepping) is available
194- radv/rt: crash compiling Unity Enemies Demo RT pipelines
195- gfxbench4/5 crashing on android
196- mesa 23.0.3 build failure
197- gallium-aux msan failure on Debian bookworm
198- radv: Battlefield {1,5} hangs on RX 7900 XTX
199- radv: graphical artifacts in MSFS running via DXVK on RX7900XT
200- vulkancts regressions on bdw
201- [BDW] intel/blorp: MCS partial resolve produces unexpected result
202- Compile failure v23.0.0 - error: implicit declaration of function
203- ci/radv: Stop setting MESA_SPIRV_LOG_LEVEL
204- "frontends/va: report min width and min height values if available" broke VA-API tests on amd-raven
205- [radeonsi] flickering debug chunk border lines in Minecraft
206- nine: Lower alpha test in shader key? or require PIPE_CAP_ALPHA_TEST?
207- radv, radeonsi: Rogue Legacy 2 alpha-to-coverage rendering issues
208- [r600, TURKS] R600: Unsupported instruction: vec1 32 ssa_1 = intrinsic image_samples (ssa_0) on spec@arb_shader_texture_image_samples@[email protected] (23.1.0-rc4)
209- anv: Overwatch 2 hangs GPU with GPL enabled
210- Penumbra: Overture ingame enviroment not displaying on Proton version
211- Penumbra: Overture ingame enviroment not displaying on Proton version
212- nir: 'base' may be used uninitialized
213- vulkan/device_select: no way to select between GPUs of the same model due to bugs
214- radv: 7900 XTX hair flickering/rendering issues in VaM
215- radv: cache crashing
216- nouveau: Regression in arb_transform_feedback_overflow_query-basic from multithreading patches
217- regression in aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv
218- radeonsi: vaapi: \`width >2880 && width % 64 != 0` results in wrong width in h265 stream
219- [regression] iris: unable to use driver as secondary GPU (primary AMD GPU)
220- iris: steam doesn't render on dg2
221- [llvm 16+] [microsoft-clc] opencl-c-base.h does not exist
222- Vulkancts clipping / tesselation tests trigger gpu hang on DG2
223- Swaped fields in picture in vlc and mythtv if hw accel is on
224- radeonsi: glGetGraphicsResetStatusEXT keeps returning GL_INNOCENT_CONTEXT_RESET after a GPU reset
225- WGL: Assert assigns dwThreadId variable
226- Intel/anv: Modifier problems running gamescope embedded
227- R600: drop TGSI code path
228- r600_shader.c:193 r600_pipe_shader_create - translation from TGSI failed !
229- nine regression with r600 (bisected)
230- [ACO] [RADV] Flickering squares in some areas in The Last of Us Part 1 (with workaround)
231- radv: Jedi Fallen Order flickering & blocky plants
232- qemu 7.2.0-rc4 with sdl output crashes with assert
233- nouveau: NV50 (NVAC) broken in latest master
234- [dozen]: [vkcube] force closing on WSL2
235- rusticl failed to build with rust-bindgen 0.65.0
236- nine: strange color or transparency of trees when called DrawIndexedPrimitive?
237- Regression, Bisected: glsl: Delete the lower_tess_level pass breaks r600 tesselation
238- vkcts-navi21-valve failing often with GCVM_L2_PROTECTION_FAULT_STATUS:0x00X00830
239- ci/radv: Remove vkctx-navi21-llvm-valve job?
240- Deep Rock Galactic GPU freeze (AMD, DX11 DXVK Proton)
241- radv: Resident Evil 4 Chainsaw Demo GPU hang with Navi 24
242- radv: Gotham Knights GPU hang with Navi 24
243- SPIR-V error "Invalid back or cross-edge in the CFG"
244- SPIR-V parsing FAILED: Loop breaks can only break out of the inner most nested loop level
245- ci: a618 traces performance broken
246- aco: s_load_dword with negative soffset cause GPU hang
247- d3d12: Attempting to display a framebuffer through GDI with low bpc produces on-screen corruption
248- piglit.spec.ext_image_dma_buf_import.ext_image_dma_buf_import crash shutting down
249- overlay layer: unable to launch titles on steam
250- radv/zink: spec@ext_texture_integer@multisample-fast-clear gl_ext_texture_integer
251- ci: a530-gl with 6.3 kernel
252- a530: hangs with newer firmware version on db820c (apq8096)
253- tu: debug marker support
254- VAAPI: Wrong H.264 playback on RX 6900 XT and RX 6700 XT (all Sienna?)
255- radv: possibly not setting state dirty bits correctly
256- RADV: VRS attachment not working in specific scenario
257- VAAPI/AMD: videos less than 64 pixels in width or height are decoded to black
258- d3d12: DirectX doesn't support seperate stencil functions for front and back face
259
260
261Changes
262-------
263
264Adam Jackson (1):
265
266- egl: Clear EGL_WINDOW_BIT for non-double-buffered EGLConfigs
267
268Alan Previn (2):
269
270- drm-uapi: bump headers (except AMD)
271- iris: Add GET_PARAM for protected context capability support
272
273Alejandro Piñeiro (12):
274
275- v3dv/pipeline: don't prepack up early-z configuration
276- v3d: use more an auxiliar devinfo
277- v3d: remove v3d_create_texture_shader_state_bo
278- v3d: remove v3d_tfu_supports_tex_format
279- v3d: remove v3d_get_internal_type_bpp_for_output_format
280- broadcom/compiler: return NULL if we fail to register allocate
281- v3d: assert if v3d_compile returns NULL
282- broadcom/compiler: disable tmu pipelining when needed
283- broadcom/compiler: clarify use of QFILE_VPM
284- v3dv: refactor copy_image_to_buffer_blit
285- v3dv: add a linear images to buffer copy codepath
286- v3dv/device: update conformanceVersion
287
288Aleksey Komarov (2):
289
290- pan/va: Fix MUX.v2i16 and MUX.v4i8 description
291- pan/va: fix typo in IADD_IMM.i32 description
292
293Alex Denes (1):
294
295- virgl: link VA driver with build-id
296
297Alexander von Gluck IV (1):
298
299- egl/haiku: Fix potential crash if double buffering is disabled
300
301Alyssa Rosenzweig (289):
302
303- gallium: Add u_default_get_sample_position
304- zink: Use u_default_get_sample_position
305- panfrost: Use u_default_get_sample_position
306- freedreno: Use u_default_get_sample_position
307- d3d12: Use u_default_get_sample_position
308- nir: Add more system values for lowering XFB
309- pan/bi: Don't set has_fsub
310- asahi: Fix disk cache disable with AGX_MESA_DEBUG
311- asahi: Minify width/height in create_surface
312- asahi: Don't use depth/stencil staging blits
313- asahi: Identify XML for barycentric coordinates
314- asahi: Track write to separate stencil
315- agx: Handle splits of uniforms
316- agx: Fix abs/neg propagation into fcmpsel
317- agx/lower_zs_emit: Fix progress returning
318- agx: Handle linear 2D array textureSize()
319- asahi: Explicitly ban MSAA, compression with linear
320- asahi: Use 2D array staging resources for cube/3D
321- asahi: Compress more texture targets
322- agx: Remove bogus assert
323- asahi: Use u_default_get_sample_position
324- agx: Defeature fsub
325- asahi: Use device_load shift for VBO loads
326- agx: Fix packing for iadd with shift
327- asahi: Rename no colour output to tag write disable
328- asahi: Copy resources if needed to shadow
329- agx: Don't wait at the end of the shader
330- asahi: Bind staging resources as RENDER_TARGET
331- agx/lower_address: Add helper to match multiplies
332- agx/lower_address: Match multiplies, not only shifts
333- agx: Ensure load_frag_coord has the right sizes
334- agx: Rework z/s emit
335- agx: Validate that collect sources are the same size
336- agx: Lower I/O to scalar later
337- asahi: Shrink disk cache size of push ranges
338- asahi: Bump MAX_PUSH_RANGES to the worst-case
339- asahi: Implement transform feedback
340- asahi: Fix depth load/store flags
341- nir: Add nir_alu_src_as_uint helper
342- pan/bi: Use nir_alu_src_as_uint
343- agx: Use nir_alu_src_as_uint
344- nir: Model AGX-specific multiply-shift-add
345- agx: Handle imadshl_agx, imsubshl_agx
346- agx: Fix packing of imsub instructions
347- agx: Optimize multiplies
348- zink: Always set a blend state for shader-db
349- ail: Handle larger block sizes
350- nir: Allow adding descriptions to ALU opcodes
351- nir: Make ALU descriptions machine-readable
352- docs: Include ALU opcode descriptions
353- nir: Add nir_foreach_phi(_safe) macro
354- nir: Use nir_foreach_phi(_safe)
355- dxil: Use nir_foreach_phi_safe
356- ac/llvm: Use nir_foreach_phi
357- nir: Use nir_block_last_phi_instr more
358- nir: Add unified atomics
359- nir: Add pass to lower atomics to unified
360- agx: Use unified atomics
361- pan/bi: Use unified atomics
362- pan/mdg: Fix icky formatting
363- pan/mdg: Use unified atomics
364- gallivm: Use unified atomics
365- ntt: Use unified atomics
366- ac/llvm: Don't handle atomic derefs
367- ac/llvm: Use unified atomics
368- aco,radv: Use unified atomics
369- zink: Use unified atomics
370- ir3: Use unified atomics
371- nir: Handle unified atomics in simple cases
372- nir/lower_task_shader: Handle unified atomics
373- nir/lower_io: Handle unified atomics
374- nir/lower_ssbo: Handle unified atomics
375- nir/opt_uniform_atomics: Handle unified atomics
376- nir/validate: Handle unified atomics
377- radv: Constify radv_device_supports_etc
378- radv: Use common GetPhysicalDeviceFeatures2
379- r600: Use unified atomics
380- lvp: Use common GetPhysicalDeviceFeatures2
381- tu: Use common GetPhysicalDeviceFeatures2
382- agx: Lower legacy atomics sooner
383- pan/mdg: Lower legacy atomics earlier
384- panvk: Lower legacy atomics earlier
385- tu: Lower legacy atomics earlier
386- v3dv: Lower legacy atomics earlier
387- lavapipe: Lower legacy atomics sooner
388- glsl/nir: Produce unified atomics
389- nir/lower_atomics_to_ssbo: Produce unified atomics
390- nir/lower_printf: Produce unified atomic
391- mesa/st: Produce unified atomics
392- vtn: Produce unified atomics
393- intel: Produce unified atomics
394- ac: Produce unified atomic
395- treewide: Stop lowering legacy atomics
396- nir: Drop nir_lower_legacy_atomics
397- ntt: Stop handling legacy atomics
398- nir: Drop legacy atomics in simple cases
399- nir/lower_io: Drop legacy atomics
400- nir/lower_task_shader: Drop legacy atomics
401- nir/validate: Drop legacy atomics
402- nir/opt_load_store_vectorize: Reclaim ATOMIC
403- nir/opt_uniform_atomics: Drop legacy atomics
404- nir: Remove legacy atomics
405- nir: Drop unused name from nir_ssa_dest_init
406- nir: Drop unused argument from nir_ssa_dest_init_for_type
407- nir: Remove stale TODOs
408- nir: Fix incorrect comment
409- util: Add common hex dump utility
410- asahi: Use common hexdump utility
411- pan/decode: Use common hexdump
412- CODEOWNERS: Update panfrost
413- gallium: Drop Asahi-as-a-swrast hack
414- asahi: Drop Asahi-as-a-swrast hack
415- nir: Document extra image source
416- nir: Add image_texel_address intrinsics
417- nir: Add pass to lower image atomics
418- pan/bi: Fix atomic exchange on Valhall
419- pan/bi: Use nir_lower_image_atomics_to_global
420- pan/mdg: Use nir_lower_image_atomics_to_global
421- gallium: Add pipe_image_view::single_layer_view
422- mesa/st: Set pipe_shader_image::single_layer_view
423- dxil: Rely on scoped_barrier
424- treewide: Avoid nir_lower_regs_to_ssa calls
425- nir/opt_barriers: Add a default callback
426- agx: Use common combine_all_barriers callback
427- nir: Drop stale comments
428- zink: Switch to scoped barriers
429- panfrost/ci: Skip Piglit tests known to crash
430- panfrost/ci: Skip hanging test
431- nir: Add intrinsics for multisampling on AGX
432- nir/builder: Add nir_replicate helper
433- treewide: Use nir_replicate
434- pan/lower_framebuffer: Use nir_replicate
435- radv/query: Use nir_trim_vector
436- intel/blorp: Use nir_trim_vector
437- nir/print: Print locations for geometry shader inputs
438- gallium: Add util_image_to_sampler_view helper
439- panfrost: Use util_pipe_image_to_sampler_view
440- nir: Add and use nir_tex_src_ssa
441- treewide: Use nir_tex_src_for_ssa
442- treewide: Use nir_trim_vector more
443- agx: Set support_16bit_alu
444- agx: Constant fold when optimizing int64
445- agx: Use textures_used, not num_textures
446- asahi: Add passes to lower MSAA
447- asahi: Add passes to lower sample intrinsics
448- asahi: Add alpha-to-coverage (and alpha-to-one) lowering
449- agx: Assert that sample shading is lowered
450- asahi: Set uses_sample_shading for background program
451- asahi: Plumb API sample mask into shaders
452- asahi: Plumb ppp_multisamplectl into shaders
453- agx: Model both sources of sample_mask
454- agx: Plumb in nir_intrinsic_load_sample_mask_in
455- agx: Handle sample_mask_agx
456- agx: Enable tag writes when sample mask written
457- agx: Lower discard in NIR
458- asahi,agx: Call lower_discard_zs_emit in the driver
459- agx: Split iter and iterproj instructions
460- agx: Model interpolation for iter instructions
461- agx: Handle centroid and sample interpolation
462- asahi: Lower MSAA
463- asahi: Use nonempty tib for MSAA
464- agx: Emit shader info late
465- asahi: Advertise GL 3.1
466- agx: Stop bit-inexact conversion propagation
467- asahi: Add ASAHI_MESA_DEBUG=nowc flag
468- asahi: Extract transition_resource helper
469- asahi: Decompress writable images
470- asahi: Decompress with format reinterpretation
471- asahi: Remove stale comments
472- pan/mdg: Drop lower_locals_to_regs call
473- lima: Drop lower_locals_to_regs call
474- ir2: Drop lower_locals_to_regs call
475- nir: Add AGX atomic intrinsics
476- agx: Refactor expressions in agx_nir_lower_address
477- agx: Fold addressing math into atomics
478- nir/builder: Add steal_tex_src helper
479- nir/lower_tex: Use nir_steal_tex_src
480- agx: Use common nir_steal_tex_src
481- nir: Add interleave_agx instruction
482- vtn: Handle atomic counter semantics
483- ir3: Drop reference to unsupported intrinsic
484- ttn: Emit scoped barriers when needed
485- ntt: Use scoped barriers
486- ac/llvm: Drop memory_barrier_buffer impl
487- glsl: Assume use_scoped_barrier
488- vtn: Assume use_scoped_barrier
489- nir: Assume use_scoped_barrier
490- ttn: Assume use_scoped_barrier
491- treewide: Remove use_scoped_barrier
492- nir/tests: Use scoped barriers internally
493- nir: Remove handling for non-scoped barriers
494- radeonsi: Scan for scoped barriers
495- nir: Remove non-scoped barriers
496- iris: Don't use STREAMING_LOAD without SSE
497- nir/builder: Add ubitfield_extract_imm helper
498- agx: Implement bitfieldExtract natively
499- asahi: Use bitfield_extract for texture lowering
500- nir: Remove integer and 64-bit modifiers
501- aco: Drop NIR parallel copy handling
502- nir: Add discard_agx intrinsic
503- agx: Update explanation of sample_mask behaviour
504- agx: Fix discards
505- agx: Extract coordinate register size calculation
506- agx: Recollect stored vectors at their use
507- agx: Add loop header? flag
508- agx: Validate predecessor information
509- agx/lower_parallel_copy: Lower 64-bit copies
510- agx: Implement vector live range splitting
511- nir/lower_bool_to_int32: Fix progress reporting
512- nir/lower_locals_to_regs: Add bool bitsize knob
513- gallivm: Use NIR_PASS macros
514- nir: Add pixel_coord, frag_coord_zw intrinsics
515- nir: Add lower_frag_coord_to_pixel_coord pass
516- pan/bi: Use lower_frag_coord_to_pixel_coord
517- agx: Use nir_lower_frag_coord_to_pixel_coord
518- asahi: Use txf for background program
519- nir/lower_blend: Optimize masked out RTs
520- nir: Add nir_builder_create returning nir_builder
521- nir: Use nir_builder_create
522- treewide: Use nir_builder_create more
523- treewide: Remove unused builders
524- nir: Add nir_foreach_function_impl helper
525- nir: Convert to nir_foreach_function_impl
526- nir/validate: Assert txf(_ms) matches dimension
527- nir: Add nir_lower_robust_access pass
528- broadcom/compiler: Use nir_lower_robust_access
529- broadcom/compiler: Remove v3d_nir_lower_robust_access
530- broadcom/compiler: Remove unused #define
531- broadcom/compiler: Use nir_steal_tex_src
532- nir: Add b32fcsel_mdg opcode for Midgard
533- pan/mdg: Optimize b32csel(inot) in NIR
534- pan/mdg: Type CSEL with a NIR pass
535- pan/mdg: Lower isub in common code
536- pan/mdg: Constant fold after algebraic_late
537- pan/mdg: Add is_ssa helper
538- pan/mdg: Fix IR from scheduling conditions
539- pan/mdg: Fix 2-const CSEL at block beginning
540- pan/mdg: Fix temp count calculation
541- pan/mdg: Lower special reads better
542- pan/mdg: Reset predicate.exclude while scheduling
543- pan/mdg: Copy-prop even with swizzle restrictions
544- pan/mdg: Propagate modifiers in the backend
545- nir: Rename load/store_reg -> load/store_register
546- nir: Rename nir_reg_{src,dest} -> nir_register_{src,dest}
547- agx: Add algebraic opt to help with discard lowering
548- agx: Smarten discard_agx -> sample_mask lowering
549- asahi: Strip ? in GenXML
550- asahi: Rename 'Render Target' to 'PBE'
551- asahi: Identify PBE::sRGB flag
552- asahi: Remove ; in perf_debug_ctx
553- agx: Use nir_opt_shrink_stores
554- agx: Use nir_opt_shrink_vectors
555- agx: Assert that barriers are not used in the preamble
556- asahi: Assert we don't transition shared resources
557- asahi: Fix scissor_culls_everything check
558- asahi: Use ralloc harder
559- asahi: Take ownership of compute shader NIR
560- agx: Don't leak ssa_to_reg_out
561- asahi: Use txf_ms for MSAA background programs
562- nir: Fix breaking in nir_foreach_phi(_safe)
563- vulkan: Add vk_index_type_to_bytes helper
564- lavapipe: Use vk_index_type_to_bytes
565- v3dv: Use vk_index_type_to_bytes
566- rogue: Remove commented convert_from_ssa call
567- nir: Add intrinsics for register access
568- nir: Add helpers for walking register uses
569- nir: Add pass for trivializing register access
570- nir: Add legacy data structures & helpers
571- nir: Add new version of lower_regs_to_ssa
572- nir: Produce intrinsics in lower_{phis,ssa_defs}_to_regs
573- nir: Add intrinsics version of locals_to_regs
574- nir: Add lower_vec_to_regs pass
575- gallium: Return SSA values from TTN ALU helpers
576- gallium: Convert TTN to register intrinsics
577- mesa: Simplify ptn_log() a bit
578- mesa: Return SSA defs from PTN ALU helpers
579- mesa: Convert PTN to register intrinsics
580- nir/lower_shader_calls: Convert to register intrinsics
581- nir: Remove nir_lower_regs_to_ssa
582- nir: Remove nir_register-based unit tests
583- gallivm: Switch to reg intrinsics
584- pan/mdg: Ingest new-style registers
585- panfrost: Fix transform feedback on v9
586- panfrost: Lower vertex_id for XFB
587- panfrost: Fix transform feedback on v9 harder
588- nir/trivialize: Handle more RaW hazards
589- nir/lower_blend: Fix 32-bit logicops
590- nir/lower_helper_writes: Consider bindless images
591- nir/passthrough_gs: Fix array size
592
593Amber (3):
594
595- turnip: fix buffer markers using wrong addresses
596- ir3, freedreno: implement GL_ARB_shader_draw_parameters
597- freedreno: implement GL_ARB_indirect_parameters
598
599Andres Calderon Jaramillo (1):
600
601- r600: Report multi-plane formats as unsupported
602
603Andres Gomez (3):
604
605- .mailmap: add an alias for Miguel Casas-Sanchez
606- .mailmap: add an alias for Clayton Craft
607- .mailmap: add an alias for Christian Gmeiner
608
609André Almeida (2):
610
611- radv: debug: Update decode ring umr command
612- radv: Search for guilty contexts at radv_check_status
613
614Antonio Gomes (3):
615
616- rusticl: Move nir compilation to Program
617- rusticl: Drop some Kernel data and have a NirKernelBuild ref instead
618- rusticl: Drop Program::kernel_count
619
620Asahi Lina (33):
621
622- asahi: Identify ZS resolve bits (tentative)
623- asahi: Broadcast Z for all components on texture fetch
624- asahi: Enable 2xMSAA (for deqp)
625- asahi: Add batch state debugging
626- asahi: Fix batch writer tracking for null batches
627- asahi: Clear batch->resolve on agx_batch_init
628- asahi: Assert that freed BOs have no pending writers
629- asahi: Fix batch writer_syncobj cleanup
630- asahi: Implement memory_barrier
631- asahi: Implement create_fence_fd and fence_server_sync
632- asahi: Make framebuffer texture barriers a no-op
633- asahi: Disable tilebuffer write masking optimization
634- asahi: Add missing stdbool include to lib/hexdump.h
635- asahi: Fix check for sprite coord mode in agx_bind_rasterizer_state
636- asahi: Add some more system registers
637- asahi: Partially identify some missing index list stuff
638- asahi: Lazily initialize batch state on first draw
639- asahi: Make bo->writer_syncobj atomic
640- ail: Implement multisampling for compression meta calculation
641- asahi: Use ail_can_compress() in agx_compression_allowed()
642- ail: Add MSAA tests
643- asahi: Use os_dupfd_cloexec() instead of dup()
644- asahi: Fix memory leak in agx_nir_lower_sysvals()
645- asahi: Do not leak meta shader NIR
646- asahi: Revert "Advertise ARB_texture_barrier"
647- asahi: Disable PIPE_CAP_SURFACE_SAMPLE_COUNT
648- asahi: Pass through surface sample count
649- asahi: match_soa: Treat offsets as signed
650- asahi: Identify the separate varying count fields
651- asahi: Gather flat/linear shaded input info from uncompiled FS
652- asahi: Fix type confusion for fragment shader keys
653- asahi: Add flat/linear shaded varyings mask to the VS shader key
654- asahi: Arrange VS varyings in the correct order
655
656Axel Davy (17):
657
658- frontend/nine: Fix missing clamping of pointsize for ff
659- frontend/nine: Apply writemask to pointsize
660- frontend/nine: fix fog key overflow
661- frontend/nine: fix wfog
662- frontend/nine: Fix num_textures count
663- frontend/nine: Drop max_ps_const_f
664- frontend/nine: Implement alpha test backup support
665- frontend/nine: Implement backup support for pointsize
666- frontend/nine: Improve VS_WINDOW_SPACE_POSITION fallback
667- frontend/nine: Print warning incomplete position_t support
668- frontend/nine: Enforce legacy pow behaviour
669- frontend/nine: Get rid of INTERPOLATE_COLOR
670- frontend/nine: initialize force_color_in_centroid
671- docs/gallium: Clarify PIPE_CAP_CLIP_PLANES
672- frontend/nine: Implement backup support for clip planes
673- frontend/nine: Fix shader cap test for POSITIONT
674- frontend/nine: Add debug driconf var force_features_emulation
675
676Bas Nieuwenhuizen (11):
677
678- radv: Reserve space for indirect descriptor set address writes.
679- radv: Reserve space in the ACE pre/postambles.
680- radv: Add stricter space checks.
681- radv: Add asserts in radeon_emit{,_array}.
682- radv: Move all the dirty flags from TES binding to TCS binding.
683- amd/drm-shim: Add vangogh entry.
684- amd/drm-shim: Add raphael&mendocino, polaris12 and gfx1100.
685- amd/drm-shim: Update docs for more devices.
686- aco: fix nir_op_vec8/16 with 16-bit elements.
687- aco: Fix some constant patterns in 16-bit vec4 construction with s_pack.
688- nir: Fix 16-component nir_replicate.
689
690Benjamin Cheng (1):
691
692- radv/video: use app provided hevc scaling list order
693
694Benjamin Lee (1):
695
696- intel: Fix stack overflow in intel_dump_gpu
697
698Billy Laws (1):
699
700- wgl: Fix depth/stencil image support when using zink kopper
701
702Blisto (1):
703
704- driconf: set vk_x11_strict_image_count for Wolfenstein II
705
706Boris Brezillon (4):
707
708- panfrost: Check blend enabled state in pan_allow_forward_pixel_to_kill()
709- renderonly: Fix potential NULL deref in the error path
710- renderonly: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource()
711- winsys/panfrost: Make sure we reset scanout on error in create_kms_dumb_buffer_for_resource()
712
713Boyuan Zhang (2):
714
715- frontends/va: add default intra idr period
716- radeonsi: disable H264HIGH10 profile
717
718Brian Paul (5):
719
720- llvmpipe: remove lp_setup_alloc_triangle()'s unneeded tri_size param
721- llvmpipe: code clean-ups in llvmpipe_get_query_result_resource()
722- lavapipe: clean-ups in lvp_GetQueryPoolResults()
723- lavapipe: clean-ups in lvp_physical_device_get_format_properties()
724- lavapipe: asst. clean-ups in lvp_execute.c
725
726Caio Oliveira (56):
727
728- spirv/tests: Add test for single-block loop
729- spirv: Output spirv2nir tool result to stdout
730- spirv: Add --optimize flag to spirv2nir tool
731- spirv: Rework structured control flow handling
732- spirv: Do more on spirv2nir --optimize
733- spirv: Use NIR_PASS for spirv2nir --optimize
734- spirv: Extract vtn_handle_debug_text() helper
735- spirv: Fix gl_spirv_validation when OpLine with strings is present
736- spirv: Improve the 'ID is the wrong kind of value' error messages
737- mesa/spirv: Provide more specific error message for glSpecializeShader()
738- spirv: Validate Dim of OpTypeSampledImage and OpSampledImage
739- spirv: Assert sampler_dim is valid when building nir_tex_instr
740- nir/print: Print 0 when mem_modes or resource_intel have no values
741- nir/print: Do not print raw values
742- spirv: Add workaround for OpImageQueryLevels with Multi-sampled images
743- compiler/types: Make key in subroutine_name more effective
744- r600/sfn: Fix warning about overloads hiding virtual functions
745- spirv: Refactor and rename scope translation helper
746- spirv: Use vtn_translate_scope for OpReadClockKHR
747- intel/compiler: Refactor dump_instruction(s)
748- intel/compiler: Remove unused functions and declarations
749- compiler/types: Be consistent when naming array element/size
750- compiler/types: Tidy up the asserts in get_*_instance functions
751- compiler/types: Use hash table pre-hashed functions for type caching
752- microsoft/clc: Add unreachable() to fix 'may be unitialized' warning
753- compiler: Move from nir_scope to mesa_scope
754- compiler: Add mesa_scope_name() function
755- nir/print: Use mesa_scope_name() function to print scopes
756- intel/compiler: Move brw_kernel.c to the intel_clc target
757- compiler/clc: Rename the internal library from libclc to libmesaclc
758- compiler/clc: Move related NIR passes to the common mesa clc
759- compiler: Move spirv into a module of its own
760- nir/print: Print whether the shader is internal or not
761- intel/compiler: Respect NIR_DEBUG_PRINT_INTERNAL flag
762- meson: Explicitly add "check : false" to a couple instances of run_command
763- vulkan: Add NV suffix to VK_NV_cooperative_matrix feature names
764- vulkan: Update XML and headers to 1.3.255
765- nir: Allow nir_gather_ssa_types() to ignore regs instead of assert
766- nir/print: Improve NIR_PRINT=print_consts by using nir_gather_ssa_types()
767- nir/print: Make NIR_DEBUG=print_consts behavior the default
768- nir: Make a const-friendly way to get the offset_src and arrayed_io_src from intrinsic
769- nir: Extract logic to get dest and srcs types from intrinsic
770- nir/print: Use src_type when printing consts in SSA uses
771- nir/print: Print more representations in load_const
772- nir/print: Use symbols % for SSA and @ for intrinsic
773- nir/print: Use \`bN` instead of \`block_N` for identifying basic blocks
774- nir/print: Use BITSIZExELEMENTS for SSA sizes
775- nir/print: Align instructions around \`=`
776- nir/print: Rename print_tabs() to print_indentation() and use it more
777- nir/print: Don't use comment syntax for deref_cast properties
778- nir/print: Use \`//` for comments
779- nir/print: Use 4-space indentation
780- nir/print: Print div/con annotation first
781- nir/print: Reformat the preds/succs block information
782- meson: Ensure that LLVMSPIRVLib is not required for Clover
783- compiler/types: Use right hash for function types
784
785Caleb Cornett (3):
786
787- d3d12: Fix Xbox GDK build errors
788- wgl: Add BITMAPV5HEADER to stw_gdishim.h
789- d3d12: Fix Xbox frame scheduling for interval != 1
790
791Charmaine Lee (7):
792
793- translate: do not clamp element index in generic_run
794- svga: set PIPE_CAP_VERTEX_ATTRIB_ELEMENT_ALIGNED_ONLY for VGPU10 device
795- mesa/main: fix distance attenuation calculation in ffvertex
796- svga: fix shader type after ntt
797- svga: fix compute shader type after ntt
798- svga: lower images before ntt
799- svga: set clear_texture to NULL for vgpu9
800
801Chia-I Wu (24):
802
803- drm-shim: apply file overrides for open
804- amd/drm-shim: add amdgpu drm-shim
805- hasvk: Refactor Android externalFormat handling in CreateYcbcrConversion
806- hasvk/android: Use VkFormat for externalFormat
807- hasvk: Use the common vk_ycbcr_conversion object
808- vulkan: make sure vk_image::format is never UNDEFINED
809- vulkan: make sure vk_image_view::format is never UNDEFINED
810- vulkan: rename vk_image::ahardware_buffer_format
811- vulkan: define inline stubs when android api level < 26
812- vulkan: add vk_ahb_format_to_image_format
813- anv,hasvk,radv: do not fall back to AHARDWAREBUFFER_FORMAT_BLOB
814- vulkan: add vk_image_format_to_ahb_format
815- anv,hasvk: android ahb is not always exportable
816- radv: improve externalMemoryFeatures for android ahb
817- amd/drm-shim: add raven2
818- ac/surface: print tile_swizzle as well
819- radv: do not use a pipe offset for aliased images
820- aco: fix alignment check in emit_load
821- ac, radeonsi: add and use ac_get_ps_iter_mask
822- radv: fix gl_SampleMaskIn for sample shading
823- radv: fix msaa feedback loop without tc-compat cmask
824- radv: fix non-square compressed image copy on gfx9
825- radv: disable calibrated timestamps on raven/raven2
826- ac/surface: limit RADEON_SURF_NO_TEXTURE to color surfaces
827
828Christian Gmeiner (31):
829
830- etnaviv: Add util_blitter_save_so_targets(..) call
831- etnaviv: nir: improve uniform usage for ALU opc
832- etnaviv: correct number of instructions in dump_shader_info(..)
833- etnaviv: move printing of final shader out of etna_link_shaders(..)
834- etnaviv: nir: do not call nir_lower_idiv(..) unconditionally
835- etnaviv: make wider use of DBG_ENABLED(..)
836- ci: add debian-arm32-asan
837- ci/etnaviv: add asan run
838- etnaviv: Add support for conditional rendering
839- etnaviv: add support for performance warnings
840- mesa/arbprog: fix compile errors
841- etnaviv: remove tgsi remains
842- etnaviv: drop usage of tgsi_swizzle_names
843- etnaviv: remove not used tgsi includes
844- ci/etnaviv: update ci expectation
845- ir3/analyze_ubo_ranges: Move IR3_DBG_NOUBOOPT check
846- etnaviv: nir: call nir_remove_dead_variables(..) before linking setup
847- etnaviv: linker: add fallback lookup to VARYING_SLOT_BFC[n]
848- nir: add helper to clear all pass_flags
849- nir/lower_amul: make use nir_shader_clear_pass_flags(..)
850- etnaviv: make use nir_shader_clear_pass_flags(..)
851- etnaviv: nir: do a late nir_opt_cse run
852- docs: mark OES_texture_half_float done on etnaviv
853- etnaviv: support OES_texture_half_float_linear
854- ci/etnaviv: update ci expectation
855- docs: update etnaviv extensions
856- etnaviv: linker: handle scenario where there are FS inputs without matching VS output
857- etnaviv: linker: clean up etna_link_shader(..)
858- nir: rename intrinsic to have a more generic nameing
859- nir: rename has_txs to has_texture_scaling
860- nir/lower_tex: optimize offset lowering for has_texture_scaling
861
862Christopher Snowhill (2):
863
864- Corrects log print to produce hexadecimal base output
865- intel: Sync xe_drm.h
866
867Collabora's Gfx CI Team (4):
868
869- Uprev Piglit to 79a084c56b6dd79f7c3a97b57a72963121ebb1e6
870- Uprev Piglit to 536975d94a40cf76a69fcfa786c2513eccd0c989 https://gitlab.freedesktop.org/mesa/piglit/-/compare/79a084c56b6dd79f7c3a97b57a72963121ebb1e6...536975d94a40cf76a69fcfa786c2513eccd0c989
871- Uprev Piglit to d8c08d123fadb986e9a8a7887b922ff63fcff52e https://gitlab.freedesktop.org/mesa/piglit/-/compare/536975d94a40cf76a69fcfa786c2513eccd0c989...d8c08d123fadb986e9a8a7887b922ff63fcff52e
872- Uprev Piglit to 5036601c43fff63f7be5cd8ad7b319a5c1f6652c
873
874Connor Abbott (42):
875
876- tu: Don't override depth for GMEM
877- tu: Don't pre-shift depth and stencil pitch
878- freedreno/fdl: Don't pre-shift image view pitch
879- freedreno/fdl: Expose view offset
880- tu: Add 3D GMEM load path
881- tu: Use dirty bit for scissor state
882- tu: Precompute maximum views across all subpasses
883- tu: Merge RB_DEPTH_CNTL and RB_STENCIL_CONTROL drawstates
884- tu: Make dynamic viewport and scissor count more accurate
885- freedreno/a6xx: Document per-view viewport in GRAS_SU_CNTL
886- tu: Parse fragment density map attachment info
887- tu: Implement sampling the fragment density map
888- tu/cs: Add support for CS patching
889- tu: Add core FDM patchpoint infrastructure
890- ir3: Record whether a shader writes gl_ViewportIndex
891- tu: Implement FDM viewport patching
892- tu: Implement FDM scaled loads/stores
893- nir, ir3: Add option to use unscaled FragCoord for input attachments
894- tu, ir3: Handle FDM shader builtins
895- tu/autotune: Always prefer GMEM with fragment density maps
896- tu: Don't allow importing/exporting subsampled images with modifiers
897- tu: Expose VK_EXT_fragment_density_map
898- util/bitset: Add some extra functions
899- vk/graphics_state: Remove vk_subpass_info
900- vk/graphics_state: Add feedback_loop_input_only
901- vk/graphics_state: Add VI_BINDINGS_VALID state
902- vk/graphics_state: Fix some assertions when copying state
903- vk/graphics_state: Add helpers for pre-baking state
904- radv: Fix radv_pipeline_is_blend_enabled
905- vk/graphics_state: Track attachment count as state
906- vulkan: Fix renderpass flags with driver-specific renderpass
907- vk/graphics_state: Don't track each vertex input field
908- tu: Don't use A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN
909- freedreno/a6xx: Fix name of A6XX_PC_PRIMITIVE_CNTL_0::TESS_UPPER_LEFT_DOMAIN_ORIGIN
910- tu: Split pipeline struct into different types
911- tu: Rewrite to use common Vulkan dynamic state
912- tu: Use common dirty tracking for PC_PRIMITIVE_CNTL_0
913- freedreno/regs: Document a7xx CP_FIXED_STRIDE_DRAW_TABLE
914- tu: Fix vk2tu_*_stage flag type
915- vk/graphics_state: Fix copying MS locations pipeline state
916- tu: Fix per-view viewport state propagation
917- tu: Fix assert in FDM state emission
918
919Constantine Shablia (3):
920
921- anv: move get_features after get_device_extensions (ugly diff)
922- panvk: use common vkGetPhysicalDeviceFeatures2
923- v3dv: use common vkGetPhysicalDeviceFeatures2
924
925Constantine Shablya (7):
926
927- vulkan: add common implementation of vkGetPhysicalDeviceFeatures2
928- vulkan: introduce supported_features parameter to vk_physical_device_init
929- anv: switch to using the common vkGetPhysicalDeviceFeatures2
930- vulkan: inline vk_get_physical_device_features into vk_common_GetPhysicalDeviceFeatures2
931- vulkan: put interesting code before boring code
932- vulkan: put TEMPLATE_H before TEMPLATE_C
933- vulkan: rename vk_physical_device_features.py to vk_physical_device_features_gen.py
934
935Corentin Noël (18):
936
937- ci: Uprev crosvm and virglrenderer
938- nir: Propagate the type sampler type change to the used variable.
939- build-crosvm: Use the pkg-config crate 0.3.27
940- util: Use the gcc_struct attribute for packed structures in mingw
941- ci: Bump base tag to rebuild piglit
942- ci: uprev virglrenderer and crosvm
943- gallium: Incorporate the device release in dri_destroy_screen_helper
944- gallium: Rename dri_destroy_screen_helper into dri_release_screen
945- pipe-loader: Document the behavior regarding screen creating failures
946- pipe-loader: Do not destroy the winsys on screen creation failure
947- gallium: Only call dri_init_options when the screen is actually created
948- gallium: Use the common destroy function on screen initialization failure
949- gallium: Rename dri_init_screen_helper into dri_init_screen
950- compiler: Allow the explicit_stride of aoa types to be zero
951- nir/split_64bit_vec3_and_vec4: Use the right number of components
952- ci: Uprev virglrenderer
953- ci: Add locked flag to bindgen-cli installation
954- virgl: Do not expose EXT_texture_mirror_clamp when using a GLES host
955
956Daniel Schürmann (60):
957
958- radv/rt: fix total stack size computation
959- radv/rt: properly destroy radv_ray_tracing_lib_pipeline on error
960- radv/rt: rename radv_ray_tracing_module -> radv_ray_tracing_group
961- radv/rt: add shader stage indices to radv_ray_tracing_group
962- radv/rt: replace uses of pGroups with radv_ray_tracing_group
963- radv/rt: remove merged VkRayTracingShaderGroupCreateInfoKHR
964- vulkan/pipeline_cache: replace raw data objects on cache insertion of real objects
965- vulkan/pipeline_cache: use vk_pipeline_cache_insert_object() to replace raw data objects
966- radv: add padding to radv_shader_binary_legacy
967- vulkan/pipeline_cache: expose vk_raw_data_cache_object
968- radv/pipeline_cache: add NIR caching capabilities
969- radv/rt: expose radv_parse_rt_stage()
970- radv/rt: introduce struct radv_ray_tracing_stage
971- radv/rt: retain parsed NIR shaders in radv_ray_tracing_lib_pipeline
972- radv/rt: use precompiled stages to create RT shader
973- radv/rt: refactor compute_rt_stack_size() to use radv_ray_tracing_stage information
974- radv/rt: remove merged VkPipelineShaderStageCreateInfo
975- radv/rt: Fix and improve VkPipelineCreationFeedback
976- radv/rt: change base of radv_ray_tracing_lib_pipeline to radv_compute_pipeline
977- radv/rt: unify radv_ray_tracing_lib_pipeline and radv_ray_tracing_pipeline
978- radv/rt: unify radv_rt_pipeline_create() and radv_rt_pipeline_library_create()
979- radv/rt: refactor radv_rt_pipeline_compile()
980- radv/rt: use vk_multialloc for radv_ray_tracing_pipeline
981- radv/rt: store stack_sizes per stage instead of per group
982- vulkan/pipeline_cache: don't log warnings for internal caches
983- vulkan/pipeline_cache: don't log warnings for client-invisible caches
984- radv: add remaining RT shader args for separate compilation
985- nir,amd: add nir_intrinsic_store_[scalar|vector]_arg_amd to overwrite inputs
986- nir: add nir_intrinsic_resume_shader_address_amd
987- aco: implement nir_intrinsic_load_resume_shader_address_amd
988- aco: implement select_program_rt()
989- radv/rt: adjust shared_size when lowering hit_attribs
990- radv/rt: extend radv_pipeline_group_handle with shader VAs
991- radv/shader_info: add RT stages to radv_get_user_data_0()
992- radv/rt: implement radv_nir_lower_rt_abi to lower RT shaders for separate compilation
993- radv/rt: implement radv_rt_nir_to_asm()
994- radv/rt: change RT main shader to MESA_SHADER_INTERSECTION
995- radv/rt: replace pCreateInfo with VkPipelineCreateFlags in rt_variables
996- radv/rt: pass radv_ray_tracing_pipeline to RT shader creation
997- radv/rt: add and use specialized cache search/insert functions
998- radv/rt: reference library shaders during radv_rt_fill_stage_info()
999- radv/rt: don't write cache hit feedback per stage.
1000- radv/rt: create compile_rt_prolog() function
1001- radv/rt: set up RT shader args for separate compilation
1002- radv/rt: adjust lower_rt_instructions() for shader functions [disables RT]
1003- aco: adjust RT prolog for shader functions [disables RT]
1004- radv/rt: separate shader compilation
1005- radv/debug: dump ray tracing shaders in case of a hang
1006- radv/rt: use priorities to select the next shader
1007- radv/rt: remove now dead code
1008- radv: reference pipeline cache object in radv_pipeline
1009- aco/assembler: align resume shaders with cache lines
1010- aco/assembler: align loops if it reduces the number of cache lines
1011- aco/assembler: change prefetch mode on GFX10.3+ during loops if beneficial
1012- vulkan/pipeline_cache: add 'skip_disk_cache' option
1013- radv/meta: disable disk cache for meta shaders
1014- radv: migrate radv_shader hash to BLAKE3
1015- amd: move end-of-code marker padding to ACO.
1016- amd: Do shader binary alignment for prefetch at memory allocation time.
1017- aco/insert_exec_mask: set Exact mode after p_discard_if when necessary
1018
1019Daniel Stone (10):
1020
1021- wsi/wayland: Support VK_KHR_present_wait
1022- ci/zink: Disable Freedoom trace on ANV
1023- ci: Respect $HTTP_PROXY for ci_run_n_monitor
1024- ci: Elaborate causes for job retries
1025- ci: Don't retry manual or scheduled jobs
1026- ci: Extend a618_vk_full runtime
1027- CI: Re-enable freedreno CI
1028- ci/fdno: Pause a660 testing
1029- Revert "ci/fdno: Pause a660 testing"
1030- egl/wayland: Always initialise fd_display_gpu
1031
1032Danylo Piliaiev (42):
1033
1034- freedreno: Early exit in device matching if id doesn't have chip_id
1035- ir3/a7xx: NOPs may have some no-op bits set
1036- ir3/a7xx: Add new lock/unlock CS instructions
1037- ir3/a7xx: Add new form of stg.a/ldg.a addressing
1038- ir3/a7xx: Add STSC definition
1039- ir3: Document that stc has higher DST upper bound than we defined
1040- ir3/a7xx: Document "alias" instruction
1041- ir3: documents (ss) flag for cat7 instructions
1042- tu: Create drm fd per logical device
1043- tu: Move VMA heap to the logical device
1044- tu: Re-enable bufferDeviceAddressCaptureReplay
1045- freedreno/perfcntrs: Link with libfreedreno_common
1046- freedreno: Decouple GPU gen from gpu_id/chip_id
1047- freedreno,ir3: Don't call fd_dev_64b more than necessary
1048- freedreno/decode: Correctly handle chip_id
1049- tu: Add missing dbg reg stomping to tu_CmdBeginRendering
1050- tu: Fix zombie VMAs array not initialized when first BOs may be freed
1051- freedreno/regs: Print xml validation error if validation fails
1052- freedreno/rnn: Fix addvariant being set effectively once
1053- freedreno/rnn: Make addvariant work for fields in the same reg
1054- freedreno/rnn: Take into account array's variant for regs
1055- freedreno/regs: Change a7xx regs to have open range for generation
1056- freedreno/regs: More CP commands are the same on a7xx as on a6xx
1057- freedreno/regs: Document CP_MEM_TO_SCRATCH_MEM
1058- freedreno/regs: Document a7xx CP_MODIFY_TIMESTAMP
1059- freedreno/regs: Clarify polling on a7xx for CP_WAIT_REG_MEM/CP_COND_WRITE5
1060- freedreno/regs: Add a7xx pseudo-regs to CP_SET_PSEUDO_REG
1061- freedreno/regs: a7xx has a new source type CP_REG_TEST
1062- freedreno/regs: Add 2 new a7xx modes to CP_COND_REG_EXEC
1063- freedreno/regs: Add some new a7xx events
1064- freedreno/regs: Add more a7xx regs and reg fields
1065- freedreno/regs: Fix a7xx SP_FS_PREFETCH definition
1066- freedreno/regs: Generate per-gen reg usage tables
1067- freedreno/regs: Define usage for all a6xx/a7xx regs
1068- tu: Allow reg stomping of compute related registers
1069- tu: Use reg usage tables for stale reg dbg option
1070- freedreno/regs: Properly document a7xx CP_EVENT_WRITE, CP_WAIT_TIMESTAMP
1071- freedreno/regs: Document a7xx CP_BV_BR_COUNT_OPS
1072- freedreno/regs: Rename SP_FS_CTRL_REG0.DIFF_FINE into LODPIXMASK
1073- ir3: Fix FS quad ops returning wrong values from helper invocations
1074- tu,freedreno: Forbid blit event for R8G8_SRGB due to gpu faults
1075- radv: fix unused non-xfb shader outputs not being removed
1076
1077Dave Airlie (134):
1078
1079- radeonsi/ac: move some vcn defines to common
1080- radv/video: add missing gfx family
1081- radv: set a video decode ip block in physical device.
1082- radv/winsys: handle encoder queue padding/submits.
1083- radv/video: add a video addr gfx mode
1084- radv/video: fix dpb surface programming
1085- radv/video: start adding gfx11 vcn decoder
1086- lp_jit: use pipe max for the lp_jit texture levels.
1087- gallivm: consolidate draw/lp texture type.
1088- gallivm: consolidate llvmpipe/draw sampler types.
1089- gallium: consolidate jit image types between draw/llvmpipe
1090- gallivm: reorder some texture/image members.
1091- vulkan/cmd_queue: handle beta extensions.
1092- vulkan: write beta extensions into generator scripts.
1093- draw: align common members in jit context structs.
1094- llvmpipe: refactor fs/cs jit structure members.
1095- gallivm: refactor common resources out of contexts
1096- gallivm/draw/llvmpipe: consolidate the sampler/image dynamic state fns
1097- gallivm: add common code for sample/image tracking.
1098- llvmpipe: move to common sampler/image binding code
1099- draw: move to use common sampler/image binding code
1100- llvmpipe/cs: refactor cs generator args to use an enum
1101- gallivm/draw: refactor vertex header jit type out
1102- llvmpipe: convert a bunch of shader_type ifs to switches.
1103- llvmpipe/cs: start making variant generator less compute specific
1104- llvmpipe/cs: support passing a csctx instead of using implicit one
1105- lavapipe: add lavapipe specific shader stages define.
1106- lvp: explictly skip compute shader stage.
1107- gallivm: fix whitespace in get_deref_offset
1108- gallivm/nir: refactor the local invocation index calc.
1109- lvp: use stage mask
1110- lvp: use stage iterator macros instead of explicit loops
1111- ci: reenable lavapipe
1112- radv/video: add missing space checks for video.
1113- radv/video: use correct h264 levels
1114- radv/video: fix h264/265 dpb usage.
1115- radv/video: add missing offset to the dpb binding.
1116- radv/video: rework stream handle generation.
1117- radv/video: fix some whitespace.
1118- radv/video: add debug flag to enable dpb image array on newer GPUs.
1119- radv/video: fix physical device format property count.
1120- vk/video: add a common function to get block alignments for profiles
1121- radv: align video images internal width/height inside the driver.
1122- anv/video: move format properties to outarray.
1123- radv/meta: fix uninitialised stack memory usage.
1124- gallium: add task/mesh shader query types to stats interface.
1125- gallium: expand pipe_grid_info to handle task/mesh.
1126- gallium: add a new PIPE_SHADER_MESH_TYPES
1127- freedreno: don't report task/mesh.
1128- gallium: add task/mesh shader entrypoints in context
1129- iris: don't return shader params for task/mesh.
1130- crocus: don't report mesh/task limits
1131- radeonsi: don't report shader params for task/mesh
1132- svga: don't report mesh/task shader limits
1133- d3d12: don't report mesh/task limits
1134- gallium/cso: add task/mesh shaders to the cso cache
1135- gallium/nir/tgsi: add various support for task/mesh bits
1136- lavapipe: when in doubt, swizzle the swizzle
1137- lavapipe: fix pipeline sanitizing.
1138- lavapipe: fix indentation whitespace
1139- draw: add mesh shader infrastructure
1140- draw: move draw_vertex_info and draw_prim_info to public header.
1141- draw: add a mesh primitive assembler.
1142- draw: add mesh pipeline middle end.
1143- draw: add support for per primitive aos emission
1144- gallivm: add support for payload access
1145- gallivm/nir: add launch mesh workgroups
1146- gallivm/nir: add a mesh interface and vert/prim count setting.
1147- gallivm/nir: call task shader lowering.
1148- gallivm/nir: add support for mesh shader outputs.
1149- llvmpipe: resize arrays to handle mesh shaders.
1150- llvmpipe: start adding task/mesh support.
1151- llvmpipe: bump dirty tracker to 64-bits.
1152- llvmpipe: add dirty bits for mesh and task shaders.
1153- llvmpipe: add debug bit for mesh shaders
1154- llvmpipe: add query support for task/mesh shaders
1155- llvmpipe: bind task/mesh resources and dirty bits
1156- gallivm/cs: add payload ptr to the cs thread data.
1157- llvmpipe/cs: add task/mesh shader support to compute shader builder.
1158- llvmpipe/cs: add multiple stride indirect to fill_grid_info.
1159- llvmpipe: add mesh shader drawing.
1160- llvmpipe: enable task/mesh shader support.
1161- lavapipe: handle some mesh shader stage differences.
1162- lavapipe: add mesh query support
1163- lavapipe: add support for task/mesh shader stages in various places
1164- lavapipe: add execution backends for mesh shader draw apis
1165- lavapipe: enable task/mesh shaders.
1166- docs: update docs for lavapipe mesh shading
1167- llvmpipe: emit fences for barrier.
1168- lavapipe: don't remove queue family barriers.
1169- gallivm/nir: fix shuffleup tests.
1170- draw: rename jit to vs_jit in lots of places.
1171- draw/tess: drop unused tgsi bits.
1172- gallium/tgsi/draw/softpipe: remodel shader const/buffer bindings.
1173- draw: refactor resources to use arrays instead of explicit structs.
1174- draw: add a max stage define and use it in a few places
1175- draw: repack some members of context.
1176- radv/video: convert video format properties to an outarray
1177- radv/video: convert session memory requirements to outarray.
1178- radv/video: don't supply an 8-bit format for a 10-bit dpb.
1179- radv/video: rework h265 reference frame bindings.
1180- radv/video: fix hevc st rps programming
1181- radv/video: fix hevc scaling lists.
1182- lavapipe: ignore another yuv format.
1183- radv/video: report bad profile operation if h264 profile isn't supported.
1184- radv/video: fix hevc scaling list order.
1185- radv/video: program frame number correctly.
1186- radv/video: program hevc max dec pic buffering correctly
1187- radv/video: restrict the number of IBs on video related queues.
1188- ac/radeonsi: add av1 defaults header file from radeonsi
1189- radv/video: drop incorrect defines for uapi ones.
1190- lavapipe: check sampler pointer before deref
1191- draw/gs: handle extra shader outputs in geometry.
1192- lavapipe: expose subgroups in mesh/task shaders.
1193- gallivm: store thread id in separate values.
1194- gallivm: convert block_id to discrete values.
1195- gallivm: convert grid_size to discrete values.
1196- gallivm: make block_size use discrete values.
1197- clc: llvm 17 requires opaque pointers.
1198- gallium/va: fix superres av1 decoding.
1199- llvmpipe/linear: don't allow linear path for shader output with location frac
1200- llvmpipe/linear: refactor linear samplers into templated code.
1201- llvmpipe/linear/tgsi: calculate num_texs properly for nir.
1202- llvmpipe/linear: add sample routines for swapping r/b channels
1203- llvmpipe/linear: add support for sampling when cbuf order is different.
1204- llvmpipe/linear: add support for rgba color buffers.
1205- ci: update fails for fixed tests due to llvmpipe linear changes.
1206- gallivm: fix atomic global temporary storage.
1207- llvmpipe: fix fragdata/lastfragdata heuristic a bit more.
1208- zink: turn off threaded cpu access if not visible.
1209- llvmpipe: enable f16 paths on aarch64.
1210- radv: don't emit event code on video queues.
1211- spirv: use a pointer sized int type for opencl event_t
1212- radv/video: take db alignment into account when allocating images.
1213
1214David (Ming Qiang) Wu (1):
1215
1216- radeonsi/vcn: add an exception of field case for h264 decoding
1217
1218David Heidelberg (129):
1219
1220- ci/amd: 4/5 runners TPad-C13 runners are online, restore most of the tests
1221- ci/dxvk: uprev to 2.1
1222- ci/amd: update checksums after DXVK 2.1 update
1223- ci: bump kernel to the 6.3, support HDK 888 based on sm8350
1224- ci/freedreno: do not restrict to 2 cpus on a530
1225- ci: drop overriding new a530 firmware due to preemption issues with older kernel
1226- ci/freedreno: a530 behaves stable in 6.3
1227- ci/freedreno: update a530 flakes, fails and skips
1228- ci/freedreno: fix the a530_piglit job and switch to Weston
1229- ci: polish deqp-runner a bit
1230- ci: uninstall libdrm from the GL and VK containers
1231- ci: do not retry on forks to get the upstream kernel and rootfs
1232- ci/mold: bump to 1.11.0
1233- ci: add Adreno 660 on sm8350 chipset (HDK 888)
1234- ci/lava: implement fastboot support
1235- ci/lava: add support for HDK 888 firmware
1236- ci: add a660 firmware into rootfs
1237- pvr: drop unused variable
1238- ci/dzn: add flaking test
1239- ci/skqp: replace license with SPDX and extract the used branch
1240- ci/skqp: update to the Android CTS 12.1_r5 version
1241- mesa/main: drop unused variable
1242- nir/lower_io_to_vector: initialize base
1243- panvk: clear dangling pointers
1244- ci: uprev kernel to 6.3.1 with fixed patch for Adreno SMMU
1245- util/tests: adjust for new gtest
1246- gtest: Update to 1.13.0
1247- ci/skqp: handle all warnings printed with clang >= 14
1248- panvk: drop path from panvk_physical_device struct
1249- venus: drop unused sem_feedback_count from vn_queue_bind_sparse_submit_batch
1250- ci/broadcom: skip timeouting ssbo.layout.3_level_array.std430.mat4 on RPi4
1251- ci/venus: add recent flakes
1252- ci/freedreno: add recent a630 flake
1253- ci/v3d: add flaking opengl 1.1@depthstencil-default_fb-drawpixels-float-and-ushort
1254- ci/amd: re-enable VA-API testing
1255- ci/rules: radeonsi VAAPI rules should include also VA-API targets
1256- ci: update libva to 2.18.1
1257- ci/gtest: improve the runner script
1258- ci/amd: update VA-API expectations
1259- ci/amd: add radeonsi-raven-va-full job to cover all VA-API tests
1260- ci/gtest-runner: fix results reporting
1261- ci/venus: add missing flakes
1262- ci/crosvm: update cmdline options
1263- docs: update crosvm networking options
1264- ci/radv: add another raven flake dEQP-VK.draw.dynamic_rendering.primary_cmd_buff.linear_interpolation
1265- ci/v3dv: add often timeouting ssbo.layout.3_level_array.std140.column_major_mat4
1266- r300: workaround GCC 12+ warning, declare NULL value as unreachable
1267- docs: use meson instead invoking ninja directly
1268- ci/freedreno: disable 3 jobs to match our farm 3 devices down
1269- ci/freedreno: rename piglit job to respresent the real testing it does
1270- ci: move from pkg-config to pkgconf
1271- ci: use meson setup and meson install instead of meson and invoking ninja directly
1272- ci: bump libdrm from 2.4.110 to 2.4.114 present in Debian 12
1273- ci: install stock android-libext4-utils (available in 12, bookworm)
1274- ci: bump gfxreconstruct revision up to compatible version with Debian 12
1275- ci: libwayland from 1.18 to 1.21 and wayland protocols from 1.24 to 1.31
1276- ci: VVL uprev (temporary until new release will be published)
1277- ci: bump from Debian 11 (bullseye) to 12 (bookworm)
1278- ci/apitrace: install win64 apitrace only on x86_64
1279- ci/crosvm: install libelogind0 and sysvinit-core for poweroff functionality
1280- ci: add clang-15 and clang++-15 wrapper script
1281- ci/skqp: skqp can't live with compiler named clang-15, provide symlink
1282- ci: drop gallium-aux test on msan builds, renable freedreno
1283- ci/mingw: disable as it's broken
1284- ci/venus: add fail after CI uprev to the Debian 12
1285- ci/virpipe: add flakes introduced with CI uprev to Debian 12
1286- ci/zink: disable flaking anv traces
1287- ci: enable shellcheck on whole .gitlab-ci
1288- ci: disable bogus GCC warning with -Warray-bounds
1289- ci: do not fail when SHA1 impl. produce stringop-overreads warning
1290- ci/lavapipe: document subgroups.shuffle.compute.subgroupshuffleup_double_constant crash
1291- ci/lavapipe: zink failures
1292- ci/llvmpipe: document intel_shader_atomic_float_minmax@execution@ssbo-atomic*
1293- bin/ci: mention requirements.txt
1294- gitlab: add template for merge requests
1295- ci/zink: add KHR-GL46.limits.max_fragment_interpolation_offset flake
1296- ci/amd: previously missed raven flake
1297- ci/panfrost: add largest possible eglcreatepbuffersurface and then glclear flake
1298- gitlab: prefill MR template with first multiline commit message
1299- ci: bump Alpine to 3.18
1300- ci/ccache: recent ccache changed a output a bit, adapt script
1301- ci: rename x86 and amd64 to x86_64, armhf to arm32, and i386 to x86_32
1302- ci: use bash arrays in Fedora script + shebang change
1303- ci/fedora: re-enable ccache
1304- traces: update sir-f720 trace expectations for zink on anv and freedreno
1305- ci: missed variable inside the big rename and split ARCH and DEBIAN_ARCH
1306- ci: fix KVM module modprobe code
1307- ci: explicitely state BUILDTYPE
1308- ci: rename S3 artifacts according to scheme mesa-$arch-$config-$buildtype
1309- ci: rename MINIO to S3
1310- ci: rename MINIO_HOST variable to S3_HOST
1311- ci: replace MINIO_RESULTS_UPLOAD with S3_RESULTS_UPLOAD
1312- ci: remove BUILD_PATH, always use S3_ARTIFACT_NAME
1313- ci/lava: rename rest local MINIO\_ variables to S3\_
1314- ci/android: remove the artifact file just as we unpack it
1315- ci: valve and freedreno farm is down
1316- ci/windows: move microsoft farm rules
1317- ci/etnaviv: if farm is down, we expect no manual jobs can be triggered
1318- ci/amd: hide vaapi job dependent on Collabora farm when it's down
1319- ci/crocus: depend on state of the Anholt farm
1320- ci: implement farms handling trough files inside .ci-farms
1321- ci/docs: fixup incorrect spacing around console block
1322- ci/panfrost: switch panfrost-g52-piglit-gles2 from X to XWayland
1323- ci/fastboot: use gzipped Image to avoid compressing on the runner
1324- ci/microsoft: uploading artifacts gets stuck currently (retried)
1325- ci/microsoft: rename manual rules according to rest introduced rules
1326- ci: create manual farm rules
1327- ci/traces: guard DXVK and VK behind VK_DRIVER
1328- ci/apitrace: include version with LTO enabled
1329- ci/traces: print version of apps used for replaying traces
1330- ci: when touching farms, never run manual jobs
1331- ci/microsoft: partly revert rename from container-rules to manual-rules
1332- ci/x86: Build ANGLE for testing layering on VK drivers.
1333- ci/amd: switch all possible jobs from X11 to Wayland
1334- ci/freedreno: switch a630_{piglit,skqp} and a618_gl to Weston
1335- ci/freedreno: re-enable a530 as it's now stable with multiple skips
1336- ci/freedreno: document number of a630 devices available
1337- ci/freedreno: add KHR-GL46.buffer_storage flakes
1338- ci/freedreno: add execution@varying-struct-copy-return-vs flake
1339- ci/container: add weston into Vulkan container
1340- ci/container: we need to keep the wine inside
1341- ci/traces: switch from xvfb to Weston XWayland
1342- ci/freedreno: another batch of a530 flakes
1343- ci: add quirk for GitLab assuming changes is always true for scheduled runs
1344- ci/microsoft: when re-enabling Windows Farm, always run the container
1345- ci: disable Material Testers.x86_64_2020.04.08_13.38_frame799.rdc trace
1346- ci/amd: fix timeouting radeonsi-raven-va-full job
1347- ci: add perfetto into mesa git-cache
1348- ci/deqp: really remove the uncompressed results.csv file
1349
1350David Redondo (1):
1351
1352- egl/wayland: fix oob buffer access during buffer_fds clean up
1353
1354David Rosca (7):
1355
1356- radeonsi: Use DIV_ROUND_UP instead of ALIGN_POT
1357- frontends/va: Init view_resources array in vlVaPut/GetImage
1358- frontends/va: Ignore requested size when creating VAEncCodedBufferType
1359- Revert "radeonsi/vcn: add an exception of field case for h264 decoding"
1360- frontends/va: Flush after unmapping VAImageBufferType
1361- frontends/va: Process VAEncSequenceParameterBufferType first in vaRenderPicture
1362- frontends/va: Set default rate control values once when creating encoder
1363
1364Derek Foreman (1):
1365
1366- vulkan/wsi: Allow binding presentation_timing when software rendering
1367
1368Diederik de Haas (1):
1369
1370- treewide: spelling fixes
1371
1372Dmitry Baryshkov (3):
1373
1374- freedreno/registers: updte HDMI registers to include CEC details
1375- freedreno/registers: add bitfield for DSI wide bus enablement
1376- tu: Pass real size of prime buffers to allocator
1377
1378Dmitry Osipenko (4):
1379
1380- iris/bufmgr: Use intel_ioctl() helper for GEM_SET_TILING
1381- intel/dev: Use intel_ioctl() helper for GEM_SET_TILING
1382- anv: Use intel_ioctl() helper for GEM_SET_TILING
1383- hasvk: Use intel_ioctl() helper for GEM_SET_TILING
1384
1385Dmitry Rogozhkin (1):
1386
1387- meson/vaon12: fix driver file name for mingw build
1388
1389Donald Robson (2):
1390
1391- pvr: Move heap initialisation out of pvr_winsys_helper.
1392- pvr: Rename rogue_fw.xml -> rogue_kmd_stream.xml.
1393
1394Dor Askayo (3):
1395
1396- meson: add feature option for use of system Clang headers at runtime
1397- ci: Disable "opencl-external-clang-headers" when "microsoft-clc" is enabled
1398- nouveau: add exported GEM handles to the global list
1399
1400Dr. David Alan Gilbert (4):
1401
1402- rusticl/screen: Wrap get_timestamp
1403- rusticl/device: Stash timestamp availability
1404- rusticl/api: Implement get_{device_and\_}host_timer
1405- rusticl/api: Wire up CL_DEVICE_PROFILING_TIMER_RESOLUTION
1406
1407Dylan Baker (57):
1408
1409- docs: add release notes for 23.0.1
1410- docs: Add sha256 sum for 23.0.1
1411- docs: add release notes for 23.0.2
1412- docs: Add sha256 sum for 23.0.2
1413- docs: add release notes for 23.0.3
1414- docs: Add sha256 sum for 23.0.3
1415- docs: update calendar for 23.0.1
1416- docs: update calendar for 23.0.2
1417- docs: update calendar for 23.0.3
1418- docs: add release notes for 23.0.4
1419- docs: Add sha256 sum for 23.0.4
1420- docs: update calendar for 23.0.4
1421- intel/tools/error2aub: Fix potential out of bounds read
1422- meson: Key whether to build batch decoder on expat
1423- bin/pick: fix issue where None for nomination_type could fail
1424- bin/pick: use lineboxes to make the UI clearer
1425- bin/pick: Add support for adding notes on patches
1426- bin/pick-ui: use asyncio.new_event_loop
1427- meson: Add back execmem option as a deprecated option
1428- VERSION: update to 23.2.0-rc1
1429- docs: Update release calendar for 23.2.0-rc1
1430- .pick_status.json: Update to 6e87b277bde71e30c98ab9dda7bd2f2017b77ed5
1431- .pick_status.json: Update to 27d30fe3c0e71efd90fcfe209d8515b195b0075f
1432- .pick_status.json: Update to 3a8aae9e6aa526367523c58dfe5046909776be74
1433- .pick_status.json: Update to 59087003c4b7a4f5a6bf207f214a4c3443b9759f
1434- ci: mark passing zink and lima tests as expected
1435- docs: truncate new_features.txt
1436- docs: add release notes for 23.2.0
1437- VERSION: update to 23.2.0
1438- docs: Update release calendar for 23.2.0
1439- docs: Add sha256 sum for 23.2.0
1440- Revert incorrect 23.2.0 release
1441- VERSION: update to 23.2.0-rc2
1442- docs: Update release calendar for 23.2.0-rc2
1443- .pick_status.json: Update to e88c0770969f6ae0bfa5bea0f9d99687d257fea1
1444- .pick_status.json: Mark d3f26cbbe1a957b76804da44bbf5e30de2bac941 as denominated
1445- .pick_status.json: Update to c5a6e88c4e816ded6105b74f101528eb004e0581
1446- .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59
1447- .pick_status.json: Update to 088c2bbd51a48eb0de1e9fd23c529759585bad59
1448- VERSION: update to 23.2.0-rc3
1449- docs: Extend calendar entries for 23.2 by 2 releases.
1450- docs: update calendar for 23.2.0-rc3
1451- .pick_status.json: Update to 10e75aae1bddee9795b1ff04ffd656b0da79b5b5
1452- .pick_status.json: Updates notes for aebe58458611e0bb585a5bce8e16c1175783f3cc
1453- .pick_status.json: Updates notes for f8cb0d8a44afb9c70f38e359ffe0ad57416e66a4
1454- Revert "Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR""
1455- .pick_status.json: Updates notes for 93b4f200dead198e680991a1e95bf3d3b58f87bd
1456- .pick_status.json: Updates notes for 7e246f7f2bde0c859269c4b81505bd0887045e7b
1457- .pick_status.json: Updates notes for 9865e5dff49395543da4331a943ba5a03ce6a413
1458- .pick_status.json: Update to 1cdc4be14b66108ae0e8069686ac3efe52bef3cb
1459- .pick_status.json: Updates notes for b8ea9724fa5ca38620bc0cdc01b7addd05574954
1460- .pick_status.json: Updates notes for 68027bd38e134f45d1fe8612c0c31e5379ed7435
1461- VERSION: update to 23.2.0-rc4
1462- docs: Update release calendar for 23.2.0-rc4
1463- .pick_status.json: Mark fa6562b239f00f9f72c988459e252bdee072fd73 as denominated
1464- .pick_status.json: Update to f4fecdad724edf8187d22928ed844af7fd84654d
1465- zink/ci: mark unexpcted pass as expected
1466
1467Emma Anholt (124):
1468
1469- zink: Avoid infinite loop finding no var in update_so_info.
1470- ci/crocus: Update checksum for STK.
1471- symbol_table: Store the symbol name in the same allocation as the symbol entry.
1472- symbol_table: Don't maintain the HT as we're destroying the table.
1473- symbol_table: Don't bother resetting the key on popping scope.
1474- symbol_table: Prehash the key on insert, and reuse the entry on shadowing.
1475- tu/perfetto: Refactor code out of the macro, to stage_end.
1476- tu/perfetto: Clean up an extra token paste to just use the arg being passed.
1477- tu/perfetto: Use tu_CmdBeginDebugUtilsLabelEXT as a stage event in perfetto.
1478- tu/perfetto: Drop unused arg to send_descriptors().
1479- tu/perfetto: s/MRTs/attachment_count/ in traces.
1480- anv: Only enable GPL if ANV_GPL=true, or if zink or DXVK are the engine.
1481- anv: Refactor repeated pipeline creation feedback output code.
1482- ci/lvp: Update sanctuary trace hash.
1483- ci/radv: Demote navi21 to manual until recent flakiness resolves.
1484- ci/zink+tu: Drop some intermittently failing a630 traces.
1485- ci/freedreno: Drop portal-2-v2 trace.
1486- ci/radv: Add known flakes for #8817
1487- ci: Crank up the yamllint line length limit.
1488- ci/freedreno: Demote a530 to manual again.
1489- ci: Make a variable for the repeated rootfs directory name.
1490- ci: Add the Vulkan validation layer to amd64 rootfs builds.
1491- ci/zink: Re-enable traces now that !20319 has landed.
1492- ci: Move zink's validation layer setup to deqp-runner.sh.
1493- ci/zink: Enable the validation layer on the TGL GL46 run.
1494- blob: Don't valgrind assert for defined memory if we aren't writing.
1495- util/log: Fix log messages over 1024 characters.
1496- ci: Move some timeout xfails to skips.
1497- ci/deqp: Update to 1.3.5.1 and pull in additional bugfixes from main.
1498- ci/zink: Drop anv/lvp validation exceptions that should be fixed in the CTS.
1499- ci/valve: Add a workaround for finding libdrm on navi21s.
1500- ci/panfrost: Drop tex3d-maxsize on g52.
1501- ci/lima: Skip ppgtt_memory_alignment that flaked a job with the oomkiller.
1502- ci/crocus: Note a recent regression.
1503- ci/zink: Try to update TGL results for new MSAA behavior.
1504- vulkan: Handle alignment failure in the pipeline cache.
1505- vulkan: Actually increment the count of objects in GetPipelineCacheData.
1506- Revert "ci/zink: Try to update TGL results for new MSAA behavior."
1507- ci/zink: Update more xfails for tgl piglit.
1508- ci/zink+anv: Test piglit quick_gl pre-merge, dropping a few KHR-GL46 tests.
1509- ci/radeonsi: Mark glx-make-current as flaky.
1510- ci/radv: Disable flaky heaven d3d9 trace.
1511- ci/turnip: Drop an xfail from the full run for a recent fix.
1512- ci/turnip: Drop the IUB bug fallout flakes.
1513- mesa: Fix debug logging of fp compile compare func.
1514- mesa: Fix precompile of GLSL programs with shadow samplers.
1515- zink: Explain some of the current pathway for shadow sampling.
1516- zink: Fix silly void * type in rewrite_tex_dest.
1517- zink: Don't flag legacy_shadow_mask for RED-only reads in the shader.
1518- ci: Re-enable some piglit tests that should be fast enough post-uprev.
1519- ci/zink+anv: Skip a couple more long tests pre-merge.
1520- compiler: Update reference to name_for_stage func.
1521- nir: Add helpers for lazy var creation.
1522- drm-shim: Avoid assertion fail if someone does close(-1).
1523- glsl: Allow invariant flags on sysvals, such as gl_PointCoord.
1524- nir/lower_texcoord_replace: Flag SYSTEM_VALUE_POINT_COORD read when we load it.
1525- zink: Use PIPE_CAP_FS_POINT_IS_SYSVAL.
1526- mesa: Use find_state_var in lower_builtin.
1527- nir: Use find_state_var in lower_atomics_to_ssbo.
1528- nir,mesa: Add helpers for creating uniform state variables.
1529- mesa: Move ATI_fragment_shader fog code emit to a NIR lowering pass.
1530- mesa/ARB_fp: Drop an extra enum for fog mode.
1531- mesa/ARB_fp: Use the NIR pass for adding fog code instead of ARB instrs.
1532- mesa: Move ARB_vp position invariant option handling to NIR.
1533- mesa: Drop ARB program helper functions that are no longer used.
1534- mesa: Drop unused control flow instructions for ARB programs.
1535- mesa: Drop remaining unused ARB program instructions.
1536- mesa: Move st_prog_to_nir_postprocess out of prog_to_nir.
1537- mesa/ati_fs: Move sampler dim adjustment to a separate NIR pass.
1538- mesa/ati_fs: Move NIR translation to ATI_fs compile time.
1539- mesa/ati_fs: Move prog->SamplersUsed/TexturesUsed setup to EndFragmentShader.
1540- mesa: Use the NIR pass for fixed function fog.
1541- mesa/ffvs: Fix mvp_with_dp4 position transformation.
1542- mesa: Use shared NIR code for ARB_vp and FF VS position transformation.
1543- ci/freedreno: Update minetest hash.
1544- Revert "ci: disable anholt's farm"
1545- crocus: Fix regression from !20153
1546- ci/crocus: Add a missing xfail.
1547- ci/turnip: Update full-run xfails.
1548- tu: Ignore unused shader stages in pipeline library creation.
1549- anv: Drop unused ALL_GRAPHICS_LIB_FLAGS.
1550- ci/crocus: Update trace hash for the neverball regression.
1551- ci/etnaviv: Update some xfails common between the last 3 nightly runs.
1552- v3d: Respect nir_intrinsic_store_output's write_mask.
1553- mesa: Emit full output write in st_pbo_create_vs().
1554- mesa: Port the pbo.use_gs path to NIR and let it get used on NIR drivers.
1555- softpipe: Drop the use_tgsi debug flag.
1556- llvmpipe: Drop the LP_DEBUG=tgsi_ir debug option.
1557- virgl: Drop the VIRGL_DEBUG=use_tgsi debug var.
1558- r600: Drop docs for use_tgsi debug var.
1559- r300: Drop RADEON_DEBUG=use_tgsi.
1560- nouveau: Delete the NV50_PROG_USE_TGSI env var.
1561- svga: Switch to preferring NIR by default.
1562- nine: Drop the nir_vs/nir_ps env vars.
1563- gallium: Drop PIPE_SHADER_CAP_PREFERRED_IR.
1564- mesa/drawtex: Cut out the TGSI semantic translation.
1565- svga: Stop asserting that compute params are queried against TGSI.
1566- mesa: Always query our compute params against IR_NIR.
1567- mesa: Drop TGSI token handling
1568- mesa: Simplify st_get_nir_compiler_options().
1569- mesa: Drop dead TGSI serialization prototypes.
1570- mesa/atifs: Rename the header guard.
1571- mapi: clang-format _glapi_add_dispatch().
1572- mapi: Delete dynamic stub generation.
1573- mesa: Drop the function parameter spec from the remap table.
1574- mapi: Clean up mapi_stub struct.
1575- mesa: Drop the aliases from the remap table.
1576- mapi: Drop the unused_functions table.
1577- mapi: Delete execmem support code.
1578- intel: Count reads_remaining across all blocks.
1579- intel: Allocate the last_grf_write once per scheduler.
1580- intel: Reduce cost of resetting last_grf_write.
1581- ci/zink: Update current xfails on tgl.
1582- ci: Update to vulkan-cts-1.3.5.2 (and pull in some more fixes).
1583- ci: Drop skips for some previously-invalid CTS tests.
1584- ci: Drop some skips of GL CTS ArraysOfArrays tests.
1585- ci/anv: Make anv-manual-rules actually manual on anv-only changes.
1586- ci: Clean up .intel-rules definition.
1587- ci/amd: Report flakes to #amd-ci on OFTC.
1588- ci/anv: Add testing of the GLES CTS using ANGLE on TGL.
1589- ci/radv+radeonsi: Fix the combo rules to include core vulkan changes.
1590- ci/radv: Add testing of the GLES CTS using ANGLE on stoney.
1591- ci/tu: Drop some xfails for !24086
1592- disk_cache: Disable the "List" test for RO disk cache.
1593
1594Eric Engestrom (134):
1595
1596- VERSION: bump to 23.2
1597- docs: reset new_features.txt
1598- v3d: add flake spec@ext_framebuffer_blit@fbo-sys-sub-blit
1599- ci: stop removing -x11 suffix for x11 build of deqp-egl
1600- ci: add -android suffix for android build of deqp-egl
1601- ci: move deqp-egl instead of copying it
1602- ci: start documenting which image tags need to be bumped
1603- ci: bump tags
1604- ci: update shebang to make it more portable
1605- broadcom/ci: deduplicate script definition
1606- v3dv/ci: drop fixed failure from fails.txt
1607- amd: fix buggy usage of unreachable()
1608- compiler: fix buggy usage of unreachable()
1609- pvr: fix buggy usage of unreachable()
1610- vk/util: fix buggy usage of unreachable()
1611- util: enforce unreachable()'s argument being a literal string
1612- egl: inline driver.GetProcAddress() as it's always _glapi_get_proc_address()
1613- ci: rework vulkan validation layer build script
1614- v3d: document that \`V3D_DEBUG=shaderdb` is \*not* for shader-db
1615- v3d: fix tfu_supports_tex_format() param type, and document why
1616- v3d: fix various minor issues in gen_pack_header.py
1617- dzn: fix pointer type mismatch
1618- ci: bump bin/ci/ deps to support python 3.11
1619- ci: drop GENERATE_ENV_SCRIPT
1620- ci: stop marking environment variable list as executable
1621- ci: replace write + cat with tee
1622- ci: disable anholt's farm
1623- ci: only execute capture-devcoredump.sh when it's present
1624- util/bitset: ensure the sets compared have the same size at compile time
1625- docs: add release notes for 23.1.0
1626- docs: update calendar for 23.1.0
1627- ci/b2c: increase timeout to 5 minutes
1628- ci/amd: don't override the b2c timeout in the steamdeck config
1629- ci/zink: add new zink-radv-navi10-valve flakes
1630- mailmap: update @mupuf's name
1631- docs: fix release date of 23.1.0
1632- ci/zink: document new zink-radv-navi10-valve failures
1633- v3dv: fix align() computation for pixel formats with non-POT block sizes
1634- docs: update calendar for 23.1.1
1635- docs: add release notes for 23.1.1
1636- docs/relnotes: add sha256sum for 23.1.1
1637- ci_run_n_monitor: add ability to specify the pipeline to use, instead of auto-detecting it
1638- ci/amd: move AMD-specific LD_PRELOAD to AMD config
1639- ci/amd: only define AMDGPU_GPU_ID for the duration of the call
1640- bin/ci: fix mistakenly hardcoded repo name in get_gitlab_project()
1641- ci/intel: reuse iris_file_list instead of copying its definition
1642- meson: simplify another "any of" check
1643- wsi/display: drop unused parameters from local functions
1644- ci: split clang-format list of folders for easier maintenance
1645- ci: show diff when clang-format check fails
1646- panfrost: fix formatting of a couple of files that were missed
1647- panfrost: rename \*.cc files to \*.cpp
1648- ci/zink+radv: fix flakes definition
1649- ci/zink+radv: mark all spec@arb_copy_image@arb_copy_image-targets* as flaky after getting a bunch more of them
1650- ci/zink+radv: document recent regressions
1651- ci: color the diff for clang-format
1652- meson: enable the clang-format target
1653- ci: use meson to run clang-format
1654- docs: document clang-format and how to use it
1655- docs/calendar: add 23.2 branchpoint and release candidates
1656- ci/zink+radv: mark flakes as such
1657- ci/radv: fix flakes definition
1658- ci/crocus: fix flakes definition
1659- ci/zink+anv: fix flakes definition
1660- ci/b2c: also detect non-soft GPU hangs with AMDGPU
1661- amd/ci: run gl(es) cts & piglit on radeonsi on vangogh
1662- ci/radv: update expectations
1663- ci/zink+radv: update expectations
1664- docs/relnotes/23.1.1: clear "new features"
1665- docs: add release notes for 23.1.2
1666- docs/relnotes: add sha256sum for 23.1.2
1667- docs: update calendar for 23.1.2
1668- egl: return correct error for EGL_KHR_image_pixmap
1669- clang-format: add explanation for anyone reading .clang-format-include
1670- radv,aco: tweaks to get clang-format to print nicer code
1671- radv: reformat according to its .clang-format
1672- aco: reformat according to its .clang-format
1673- ci: enforce formatting for RADV & ACO
1674- radv: fix formatting
1675- Revert "ci: remove clang-format testing"
1676- asahi: drop unnecessary DRM_FORMAT_MOD_{LINEAR,INVALID} fallbacks
1677- ci: mark the valve farm as down
1678- docs/ci: fix command to disable/re-enable farms
1679- docs: add release notes for 23.1.3
1680- docs/relnotes: add sha256sum for 23.1.3
1681- docs: update calendar for 23.1.3
1682- docs/coding-style: add example vim config for clang-format
1683- docs/coding-style: add example emacs config for clang-format
1684- docs/coding-style: add pre-commit hook fallback for clang-format
1685- v3dv: replace boolean and uint with bool and size_t
1686- amd/ci: add another dEQP-VK.multiview.renderpass2.multisample.* flake
1687- amd/ci: add another dEQP-VK.dynamic_rendering.primary_cmd_buff.basic.* flake
1688- ci: split valve farm in two
1689- util/disk_cache: fix ~/.cache/ permissions
1690- panfrost/ci: drop invalid skips that are already marked as known flakes
1691- intel/ci: fix skips definitions
1692- etnaviv/ci: fix skips definition
1693- zink/ci: fix skips & flakes for zink+radv on vangogh & navi10
1694- docs/codingstyle: fix clang-format command
1695- vc4/ci: fix skipping of gles3 piglit tests
1696- v3dv/ci: fix skipping of vk tests
1697- v3dv/ci: skip more tests that are timing out
1698- virgl/ci: fix skips definition
1699- clang-format: add egl foreach macro
1700- clang-format: add wayland foreach macros
1701- egl: change a couple of clang-format settings
1702- egl: add a few trailing commas
1703- egl: protect the formatting in a couple of places
1704- egl: prevent clang-format from reordering some headers
1705- egl: re-format using clang-format
1706- clang-format: enforce formatting of egl
1707- add initial .git-blame-ignore-revs
1708- ci/zink+radv: document another flake
1709- ci/zink+radv: fix flake definition
1710- ci: document workflow rules
1711- ci: set priority:low tag only on non-Marge pipelines
1712- ci: fix .valve-farm-manual-rules
1713- ci: split farm rules out of test-source-dep.yml
1714- etnaviv/ci: drop duplicate line in etnaviv files list
1715- broadcom/ci: add the renderonly folder to things that can affect v3d & vc4
1716- meson: clarify description of \`opengl` option
1717- meson: clarify what "off-screen rendering" means
1718- ci: avoid running hardware jobs if there are already trivial issues
1719- ci: avoid running hardware jobs if lint fails - now on LAVA too!
1720- ci: avoid running hardware jobs if lint fails - now on Windows too!
1721- bin/ci_run_n_monitor: get git sha from pipeline if specified, instead of requiring --rev to match
1722- panfrost: upcast uint8/uint16 before shifting them beyond their range
1723- vc4: drop duplicate .lower_ldexp
1724- zink: fix format in zink_make_{image,texture}_handle_resident()
1725- v3dv: fix VK_PIPELINE_ROBUSTNESS_{BUFFER,IMAGE}_BEHAVIOR_DEVICE_DEFAULT_EXT copy/paste typo
1726- v3dv: fix copy/pasted type of \`sample`
1727- v3dv: fix shader stage name in error message
1728- v3d/qpu: fix type of function argument
1729- ci/farm-rules: fix missing valve-infra jobs in scheduled pipelines
1730
1731Erico Nunes (6):
1732
1733- Revert "ci: disable lima farm, currently out-of-space, needs to be fixed"
1734- lima: fix stringop-overflow warning
1735- lima/ci: temporarily disable deqp-egl tests due to timeouts
1736- ci: temporarily disable lima farm
1737- ci: restore lima farm
1738- lima: fix plbu block stride calculation
1739
1740Erik Faye-Lund (144):
1741
1742- nir: remove nir_state_slot::swizzle
1743- glsl: remove ir_state_slot::swizzle
1744- docs: renderpass -> render pass
1745- docs: statechanges -> state changes
1746- docs: backfacing -> back-facing
1747- docs: codepath -> code-path
1748- docs: did't -> didn't
1749- docs: cma -> CMA
1750- docs: Anv -> ANV
1751- docs: perfetto -> Perfetto
1752- docs: use correct tick for "doesn't"
1753- docs: vlan -> VLAN
1754- docs: toplevel -> top-level
1755- docs: correct spelling of "source"
1756- docs: correct spelling of "tagged"
1757- docs: correct spelling of "frame"
1758- docs: sort extensions
1759- docs: add custom html theme
1760- docs: add bootstrap extension
1761- docs: translate admonitions into bootstrap alerts
1762- docs: remove support for old sphinx-versions
1763- docs: use custom html theme
1764- nir: clean up white-space in deref-printing
1765- mesa/main: clean up white-space in ffvertex_prog.c
1766- mesa/main: drop disasm-code from ffvertex_prog.c
1767- mesa/main: allow passing nir-shaders to st_program_string_notify
1768- mesa/main: make ffvertex output nir
1769- nir: fix constant-folding of 64-bit fpow
1770- docs: fix edit-links
1771- mesa/main: drop use_legacy_math_rules
1772- llvmpipe: fixup refactor copypasta
1773- docs: fixup About Mesa3D.org link
1774- docs/tgsi: fix up indent
1775- docs/tgsi: fix bad latex
1776- docs/tgsi: fixup bad latex
1777- docs/tgsi: wrap overly long lines
1778- docs/tgsi: use math-notations for conditionals
1779- docs/tgsi: do not use math-block for non-latex
1780- docs/tgsi: fixup latex for TEX and TEX2
1781- docs/tgsi: use \\ll and \\gg for left and right shift
1782- aux/draw: check for lines when setting clipping-mode
1783- zink: fix bad indent
1784- zink: clean up tcs_vertices_out_word handling
1785- zink: do not open-code memcpy
1786- aco: use c++17
1787- meson: remove needless c++17-overrides
1788- mesa/main: clean up white-space in ff_fragment_shader.cpp
1789- mesa/st: refactor st_translate_fragment_program
1790- mesa/st: allow using nir for ff-fragment shaders
1791- compiler/nir: move find_state_var to common code
1792- mesa/main: ff-fragshader to nir
1793- mesa/main: compile ff_fragment_shader as c-code
1794- mesa/program_cache: remove unused shader-cache functions
1795- panfrost: expose PIPE_CAP_POLYGON_OFFSET_CLAMP
1796- util: mark externally-unused functions as static
1797- nir: use more nir_fmul_imm
1798- nir: use more nir_fadd_imm
1799- nir: fsub -> fadd_imm
1800- nir: use more nir_ffma_imm variants
1801- nir: add nir_fsub_imm
1802- nir: use nir_fsub_imm
1803- radeonsi,radv: use nir_format_linear_to_srgb
1804- docs: explicitly mark extensions as obsolete
1805- docs: mark MESA_multithread_makecurrent as obsolete
1806- docs: mark MESA_shader_debug as obsolete
1807- docs: mark MESA_swap_frame_usage as obsolete
1808- docs: mark MESA_texture_array as obsolete
1809- docs: move obsolete extensions to their own list
1810- zink: update profiles schema
1811- zink: keep gl46_optimal extensions/features sorted
1812- zink: compute correct location for line-smooth gs
1813- zink: do not lower line-smooth for non-lines
1814- docs: increase contrast in dark-theme
1815- zink: update profiles schema
1816- d3d12, dozen: make sure we pass float to fge
1817- nir: use nir_i{ne,eq}_imm helpers
1818- nir: generate nir_{cmp}_imm variants
1819- nir: use generated immediate comparison helpers
1820- nir: add nir_[fui]gt_imm and nir_[fui]le_imm helpers
1821- nir: use new immediate comparison helpers
1822- mesa/st: use nir_imm_vec4
1823- nir: use more imm-helpers
1824- nir: isub -> iadd_imm
1825- nir: use nir_imm_{true,false}
1826- nir: add and use nir_fdiv_imm
1827- nir: add and use nir_imod_imm
1828- nir: add missed nir_cmp_imm-helpers
1829- docs: upgrade bootstrap to 5.3.0
1830- cso: use enum for render-conditions
1831- draw: use enum for tgsi-semantic
1832- draw: use uint32_t instead of uint
1833- draw: use enum for primitive-type
1834- draw: track vertices and vertex_ptr as byte-pointers
1835- draw: use stdint.h types
1836- cso: use unsigned instead of uint
1837- draw: match type of pipe_draw_start_count_bias::count
1838- draw: use unsigned instead of uint
1839- aux/indices: use stdint.h types
1840- draw/i915: move hwfmt array to i915 specific struct
1841- microsoft/compiler: use nir_imm_zero
1842- mesa/st: use nir_ineg
1843- vulkan: avoid needless constant-folding
1844- broadcom/compiler: use imm-helpers
1845- v3dv: use imm-helpers
1846- pan: use imm-helpers
1847- freedreno: use imm-helpers
1848- r600/sfn: use imm-helpers
1849- d3d12: use imm-helpers
1850- radeonsi: use imm-helpers
1851- vc4: use imm-helpers
1852- intel: use imm-helpers
1853- anv: use imm-helpers
1854- hasvk: use imm-helpers
1855- mesa/st: use imm-helpers
1856- amd: use imm-helpers
1857- etnaviv: use imm-helpers
1858- gallium: use imm-helpers
1859- nir: use imm-helpers
1860- math: fix indentation in m_matrix.[ch]
1861- math: remove unused defines
1862- math: drop MAT_[ST][XYZ] defines
1863- aux/trace: use stdint.h types
1864- pipebuffer: use unsigned instead of uint
1865- gallivm: use unsigned instead of uint
1866- aux/pp: use unsigned instead of uint
1867- aux/util: use enum for render-condition
1868- aux/util: match type of pipe_draw_start_count_bias::start/count
1869- aux/util: use enum for primitive-type
1870- aux/util: use unsigned instead of uint
1871- aux/util: use stdint.h types
1872- aux/util: uint -> unsigned
1873- tgsi: use enum instead of defines
1874- tgsi: use stdint.h types
1875- tgsi: use enum for tgsi-file type
1876- tgsi: use enum for property-name
1877- tgsi: use enum for shader-type
1878- tgsi: use enum for interpolate-mode
1879- tgsi: uint -> uint32_t
1880- tgsi: uint -> unsigned
1881- nir: constify intrin
1882- nir: use nir_intrinsic_get_var
1883- radv: do not rely on constant-folding
1884- nir: do not needlessly rely on optimizations
1885- panfrost: delete stale editorconfig file
1886
1887Faith Ekstrand (16):
1888
1889- nouveau/nir: image_samples/size don't have coordinates
1890- vulkan: Document vk_physical_device::supported_features
1891- nir/opt_if: Use block_ends_in_jump
1892- nir: Add a reg_intrinsics flag to nir_convert_from_ssa
1893- nir/from_ssa: Make additional assumptions in coalescing
1894- nir/from_ssa: Support register intrinsics
1895- freedreno/ci: Update pixmark piano checksums
1896- nv50/ir: Support vector movs
1897- nir: Properly handle divergence for load_reg
1898- nir/trivialize: Maintain divergence information
1899- nir/trivialize: Trivialize cross-block loads
1900- Revert "mesa, compiler: Move gl_texture_index to glsl_types.h"
1901- Revert "compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h"
1902- nir: Handle nir_op_mov properly in opt_shrink_vectors
1903- nir: Don't handle nir_op_mov in get_undef_mask in opt_undef
1904- nir: Fix metadata in nir_lower_is_helper_invocation
1905
1906Felix DeGrood (19):
1907
1908- anv: disable reset query pools using blorp opt on MTL
1909- anv: Add END_OF_PIPE_SYNC reporting to INTEL_DEBUG=pc
1910- anv: Add flush reasons to raytracing flushes
1911- anv: Add flush reason to NEEDS_END_OF_PIPE_SYNC
1912- anv: split INTEL_MEASURE multi events
1913- intel: INTEL_MEASURE cpu mode
1914- anv: Enable INTEL_MEASURE=cpu
1915- iris: Enable INTEL_MEASURE=cpu
1916- docs: add INTEL_MEASURE=cpu
1917- intel/debug: Control start/stop frame of batch debug
1918- anv: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP
1919- iris: Enable INTEL_DEBUG_BATCH_FRAME_START/_STOP
1920- docs: Add INTEL_DEBUG_BATCH_FRAME_START/_STOP
1921- anv: fix INTEL_MEASURE on MTL
1922- anv: re-enable RT data in INTEL_MEASURE
1923- intel: refactor INTEL_MEASURE pointer dumping
1924- intel: batch consecutive dispatches into implicit renderpasses
1925- intel: Secondary CB print primary CB's renderpass
1926- anv: override vendorID for Cyberpunk 2077
1927
1928Feng Jiang (3):
1929
1930- frontends/va: Fix memory leak of decrypt_key
1931- radeonsi/vcn: Remove unnecessary type conversion
1932- virgl/video: Fix out-of-bounds access in fill_mpeg4_picture_desc()
1933
1934Filip Gawin (5):
1935
1936- nine: add fallback for D3DFMT_D16 in d3d9_to_pipe_format_checked
1937- glx: fix build with APPLEGL
1938- ac/nir: fix slots in clamping legacy colors
1939- anv: allow intel_clflush_range only on igpu
1940- crocus: Avoid fast-clear with incompatible view
1941
1942Francisco Jerez (3):
1943
1944- anv: Fix calculation of guardband clipping region.
1945- intel/gfx12.5: Enable L3 partial write merging for compressible surfaces among other cases.
1946- anv: Swap ordering of memory types on non-LLC platforms to work around application bugs.
1947
1948Frank Binns (7):
1949
1950- pvr: add missing explicit check against VK_SUCCESS
1951- pvr: use util_dynarray_begin() in more places
1952- pvr: replace transfer EOT binary shaders with run-time compiled shaders
1953- pvr: fix typo in pvr_rt_get_region_headers_stride_size()
1954- pvr: fix array overflow in pvr_device_tile_buffer_ensure_cap()
1955- pvr: fix invalid read reported by valgrind
1956- pvr: skip setting up SPM consts buffer when no const shared regs are used
1957
1958Friedrich Vock (41):
1959
1960- radv/rmv: Fix creating RT pipelines
1961- radv/rmv: Fix import memory
1962- radv/rt: Plug some memory leaks during shader creation
1963- radv: Don't leak the RT prolog binary
1964- radv: Hash pipeline libraries separately
1965- radv: Always call si_emit_cache_flush before writing timestamps
1966- radv: Add driconf to always drain waves before writing timestamps
1967- nir: Rematerialize derefs in use blocks before repairing SSA
1968- nir: Remove unnecessary assert in nir_before_src
1969- radv: Disable capture/replay handles
1970- aco: Lower divergent bool phis iteratively
1971- radv: Always flush before writing acceleration structure properties
1972- aco: Reset scratch_rsrc on blocks without predecessors
1973- aco: Fix live_var_analysis assert
1974- aco: Fix assert in insert_exec_mask
1975- radv: Add driconf to force wave64 for RT
1976- radv: Add RADV_DEBUG=nort
1977- radv: Enable ray tracing pipelines by default
1978- radv: Add the BOs of all shaders in a RT pipeline
1979- radv: Add radv_shader_free_list
1980- radv: Move shader arena allocation to a separate function
1981- radv: Add option to allocate shaders in replayable VA range
1982- radv: Add utilities to serialize and deserialize shader allocation info
1983- radv: Add radv_shader_reupload
1984- radv: Break up radv_shader_nir_to_asm
1985- radv: Split up implementation of radv_shader_create
1986- radv: Add support for creating capture/replay shaders
1987- radv: Add radv_rt_capture_replay_handle
1988- radv/rt: Only compare the non-recursive capture/replay handle
1989- radv/rt: Associate capture/replay handles with stages
1990- radv/rt: Replay shader allocations according to capture/replay handle
1991- radv/rt: Rework radv_GetRayTracingCaptureReplayShaderGroupHandlesKHR
1992- radv: Re-enable RT pipeline capture/replay handles
1993- meson: Prefix Vulkan "Ray Tracing" summary with "Intel"
1994- radv/ci: Skip ray tracing tests on vangogh
1995- Revert "radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh"
1996- Revert "Revert "radv: Enable ray tracing pipelines by default""
1997- radv/rt: Enable exact on software intersection functions
1998- radv/rt: Miss rays that hit the triangle's v edge
1999- radv: Handle VK_SUBOPTIMAL_KHR in trace layers
2000- nir/load_store_vectorize: Handle intrinsics with constant base
2001
2002Ganesh Belgur Ramachandra (5):
2003
2004- gallium/pipe: Add get_resources() to pipe_video_buffer
2005- gallium/vl: implementation for get_resources()
2006- nouveau: implementation for get_resources()
2007- d3d12: implementation for get_resources()
2008- frontends/va: use resources instead of views
2009
2010Georg Lehmann (51):
2011
2012- nir: lower ballot_bit_count_exclusive/inclusive to mbcnt_amd
2013- radv: use lower_ballot_bit_count_to_mbcnt_amd
2014- aco: Assert that operands have the same byte offset when reassigning split vectors
2015- aco: also reassign p_extract_vector post ra
2016- aco/vn: compare all valu modifers
2017- aco/optimizer: don't use pass_flags for mad idx
2018- aco/optimizer: copy pass flags for newly created valu instructions
2019- aco/assembler: support VOP3P with DPP
2020- aco/builder: support VOP3(P) with dpp
2021- aco: add assembler tests for VOP3(P) with DPP
2022- aco/ra: convert VOPC_DPP instructions without vcc to VOP3
2023- aco: use VOP3+DPP
2024- aco: don't apply dpp if the alu instr uses the operand twice
2025- aco: emit_wqm on MIMG dst, not operands
2026- aco: introduce helper to swap valu operands with modifiers
2027- aco/gfx11: use fmamk/fmaak with opsel
2028- aco: add withoutVOP3 helper
2029- aco/ra: use smaller operand stride for VOP3P with DPP
2030- aco/ra: use fmac with DPP/opsel on GFX11
2031- aco: add helper function for can_use_input_modifiers
2032- aco: use get_operand_size for dpp opt
2033- aco: use can_use_input_modifiers helper
2034- aco/optimizer: allow DPP to use VOP3 on GFX11
2035- util: fix stack dynarray used by multiple tus
2036- nir/opt_if: use nir_alu_instr_is_comparison directly
2037- aco: cleanup v_cmp_class usage
2038- aco: p_start_linear_vgpr doesn't always need exec mask
2039- aco/ir: return true in hasRegClass for Operand(reg, rc)
2040- aco/statistics: improve v_fma_mix dual issuing detection
2041- aco: use v_add_f{16,32} with clamp for fsat
2042- aco: use v_fma_mix for f2f32 and f2f16 on gfx11 if wave64
2043- aco: make validation work without SSA temps
2044- aco: move cfg validation to its own function
2045- aco: don't validate p_constaddr_addlo/p_resumeaddr_addlo operands
2046- aco: validate ir for prologs and after lower_to_hw_instr
2047- aco/opcodes: move v_cndmask_b32 back to the VOP2 list
2048- aco: remove v_cvt_pkrtz_f16_f32_e64 when it's actually VOP2
2049- aco/opcodes: delete wrong comment copy pasted from NIR
2050- aco: use uses helpers for pk_fma opt
2051- aco: combine scalar mul+pk_add to pk_fma
2052- aco/gfx10+: use v_cndmask with literal for reduction identity
2053- nir: add single bit test opcodes
2054- nir/lower_bit_size: mask bitz/bitnz src1 like shifts
2055- aco: implement nir_op_bitz/bitnz
2056- nir/opt_algebraic: combine bitz/bitnz
2057- radv: set has_bit_test for aco
2058- aco/optimizer: delete s_bitcmp optimization
2059- aco/gfx11: fix get_gfx11_true16_mask with v_cmp_class_f16
2060- aco: fix non constant 16bit bitnz/bitz
2061- aco: fix u2f16 with 32bit input
2062- nir/opt_algebraic: remove broken fddx/fddy patterns
2063
2064George Ouzounoudis (1):
2065
2066- radv: small fix for VkDescriptorSetVariableDescriptorCountLayoutSupport
2067
2068Gert Wollny (98):
2069
2070- r600/sfn: Lower tess levels to vectors in TCS
2071- r600/sfn: make sure f2u32 is lowered late and correctly for 64 bit floats
2072- r600: remove TGSI code path
2073- r600/sfn: Add a type for address registers
2074- r600/sfn: don't track address registers in live ranges
2075- r600/sfn: Handle MOVA_INT in sfn assembler
2076- r600/sfn/tests: Cleanup and move some code around
2077- r600/sfn: Add address and index registers creation to ValueFactory
2078- r600/sfn: Rework query for indirect access in alu instr and opt
2079- r600/sfn: don't allow more than one AR per instruction
2080- r600: Allow both index registers for all CF types
2081- r600/sfn: Prepare uniforms and local arrays for better address handling
2082- r600/sfn: handle AR and IDX register in shader from string
2083- r600/sfn: add method to update indirect address to all instrution types
2084- r600/sfn: Add function to insert op in block
2085- r600/sfn: Update resource based instruction index mode check
2086- r600/sfn: Be able to track expected AR uses
2087- r600/sfn: AR and IDX don't need the write flag, but haev a parent
2088- r600/sfn: Add a RW get function of IF predicate access
2089- r600/sfn: Add interface to count AR uses in ALU op
2090- r600/sfn: Add pass to split addess and index register loads
2091- r600/sfn: Add function to check whether a group loads a index register
2092- r600/sfn: take address loads into account when scheduling
2093- r600/sfn: Add more tests and update to use address splits
2094- r600/sfn: Don't copy-propagate indirect access into LDS instr
2095- r600/sfn: Add test for multiple index load
2096- r600/sfn: set CF force flag always when starting a new block
2097- r600/sfn: Start a new ALU CF on index use, not on index emission
2098- r600/sfn: Add chip family to shader class
2099- r600/sfn: Add handling for R600 indirect access alias handling
2100- r600/sfn: Override Array access handling in backend assembler
2101- r600/sfn: Fix copy-prop with array access
2102- r600/sfn: scheduled instructions are always ready
2103- r600/sfn: Add more tests and update to use address splits
2104- r600/sfn: print failing block when scheduling fails
2105- r600/sfn: Can't use an indirect array access as source to AR load
2106- r600/sfn: factor out index loading for non-alu instructions
2107- r600/sfn: prepare for emitting AR loads
2108- r600/sfn: Tie in address load splitting
2109- r600+sfn: Assign ps_conservative_z and switch to NIR defines
2110- r600/sfn: assign window_space_position in shader state
2111- r600/sfn: Ass support for image_samples
2112- r600/sfn: fix cube to array lowering for LOD
2113- r600/sfn: Fix iterator use
2114- r600/sfn: move kill instruction test to alu instruction
2115- r600/sfn: add dependencies for kill instructions
2116- r600/sfn: move kill handling fully to scheduling
2117- r600/sfn: use correct FS output location if not all outputs are used
2118- virgl: Make query result resource as dirty before requesting result
2119- virgl: Add support for ARB_pipeline_statistics
2120- virgl/ci: uprev virglrenderer
2121- docs/features: fix empty line error
2122- virgl: Fix IB upload when a start >0 is given
2123- virgl: Submit drawid_offset if is not zero
2124- virgl: signal support for group vote and draw parameters
2125- virgl: enable ARB_gl_spirv
2126- features: Update virgl features
2127- ci: uprev virglrenderer to include changes needed for GL 4.6 support
2128- r600/sfn: assert that group barrier is not emitted in divergent code flow
2129- r600/sfn: Switch to scoped barriers
2130- util/driconf: pin minImageCount to three for "Path of Exile"
2131- r600/sfn: add read instruction for unused but required LDS op results
2132- r600/sfn: Don't rewrite TESS_EVAL inner tess level outputs
2133- r600/sfn: Add experimental support for load/store_global
2134- r600/sfn: Handle store_global when lowering 64 bit ops to vec2
2135- r600/sfn: Handle load_global in 64 to vec2 lowering
2136- rusticl: compile r600 driver
2137- r600: fix handling of use_sb flag
2138- r600/sfn: move kill handling to fully scheduling
2139- 600/sfn: Trigger use of ACK for some barriers
2140- r600: Disable SB if we use the ariable length DOT
2141- r600/sfn: Silence warnings "overloaded-virtual"
2142- r600/sfn: Downgrade some error message to warning
2143- r600: Split tex CF only if written component is read
2144- r600/sfn: Don't deref unused group slots
2145- r600/sfn: on R600/R700 write a dummy pixel output if there is a gap
2146- r600/sfn: Clean up FS member initialization
2147- virgl: don't allow vertex input arrays on GLES hosts
2148- r600/sfn: Fix typo
2149- r600/sfn: drop use of nir source mods
2150- r600/sfn: allow source mods for per source with multi-slot ops
2151- r600/sfn: add source and dest mod info to opcode table
2152- r600/sfn: Implement source mod optimization in backend
2153- r600/sfn: Implement fsat for 64 bit ops
2154- r600/sfn: Add source mod propagation also to fp64 ops
2155- r600/sfn: Don't clear clear group flag on vec4 that comes from TEX or FETCH
2156- virgl/ci: Drop duplicate runs
2157- ci: Upref virglrenderer
2158- r600/sfn: Fix filling FS output gaps
2159- r600: Pre-EG - Set wrap texture modes to repeat when seemless cube is used
2160- r600/sfn: Be more conservative with AR re-use
2161- r600/sfn: Shorten array elements live range
2162- r600/sfn: remove debug output leftovers
2163- r600/sfn: Fix use of multiple IDX with kcache
2164- r600/sfn: Don't try to propagate to vec4 with more than one use
2165- r600/sfn: Only switch to other CF if no AR uses are pending
2166- r600/sfn: AR loads should depend on all previous non ALU instructions
2167- r600/sfn: Take source uses into account when switching channels
2168
2169Giancarlo Devich (5):
2170
2171- d3d12: Update and require DirectX-Headers 1.610.0
2172- d3d12: Query device for D3D12_FEATURE_D3D12_OPTIONS14
2173- d3d12: Update PSO creation to use CreatePipelineState
2174- d3d12: Add ID3D12GraphicsCommandList8 to the context
2175- d3d12: Support separate front/back stencils
2176
2177Gregory Mitrano (2):
2178
2179- ac/sqtt: Add RGP Definitions for Mesh Shaders
2180- radv/sqtt: Add RGP Markers for Mesh Shaders
2181
2182Guilherme Gallo (29):
2183
2184- ci/lava: Move job definition stuff to another file
2185- ci/lava: Extract LAVA proxy and LAVAJob abstractions
2186- ci/lava: Use python-fire in job submitter
2187- ci/lava: Update LogFollower for better section handling and history
2188- ci/lava: Add a simple Structural Logger into submitter
2189- bin/ci: Add StructuredLogger to improve log handling
2190- ci/lava: Integrate StructuralLogger with AutoSaveDict
2191- ci/lava: Force use of UTC timezones
2192- ci/lava: Refactor LAVAJobSubmitter and add tests
2193- ci/lava: Use f-strings in job definition
2194- ci/lava: Skip regression test if LAVA log file is not present
2195- ci/freedreno: Fix a618-traces-performance rules
2196- ci/lava: Bypass arg list to print_log function
2197- ci/lava: Fix last section in job submitter
2198- ci: Use absolute paths in init-stage2.sh
2199- ci/lava: Add SSH support in rootfs
2200- ci/lava: Add SSH job definition
2201- ci/lava: Add bridge function for job definition
2202- ci/lava: Distinguish test suites in DUT vs Docker
2203- ci/lava: Only check for the first section marker
2204- ci/lava: Hide JWT block during YAML dump
2205- ci/lava: Tweak http-download timeout in SSH based jobs
2206- ci/lava: Raise the post test metadata gathering retry count
2207- ci/lava: Force LAVA panfrost jobs to use UART
2208- dzn: Skip a few deqp tests which are prone to timeout
2209- ci/lava: Renable SSH sessions for panfrost jobs
2210- ci/lava: Increase Docker action failure_retry counter
2211- ci/lava: Add LAVA SSH client container
2212- ci/lava: Use an alpine image for SSH client container
2213
2214Hans-Kristian Arntzen (6):
2215
2216- wsi/x11: Fix present ID signal when IDLE comes before COMPLETE.
2217- wsi/wayland: Simplify wait logic for present wait.
2218- wsi/wayland: Do not assert that all present IDs have been waited on.
2219- radv/amdgpu: Report 48-bit VAs in bo logs.
2220- Fix DGC bug where indirect count > maxSequencesCount.
2221- wsi/x11: Fix potential deadlock in present ID.
2222
2223Harri Nieminen (11):
2224
2225- amd: fix typos
2226- amd: fix typos in code
2227- r300: fix typos
2228- radeonsi: fix typos
2229- r600: fix typos
2230- r600/sb: fix typo
2231- r600/sfn: fix typos
2232- r600/sfn: fix typos in code
2233- broadcom: fix typos
2234- egl: fix typos
2235- glx: fix typos
2236
2237Helen Koike (3):
2238
2239- ci: move .microsoft-farm-container-rules to test-source-dep.yml
2240- ci: remove unused tag DEBIAN_X86_64_TEST_IMAGE_PATH
2241- ci/android: remove strace output from cuttlefish-runner.sh
2242
2243Hyunjun Ko (27):
2244
2245- intel/genxml: fix num bits of some MOCS fields
2246- intel/genxml: conform some fields to each other gen.
2247- intel/genxml: align some fields on gen9/11/12/125 with media driver.
2248- intel/genxml: add a command VD_CONTROL_STATE to gen12/125
2249- util/vl: initialize data/end pointers.
2250- vulkan/video: add to parse h265 slice.
2251- vulkan/video: add h265 reference structures and relevant util functions.
2252- anv/image: Add a surface usage bit for video decoding
2253- anv/image: allocate mv storage buffers for h265
2254- anv/image: allow VK_IMAGE_CREATE_ALIAS_BIT with a private binding.
2255- anv: add initial video decode support for h265
2256- anv: support P010 format for video 10-bit hevc decoding
2257- anv/image: get width/height for each plane of a surface for video decoding.
2258- anv: support HEVC 10-bit decoding
2259- anv: enable the video h265 decode extension.
2260- anv/ci: Add tests for video formats to the failing tests.
2261- anv/video: move video requirements to outarray.
2262- vulkan/video: adds more conditions for setting loop_filter_across_slices_enable in h265 slice parsing.
2263- vulkan/video: move parsing longterm rps in h265 slice parsing.
2264- util/rbsp: keep track of removed bits for the emulation prevention three bytes.
2265- vulkan/video: consider removed bits when calculating the size of comsumed data.
2266- anv/video: fix to set U/V offset correctly.
2267- vulkan/video: keep delta weight and offsets of predicted weight tables in h265 slice parsing
2268- intel/genxml: changes the type for predicted weight to unsigned.
2269- anv: fix to set predicted weight tables correctly.
2270- anv/video: fix to support HEVC 10bit on some of 9th gens.
2271- anv: Adds a workaround for HEVC decoding on some old platforms.
2272
2273Iago Toral Quiroga (34):
2274
2275- broadcom/compiler: fix v3d_qpu_uses_sfu
2276- broadcom/compiler: add a v3d_qpu_instr_is_legacy_sfu helper
2277- broadcom/compiler: fix incorrect check for SFU op
2278- broadcom/compiler: fix incorrect ALU checks
2279- broadcom/compiler: return early for SFU op latency calculation
2280- broadcom/compiler: try harder to merge thread switch earlier
2281- broadcom/compiler: don't allocate undef to rf0
2282- broadcom/compiler: move buffer loads to lower register pressure
2283- broadcom/compiler: increase peephole limit to 24 instructions
2284- broadcom/compiler: use unified atomics
2285- broadcom/compiler: skip jumps in non-uniform if/then when block cost is small
2286- v3dv: simplify too small Z viewport scale workaround
2287- v3dv: store slice dimensions in pixels
2288- v3dv: allow TFU transfers for mip levels other than 0
2289- v3dv: align compressed image regions to block size
2290- broadcom/compiler: flag use of control barriers
2291- broadcom/compiler: use scoped barriers
2292- v3d: only warn about bining sync for indirect draw once
2293- v3dv: remove bogus viewport code
2294- v3dv: simplify scissor setup for negative viewport height
2295- broadcom/cle: fix up viewport offset packet definition for V3D 4.1+
2296- v3d,v3dv: fix viewport offset for negative viewport center
2297- broadcom/compiler: only use last thread switch flag to detect final section
2298- nir/lower_tex: copy missing fields when creating copy of tex instruction
2299- nir/lower_tex: handle lower_tg4_offsets with lower_tg4_broadcom_swizzle
2300- broadcom/compiler: handle textureGatherOffsets
2301- v3dv: expose shaderImageGatherExtended
2302- v3dv: fix slice size for miplevels >= 2
2303- v3dv: don't use the TLB path if we might be copying partial tiles
2304- v3dv: use div_round_up for division by block size
2305- v3dv: fix blit path for compressed image to buffer copies
2306- broadcom: use nir info to keep track of implicit sample shading
2307- broadcom/compiler: free defin and defout arrays if they already exist
2308- broadcom/compiler: don't leak v3d_compile when finding a new best strategy
2309
2310Ian Romanick (20):
2311
2312- intel/fs: Don't munge source order of 3-src instructions in opt_algebraic
2313- intel/fs: Fix handling of W, UW, and HF constants in combine_constants
2314- intel/fs: Allow HF const in MAD on Gfx12.5 if all sources are HF
2315- nir/algebraic: Fixup iadd3 related patterns
2316- intel/fs: Add constant propagation for ADD3
2317- intel/eu/validate: Use a single macro define half_float_conversion cases
2318- intel/eu/validate: Add Gfx12.5
2319- intel/eu/validate: Add some validation of ADD3
2320- nir: Add optimization pass to reassociate some bfi instructions
2321- intel/fs: Use nir_opt_reassociate_bfi
2322- nir/algebraic: Lower some bfi with two constant sources
2323- intel/fs: Emit better code for bfi(..., 0)
2324- nir/algebraic: Optimize some u2f of bfi
2325- nir/algebraic: Simplify various trivial bfi
2326- intel/stub_gpu: Don't run program again after using GDB
2327- intel/fs: Constant propagate into SHADER_OPCODE_SHUFFLE
2328- intel/fs: Add missing newline
2329- intel/fs: Always do opt_algebraic after opt_copy_propagation makes progress
2330- intel/fs: Constant fold SHL
2331- intel/fs: Constant fold OR and AND
2332
2333Ikshwaku Chauhan (2):
2334
2335- radeonsi/gfx11: updated si_is_format_supported
2336- radeonsi/gfx11: updated vertex format changes
2337
2338Illia Abernikhin (3):
2339
2340- docs: add iris features to docs/features.txt
2341- docs: add crocus features to docs/features.txt
2342- docs: remove i965 features from docs/features.txt
2343
2344Illia Polishchuk (6):
2345
2346- glx: add fail check for current context in another thread
2347- drirc: add allow_sampled_tex_copy option
2348- nir: switch to a normal sampler for ARB program with not depth textures
2349- zink, drirc: Add Borderlands 2 workaround to fix spir-v 1.6 translated discard
2350- zink: move find_sampler_var from zink to nir core
2351- nir: fix invalid sampler search by texture id
2352
2353Italo Nicola (22):
2354
2355- egl: disable partial redraw when gallium hud is active
2356- egl: fix comments alignment
2357- freedreno: implement clear_render_target and clear_depth_stencil
2358- v3d: implement clear_render_target and clear_depth_stencil
2359- vc4: implement clear_render_target and clear_depth_stencil
2360- d3d12: fix clear_depth_stencil texture deref
2361- gallium: implement u_default_clear_texture
2362- gallium: use u_default_clear_texture where applicable
2363- gallium: rename util_clear_texture to util_clear_texture_sw
2364- mesa/st: use fallback path when pipe->clear_texture is not available
2365- rusticl: use fallback path when pipe->clear_texture is not available
2366- clover: use fallback path when pipe->clear_texture is not available
2367- gallium: cleanup util_blitter_clear_render_target
2368- gallium: remove PIPE_CAP_CLEAR_TEXTURE
2369- lima/ci: add some ARB_clear_texture piglit tests to lima-fails.txt
2370- d3d12/ci: add piglit arb_clear_texture-integer fail to CI expectations
2371- nir: add options to lower y_vu, yv_yu, yx_xvxu and xy_vxux
2372- gallium/st: add support for PIPE_FORMAT_NV21 and PIPE_FORMAT_G8_B8R8_420
2373- mesa/main: add PIPE_FORMAT_YVYU and PIPE_FORMAT_R8B8_R8G8
2374- mesa/main: add PIPE_FORMAT_VYUY and PIPE_FORMAT_B8R8_G8R8
2375- freedreno/ci: add KHR-GL46.buffer_storage.map_persistent_flush to flakes
2376- egl: reenable partial redraw with a warning when using gallium hud
2377
2378Iván Briano (24):
2379
2380- anv: Remove dead parameters from copy_fast_clear_dwords
2381- anv: make anv_can_fast_clear_color_view more generally available
2382- anv: factor out code for ccs_op and mcs_op
2383- anv: expose some helper functions
2384- anv: support fast color clears on vkCmdClearAttachments
2385- anv: put EXT_mesh_shader behind an environment variable
2386- anv: enable graphics pipeline libraries by default
2387- hasvk: avoid assert due to unsupported format
2388- anv: enable the GPL feature based on whether the extension is supported
2389- vulkan/wsi: fix double free on error condition
2390- anv: do not explode on 32 bit builds
2391- anv: update conformanceVersion
2392- anv: flush data cache before emitting availability
2393- anv: ensure CFE_STATE is emitted for ray tracing pipelines
2394- iris: ensure mesh is disabled on context init
2395- anv: ensure mesh is disabled on context init
2396- anv: implement Wa_14019750404
2397- blorp: fix hangs with mesh enabled
2398- anv: use a simpler MUE layout for fast linked libraries
2399- anv: track what kind of pipeline a fragment shader may be used with
2400- intel/fs: read viewport and layer from the FS payload
2401- intel/fs: handle URB setup for fast linked mesh pipelines
2402- anv: enable VK_EXT_mesh_shader where supported
2403- intel/fs: use ffsll so we don't explode on 32 bits
2404
2405James Glanville (7):
2406
2407- pvr: Improve support for image clears
2408- pvr: Fix vtxin special var allocation count
2409- pvr: Fix image to buffer copies
2410- pvr: Fix incorrect PBE packmode for S8_UINT
2411- pvr: Adjust clear's region clip words
2412- pvr: Fix seg fault on unused ds attachment
2413- pvr: Fix deferred_control_stream_flags
2414
2415James Knight (1):
2416
2417- meson: ensure i915 Gallium driver includes Intel sources
2418
2419Janne Grunau (4):
2420
2421- asahi: Fix typo in debug/error message helper macro
2422- asahi: Free low VA BOs correctly
2423- st/mesa: Set gl_config.floatMode based on color_format
2424- asahi,agx: Fix stack buffer overflow in agx_link_varyings_vs_fs
2425
2426Jarred Davies (3):
2427
2428- pvr: Don't ralloc build context from compiler
2429- pvr: Use vk_device's enabled features struct
2430- pvr: Reduce free list initial size when multiple devices are created
2431
2432Jesse Natalie (133):
2433
2434- d3d12: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
2435- microsoft/clc: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
2436- dzn: Remove #if D3D12_SDK_VERSION blocks now that 610 is required
2437- util: Delete Offset() macro from u_memory.h
2438- d3d12: Respect buffer offsets for sampler views
2439- d3d12: Support blit texture uploads
2440- spirv2dxil: Lower quad ops in non-fragment/compute stages
2441- dzn: Remove driconf for quad ops in vertex stages
2442- dzn: Add physical device arg to format lookup
2443- dzn: Support dynamic depth bias via command list instead of PSO
2444- dzn: Use narrow quadrilateral lines when supported
2445- dzn: Support aniso-with-point-mip samplers
2446- dzn: Align-up heap sizes when allocating memory
2447- ci/windows: Update Agility SDK to 1.610.2
2448- dzn: Use unrestricted copy alignments when available
2449- dzn: Handle opaque BC1
2450- dzn: Handle depth bias for point fill mode emulation
2451- dzn: Re-design custom buffer descriptors
2452- ci/dzn: Run almost the full CTS
2453- dzn: Expose core VK1.1 extensions that aren't optional
2454- dzn: Expose core VK1.2 extensions that aren't optional
2455- meson: Don't use masm with VS backend
2456- spirv2dxil: Mark SSBO reads for bindless as CAN_REORDER
2457- microsoft/compiler: Unroll loops in opt passes
2458- dzn: Fix UBO descriptors pointing to the end of the buffer
2459- dzn: Hook up subgroup size to compute shader compilation
2460- dzn: Ensure sample-rate shading is factored into nir hash
2461- dzn: Use the nir hash as an input to the dxil hash
2462- dzn: Ensure subgroup size control is factored into pipeline hash
2463- dzn: Ensure bindless is factored into pipeline/nir hash
2464- dzn: Augment blit resolve to support min/max/sample-zero modes
2465- dzn: Support all available depth/stencil resolve modes
2466- dzn: Support separate depth/stencil resolves via blits
2467- dzn: Delete queue-level event waits
2468- ci/windows: Pick up WARP 1.0.6 NuGet with lots of dzn fixes
2469- dzn: Use A4B4G4R4 instead of B4G4R4A4 when available
2470- spirv2dxil: Lower large temps to scratch
2471- microsoft/compiler: Avoid integer divides by 0
2472- dzn: Run nir_opt_remove_phis before nir_lower_returns
2473- dzn/ci: Remove 'exclude' for graphicsfuzz cases
2474- microsoft/compiler: Allocate space for I/O and viewID dependency tables before instruction processing
2475- microsoft/compiler: Do basic I/O analysis for dependency tables
2476- spirv2dxil: Support int64 and doubles
2477- d3d12: Convert from D3D shader model to Mesa shader model earlier
2478- dzn: Enable 64-bit ints and floats
2479- microsoft/compiler: Take inputs from callers before providing nir options
2480- microsoft/compiler: Enable packed dot product intrinsics for SM6.4+
2481- dzn: Enable KHR_shader_integer_dot_product
2482- nir_lower_system_values: Add ASSERTED to assert-only variable
2483- nir: Load/store atomic op indices when lowering image intrinsics
2484- microsoft/compiler: Remove alu type info from store_dest()
2485- microsoft/compiler: Duplicate some SSA values to simplify SSA typing
2486- microsoft/compiler: Back-propagate type requirement information
2487- dxil: Use unified atomics
2488- vulkan: Win32 sync import/export support
2489- dzn: Don't zero an output struct that can have pNext
2490- dzn: Finish implementing KHR_synchronization2
2491- dzn: Dedicated resource cleanup
2492- dzn: External Win32 memory extension
2493- dzn: External Fd memory extension
2494- dzn: Hook up win32 semaphore import/export
2495- dzn: Hook up fd semaphore import/export
2496- docs: Update list of extensions implemented by dzn
2497- glsl: Delete dead intrinsics
2498- microsoft/compiler: Better and simpler bitcast reduction
2499- dzn: Add a no-bindless debug flag
2500- dzn: Fix inverted assert
2501- dzn: Partial revert of 8887852d
2502- dzn: Don't expose copy queues
2503- dzn: Fix src/dest confusion for some non-bindless descriptor copies
2504- wsi/win32: Handle acquiring an image while one is already acquired
2505- nir_lower_returns: Optimize phis before beginning the pass
2506- nir: Add undef phi srcs when adding successors
2507- radv: Don't run opt_remove_phis before lower_returns
2508- dxil: Don't run opt_remove_phis before lower_returns
2509- ci/windows: Update WARP to 1.0.7
2510- microsoft/compiler: Enable emitting type info for textures with <4 comps
2511- microsoft/compiler: Add a pass to assign image formats based on number of components
2512- spirv2dxil: Assign formats to image vars before lowering to bindless
2513- microsoft/compiler: Use image formats to determine texture types
2514- ci/windows: Update WARP to 1.0.7.1
2515- nir_opt_algebraic: Don't shrink 64-bit bitwise ops if pack_split is going to be lowered
2516- nir: Add preserve_mediump as a shader compiler option
2517- microsoft/compiler: Always set support_16bit_alu
2518- microsoft/compiler: Handle mediump
2519- spirv2dxil: Enable mediump
2520- dzn: Don't lower away mediump
2521- microsoft/compiler: Fix the int->uint pass for arrayed I/O
2522- microsoft/compiler: Fix usage of type var in semantic asserts
2523- microsoft/compiler: Viewport/layer as input to GS/HS needs to set feature bit
2524- d3d12: Support PIPE_CAP_VS_LAYER_VIEWPORT
2525- dzn: Don't create D3D objects for secondary command buffers
2526- dzn: Fix incremental binding of VBs
2527- d3d12: Fully initialize UAV desc for null SSBOs
2528- dzn: Don't support VK R4G4B4A4_UNORM_PACK16 unless we have B4G4R4A4
2529- nir_opt_constant_folding: Fix nir_deref_path leak
2530- nir: Add is_null_constant to nir_constant
2531- vtn: Set is_null_constant
2532- nir_split_struct_vars: Support more modes and constant initializers
2533- nir: Allow atomics as non-complex uses for var-splitting passes
2534- nir_lower_ubo_vec4: Delete an invalid assert
2535- nir_lower_mem_access_bit_sizes: Add a bit_size input to the callback
2536- nir_lower_mem_access_bit_sizes: Move options into a struct
2537- nir_lower_mem_access_bit_sizes: Support unaligned stores via a pair of atomics
2538- nir: Fix constant expression for unpack_64_4x16
2539- nir: Optimize unpacking 16 bit values that were originally packed
2540- microsoft/clc: Try harder to optimize memcpys before lowering them
2541- microsoft/clc: Fix progress reporting for some lowering
2542- microsoft/compiler: Support vec/struct const vals
2543- microsoft/compiler: Improvements to constant -> shader_temp pass used for CL
2544- microsoft/compiler: Add some more lowering passes for derefs
2545- microsoft/compiler: Emit const accesses as load_deref
2546- microsoft/compiler: Use mem_constant instead of shader_temp for consts
2547- microsoft/compiler: Un-lower shared/scratch to derefs
2548- spirv2dxil: Don't lower shared/temp to explicit I/O
2549- microsoft/compiler: Support load_ubo_vec4
2550- dxil: Don't generate load_ubo_dxil directly
2551- dxil: Delete load_ubo_dxil intrinsic
2552- microsoft/compiler: Don't lower bit sizes for movs
2553- microsoft/compiler: Don't over-align raw buffer load/store intrinsics
2554- dxil: Remove custom SSBO lowering
2555- nir_lower_returns: Mark assert-only var as ASSERTED
2556- dzn: Ignore export access parameters
2557- dzn: Inline D3D12 device creation in physical device creation
2558- dzn: Use common GetPhysicalDeviceFeatures2
2559- dzn: Remove dynamic check for block-compressed support
2560- dzn: Fix multisample counts in device limits
2561- dzn: Align placed footprints used when copying linear <-> optimal for BC formats
2562- dzn: VK_EXT_external_memory_host
2563- radv: Fix label name
2564- microsoft/clc: Fix usage of nir_builder_at
2565- ci/windows: Re-enable Windows builds
2566- d3d12: Fix indexing of local_reference_state
2567
2568Jiadong Zhu (1):
2569
2570- ac: enable SHADOW_GLOBAL_CONFIG for preemptible ib
2571
2572Jianxun Zhang (8):
2573
2574- iris: Fix memory alignment when importing dmabuf (GFX12.5)
2575- include/uapi: Update drm_fourcc.h from drm kernel
2576- intel/isl: Add MTL RC CCS modifier into modifier info
2577- iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS modifier
2578- intel/isl: Add MTL RC CCS CC modifier into modifier info
2579- iris: Support I915_FORMAT_MOD_4_TILED_MTL_RC_CCS_CC modifier
2580- intel/isl: Add MTL MC CCS modifier into modifier info
2581- iris: Support MTL modifier MC_CCS
2582
2583Jonathan Gray (1):
2584
2585- intel/dev: remove dg2 0x5698 pci id
2586
2587Jordan Justen (26):
2588
2589- intel/compiler/gfx12.5+: Lower 64-bit cluster_broadcast with 32-bit ops
2590- mesa/main: Exit early when trying to create an unsupported context API
2591- iris: Flush untyped dataport cache when HDC flush is requested on compute
2592- iris: Flush untyped dataport cache DC flush is requested on compute
2593- anv: Clear untyped dataport cache flush bit if not in GPGPU mode
2594- anv: Flush untyped dataport cache when HDC flush is requested on compute
2595- anv: Flush untyped dataport cache DC flush is requested on compute
2596- intel/devinfo: Add has_set_pat_uapi
2597- intel/devinfo: Define PAT indices used on MTL
2598- iris/bufmgr: Add iris_pat_index_for_bo_flags()
2599- iris/bufmgr: Skip bucket allocation if not using writeback cache PAT index
2600- iris: Map aux-map with WC on MTL+ (has_set_pat_uapi)
2601- drm-uapi/i915_drm.h: Update from drm-next (2023-06-09)
2602- iris: Use set PAT extension on BO creation for MTL
2603- anv: Use set PAT extension on BO creation for MTL
2604- intel/devinfo/i915: Set has_set_pat_uapi for MTL+
2605- intel/genxml: Add COMPCS0 aux-table registers
2606- anv: Program compute aux-map base address during queue init
2607- anv: Use correct CCS0 aux-map register offset in pipe flush
2608- isl: Add ISL_SURF_USAGE_STREAM_OUT_BIT
2609- anv,iris,hasvk: Use ISL_SURF_USAGE_STREAM_OUT_BIT for setting stream-out MOCS
2610- isl/dev: Add uncached MOCS value
2611- isl: Set MOCS to uncached for MTL stream-out
2612- intel/dev: Use RPL-U name on RPL-U devices
2613- intel/dev: Add more RPL PCI IDs
2614- intel/dev: Update device string for MTL PCI ID 0x7d55
2615
2616Joshua Ashton (8):
2617
2618- radv: Do not enable robustness for push constants with robustBufferAccess2
2619- radv: Refactor buffer robustness to an enum
2620- radv: Rename radv_nir_compiler_options::robust_buffer_access to robust_buffer_access_llvm
2621- radv: Split and move buffer robustness to shader key
2622- radv: Rename radv_required_subgroup_info to radv_shader_stage_key
2623- radv: Implement VK_EXT_pipeline_robustness
2624- radv: Advertise VK_EXT_pipeline_robustness
2625- radv: Remove unused pipeline param from radv_generate_pipeline_key
2626
2627Joshua Watt (2):
2628
2629- drm-shim: Set file type in readdir()
2630- drm-shim: Use anonymous file for file override
2631
2632José Fonseca (2):
2633
2634- wgl: Fix unintentional assignment on assert.
2635- wgl: Remove needless \`if (1) { ... }`.
2636
2637José Roberto de Souza (29):
2638
2639- iris: Move i915 batch destroy logic to iris_i915_destroy_batch()
2640- iris: Initialize batch screen in iris_init_batch()
2641- iris: Move iris_batch i915 specific variables to union
2642- iris: Create, destroy and replace Xe engines
2643- iris: Implement batch_check_for_reset() in Xe kmd backend
2644- iris: Set priority to Xe engines
2645- iris: Fix close of exported bos
2646- intel/common: Add gt_id to intel_engine_class
2647- iris: Implement batch_submit() in Xe kmd backend
2648- iris: Fix vm bind of imported bos from other GPUs
2649- build: Add Iris and ANV to ARM's auto-generated drivers
2650- anv: Take into consideration physical device max heap size to set maxStorageBufferRange
2651- iris: Allow shared scanout buffer to be placed in smem as well
2652- iris: Add a function to return allocated bo mmap mode
2653- iris: Add function to return mmap mode for userptr bos
2654- iris: Add function to return mmap mode for aux map
2655- anv: Set memory types supported by Xe KMD
2656- anv: Fix ANV_BO_ALLOC_NO_LOCAL_MEM flag
2657- anv: Nuke ANV_BO_ALLOC_WRITE_COMBINE
2658- iris: Fix return of xe_batch_submit() when exec fails
2659- iris: Replace aperture_bytes by sram size in iris_resource_create_for_image() for PIPE_USAGE_STAGING
2660- intel: Fix support of kernel versions without DRM_I915_QUERY_ENGINE_INFO
2661- iris: Attach a dma-buf to bo flink
2662- iris: Implement external object implicit syncronization for Xe kmd
2663- anv: Fix compute maximum number of threads value
2664- anv: Fix some mismatches of canonical and regular addresses around anv_bo_vma_alloc_or_close()
2665- anv: Drop unnecessary intel_canonical_address() call around anv_address_physical()
2666- anv: Drop unnecessary intel_canonical_address() calls around bo->offset
2667- iris: Convert slab address to canonical
2668
2669Juan A. Suarez Romero (34):
2670
2671- v3d: set depth compare function correctly
2672- v3d: use primitive type to get stream output offset
2673- v3d/ci: annotate failure
2674- v3dv/ci: rename waiver test
2675- v3d: add support for ARB_texture_cube_map_array
2676- v3d/ci: enable glsl 1.30 and 1.40 piglit tests
2677- v3d: apply 1D texture miplevel alignment in arrays
2678- v3d/ci: update neverball-v2 trace reference
2679- vc4/ci: skip unsupported test versions
2680- vc4/ci: disable VC4 jobs
2681- v3d: add per hw-version caller macro
2682- v3d: upgrade V3D 4.1 to 4.2 version
2683- v3d: apply proper clamping when setting up RT
2684- v3d/ci: annotate failures
2685- vc4/ci: re-enable VC4 testing
2686- v3d: delay offset/counter values with primitive restart
2687- v3d/ci: run GPU piglit profile
2688- v3d/ci: make traces test mandatory
2689- v3d: enable NIR compact arrays
2690- vc4: set blit mask correctly
2691- vc4: call blit paths in chain
2692- vc4: allow tile-based blit for Z/S
2693- vc4: add specific stencil blit path
2694- v3d/v3dv/ci: adjust job fractions
2695- v3dv/vc4/ci: update expected results
2696- v3d/ci: update traces
2697- v3d: Z/S blit require Z/S formats
2698- broadcom/ci: update expected results
2699- v3d: handle samplerExternalOES
2700- broadcom/ci: update expected results
2701- gallium/util: fix color clamp for alpha-only formats
2702- v3d: clear alpha-only as red-only
2703- vc4/v3d/ci: update expected results
2704- v3d/ci: add new flake
2705
2706Julia Tatz (7):
2707
2708- zink: Implement PIPE_CAP_OPENCL_INTEGER_FUNCTIONS and PIPE_CAP_INTEGER_MULTIPLY_32X16.
2709- zink: Implement PIPE_CAP_RESOURCE_FROM_USER_MEMORY
2710- zink: fix layout(local_size_variable) for vk1.3+
2711- zink/ci: update expected results
2712- aux/trace: fix (u)int dump
2713- gallium/dri: fix dri2_from_names
2714- aux/trace: fix set_hw_atomic_buffers method name
2715
2716Julia Zhang (1):
2717
2718- virgl: remove check of VIRGL_CAP_V2_UNTYPED_RESOURCE
2719
2720Julian Hagemeister (1):
2721
2722- Gallium: Fix shared memory segment leak
2723
2724Juston Li (24):
2725
2726- venus: use pipelineCacheUUID for shader cache id
2727- venus: filter out queue familes with exclusive sparse binding support
2728- venus: add helper function support for VkBindSparseInfo
2729- venus: add back sparse binding support
2730- venus: enable sparse binding features
2731- venus: enable sparse binding properties
2732- venus: sync to latest protocol header from v1.3.252
2733- venus: sync protocol for multiple extensions for zink
2734- venus: enable VK_EXT_non_seamless_cube_map
2735- venus: enable VK_EXT_dynamic_rendering_unused_attachments
2736- venus: enable VK_KHR_shader_clock
2737- venus: enable VK_EXT_border_color_swizzle
2738- venus: enable VK_EXT_fragment_shader_interlock
2739- venus: enable VK_EXT_shader_subgroup_ballot
2740- venus: enable VK_EXT_color_write_enable
2741- docs: venus: update extension support
2742- radv: fix incorrect size for primitives generated query
2743- venus: factor out flush barrier cmd
2744- venus: expose vn_feedback_buffer_create()
2745- venus: add query pool feedback cmds
2746- venus: track viewMask
2747- venus: track render pass
2748- venus: batch query feedback and defer until after render pass
2749- venus: use feedback for vkGetQueryPoolResults
2750
2751Karmjit Mahil (44):
2752
2753- pvr: Add missing includes in pvr_common.h
2754- pvr: Implement vkCmdUpdateBuffer().
2755- pvr: Implement simple internal format v2 transfer paths.
2756- pvr: Add deferred RTA clears for cores without gs_rta_support.
2757- pvr: Finish pvr_perform_start_of_render_attachment_clear().
2758- pvr: Collect vertex input data and fill info struct.
2759- pvr: Fix a comment in the PDS code
2760- pvr: Fix typo in PDS function name
2761- pvr: Add handling for missing entries in pvr_setup_vertex_buffers()
2762- pvr: Handle special built-in variable loading in vertex shader
2763- pvr: Add PVR_DW_TO_BYTES()
2764- pvr: Fix pvr_csb_bake() list return.
2765- pvr: Change push_constants_shader_stages to type pvr_stage_allocation
2766- pvr: Fix static assert check
2767- pvr: Fix unaligned VDMCTRL_PDS_STATE1 data address
2768- pvr: Don't advertise S8_UINT support
2769- pvr: Fix cs corruption in pvr_pack_clear_vdm_state()
2770- pvr: Add missing NULL checks in some vkDestroy...() functions
2771- pvr: Use original binding numbers instead of reassigning
2772- pvr: Remove custom status in command buffer
2773- pvr: Fix missing invalidation of the command buffer
2774- pvr: Fix possible allocation of 0 size
2775- pvr: Fix vk_free() in vkCreateRenderPass2() error path
2776- pvr: Use the suballocator for queries
2777- pvr: Add pvrsrvkm sync prim set bridge call
2778- pvr: Move pvrsrv sync prim code into new pvr_srv_sync_prim.{c,h}
2779- pvr: Use idalloc as the allocator for sync prims
2780- pvr: Handle barrier load and store flags.
2781- pvr: Fix typo causing seg faults copying immutable samplers
2782- pvr: Fix draw indirect page faults due to missing index list buffer
2783- pvr: Rename temps_count to pds_temps_count
2784- pvr: Fix PDS temps allocation on fragment stage
2785- pvr: HWRT creation simplifications.
2786- pvr: Dedup a check with pvr_is_render_area_tile_aligned()
2787- pvr: Remove outdated finishme
2788- pvr: Fix seg fault on empty descriptor set
2789- pvr: Fix dynamic offset patching
2790- pvr: Fix csb control stream extension
2791- pvr: Fix missing BITFIELD_BIT for winsys frag job flag
2792- pvr: Change winsys flag defines to bitfields
2793- pvr: Setup ZLS depth and stencil load/store separately
2794- docs: Add inital PowerVR driver documentation
2795- pvr: Fix \`for` loop itarator usage
2796- pvr: Fix dynamic desc offset storage
2797
2798Karol Herbst (140):
2799
2800- rusticl: rework CLVec helper function to calculate bounds
2801- rusticl/mem: fix Mem::copy_rect
2802- rusticl/mem: replace buffer_offset_size with CLVec::calc_offset_size
2803- gallium: correctly name the flags of svm_migrate
2804- rusticl/context: add helper to get the max mem alloc size for all devices
2805- rusticl/memory: Rework mapping of memory located in system RAM
2806- rusticl/mem: add get_parent helper
2807- rusticl: add support for fine-grained system SVM
2808- nv50/ir: ignore CL system values
2809- nouveau: allow to enable SVM without having to enable CL
2810- nouveau: nouveau_copy_buffer can deal with user_ptrs just fine
2811- rusticl/event: drop work item before updating status
2812- rusticl: add create_pipe_box to better deal with pipe_box restrictions
2813- rusticl/mem: more region and origin validation
2814- radeonsi: lower mul_high
2815- ac/llvm: support shifts on 16 bit vec2
2816- rusticl: don't set size_t-is-usize for >=bindgen-0.65
2817- rusticl/device: improve advertisement of fp64 support
2818- rusticl/platform: make the initialization more explicit
2819- rusticl/platform: extract env variable parsing from Platform::init
2820- rusticl/platform: add RUSTICL_FEATURES boilerplate
2821- rusticl/device: allow enablement of fp64 via RUSTICL_FEATURES
2822- rusticl/program: rework dynamic Program state
2823- rusticl/program: use if let to get rid of an unwrap in build
2824- clc: free kernel args in clc_free_kernels_info
2825- rusticl/nir: finish blob after serializing
2826- nvc0: do not randomly emit fences.
2827- nv50/ir: Use unified atomics
2828- Reviewed-by: Nora Allen <[email protected]>rusticl/platform: make the extension array a static
2829- rusticl/device: use PLATFORM_EXTENSIONS as a template for filling extensions
2830- rusticl/platform: advertise byte_addressable_store
2831- rusticl/device: split add_ext in fill_extensions
2832- rusticl: explicitly state supported SPIR-V extensions
2833- rusticl/platform: generate extension constants via macro
2834- rusticl/spirv: skip printing info messages
2835- rusticl/device: limit MAX_PARAMETER_SIZE to 32k
2836- rusticl/device: set preferred vector size of doubles if fp64 is enabled
2837- nv50/ir: convert to scoped_barrier
2838- doc/rusticl: add Rust Update Policy
2839- rusticl: bump rust req to 1.60
2840- rusticl/event: flush queues from dependencies
2841- ci: add and use clippy for rusticl
2842- rusticl: fix clippy errors on image_slice_pitch change to usize
2843- clc: relax spec constant validation
2844- rusticl: add proc macro module for generating API stubs
2845- rusticl/icd: make release return nothing
2846- rusticl/icd: use new proc macros
2847- ac/llvm: support vec2 on b2i16
2848- ac/llvm: replace MESA_SHADER_COMPUTE checks with gl_shader_stage_is_compute
2849- ac/llvm: set +cumode for radeonsi
2850- lp: align memory for long16 CL types
2851- rusticl/icd: fix ReferenceCountedAPIPointer::from_ptr for NULL pointers
2852- rusticl/api: remove some repr(C)
2853- rusticl/event: ensure even status is updated in order
2854- docs: improve OpenCL features
2855- rusticl/queue: overhaul of the queue+event handling
2856- rusticl: enforce using unsafe blocks in unsafe functions
2857- nv50/ir: use override
2858- nv50/ir: resolve -Woverloaded-virtual=1 warnings
2859- clc: add commment to clc_optional_features to ensure no padding exists
2860- rusticl/spirv: Key optional clc features when caching.
2861- clc: static assert that clc_optional_features has no padding
2862- nouveau: eliminate busy waiting on fences
2863- rusticl/device: add intel usm queries DPCPP cares about
2864- rusticl/device: sort cl_device_info queries
2865- rusticl/version: use cl_version instead of cl_uint and provide a From impl
2866- rusticl: advertize cl_khr_extended_versioning
2867- docs/cl: fix whitespace issues and add missing entries
2868- rusticl: advertize cl_khr_spirv_no_integer_wrap_decoration
2869- docs/cl: improve reporting of image features
2870- rusticl/mem: cache the pipe_format
2871- rusticl/mem: fix validation of packed image formats
2872- rusticl/format: pass order and type to rusticl_image_format directly
2873- rusticl/format: extract CL format to pipe format mapping into const function
2874- rusticl/format: extract required format checks into const functions
2875- rusticl/format: drop req_for_3d_image_write_ext
2876- rusticl/format: add required format table for CL2.0
2877- rusticl/format: document cl to pipe format mapping
2878- rusticl/format: move format table generation into a macro
2879- rusticl/format: enable all trivial to support optional image formats
2880- clc: fix SPIRVMessageConsumer for NULL src
2881- clc: allow passing custom validator options
2882- rusticl/program: pass our max param size along to the spirv validator
2883- compiler/types: fix size of padded OpenCL Structs
2884- rusticl/device: rename doubles to fp64 and long to int64
2885- rusticl: experimental support for cl_khr_fp16
2886- rusticl: add ld_args_gc_sections
2887- rusticl: specify which symbols to export
2888- rusticl: stop linking with libgalliumvl
2889- rusticl/device: create helper context before loading libclc
2890- nir/load_libclc: run some opt passes for everybody
2891- docs: document CLC_DEBUG
2892- rusticl/program: add debugging for OpenCL C compilation
2893- rusticl/program: add debugging option to disable SPIR-V validation
2894- nvc0: fix printing shaders
2895- nv50/ir/nir: set numBarriers if we emit an OP_BAR
2896- rusticl: structurize and reorder mesa binding args
2897- rusticl: generate bindings for build-id stuff
2898- rusticl/meson: extract common bindgen rust args
2899- rusticl/mesa: create proper build-id hash for the disk cache
2900- rusticl: bump bindgen requirement
2901- rusticl/program: skip linking compiled binaries
2902- docs/rusticl: mark building section as such
2903- docs/rusticl: add Enabling section
2904- docs/cl: remove cl_khr_byte_addressable_store from extension list.
2905- docs/cl: move vec3 support under OpenCL C 1.1
2906- docs/cl: timer sync is implemented
2907- docs: add missing get_compute_state_info documentation
2908- vtn: more CL subgroups
2909- clc: rework optional subgroup feature
2910- llvmpipe: report the proper subgroup size
2911- gallium: add simd_sizes to pipe_compute_state_object_info
2912- gallium: add get_compute_state_subgroup_size
2913- gallium: add PIPE_COMPUTE_CAP_MAX_SUBGROUPS
2914- iris: implement get_compute_state_subgroup_size
2915- rusticl/util: add an Iterator to iterate over set bits in an integer
2916- rusticl/util: add div_round_up
2917- rusticl/device: rework subgroups to subgroup_sizes
2918- gallium: change PIPE_COMPUTE_CAP_SUBGROUP_SIZE to a bitfield of sizes
2919- rusticl: deal with compute_param returning 0
2920- rusticl: support subgroups
2921- nvc0: backport fp helper invocation fix to 2nd gen Maxwell+
2922- rusticl/kernel: silence newer clippy warning
2923- rusticl: Replace &Arc<Device> with &Device
2924- rusticl/device: make it &'static
2925- api/icd: drop static lifetime from \`get_ref` return type
2926- nvc0: initial Ada enablement
2927- rusticl: fix warnings with newer rustc
2928- nv50/ir/nir: fix txq emission on MS textures
2929- nv50/ir/nir: Fix zero source handling of tex instructions.
2930- rusticl/kernel: only handle function_temp memory before lowering printf
2931- n50/compute: submit initial compute state in nv50_screen_create
2932- nv50: fix code uploads bigger than 0x10000 bytes
2933- nouveau: take glsl_type ref unconditionally
2934- nv50: limit max code uploads to 0x8000
2935- clc: use CLANG_RESOURCE_DIR for clang's resource path
2936- zink: fix source type in load/store scratch
2937- zink: fix global stores
2938- rusticl/disk_cache: fix stack corruption
2939- rusticl/memory: do not verify pitch for IMAGE1D_BUFFER
2940
2941Kenneth Graunke (17):
2942
2943- intel/compiler: UNDEF comparisons with smaller than 32-bit
2944- intel/compiler: UNDEF SubgroupInvocation's register
2945- intel/compiler: Fold constants after distributing source modifiers
2946- nir: Add a variant of nir_lower_int64 for float conversions only
2947- intel/compiler: Postpone most int64 lowering to brw_postprocess_nir
2948- nir: Add find_lsb lowering to nir_lower_int64.
2949- intel/compiler: Fix 64-bit ufind_msb, find_lsb, and bit_count
2950- nir: Assert that we don't shrink bit-sizes in nir_lower_bit_size()
2951- intel/compiler: Fix a fallthrough in components_read() for atomics
2952- intel/genxml: Drop Tiled Resource Mode fields
2953- intel: Initialize FF_MODE2 on all Gfx12 platforms
2954- iris: Allocate coherent buffers for resources flagged as persistent/coherent
2955- isl: Don't set "Enable Unorm Path in Color Pipe" on Alchemist
2956- intel/genxml: Fix gen_sort_tags.py to handle mbz/mbo
2957- intel/genxml: Update RENDER_SURFACE_STATE Fields
2958- iris: Re-emit 3DSTATE_DS for each primitive (workaround 14019750404)
2959- iris: Check prog[] instead of uncompiled[] for BLORP state skipping
2960
2961Kiskae (1):
2962
2963- vulkan/wsi: check for dri3 buffer initialization failure
2964
2965Konrad Dybcio (2):
2966
2967- freedreno: Add some A6/7xx registers
2968- freedreno: Partially decode CP_PROTECT_CNTL
2969
2970Konstantin Kharlamov (1):
2971
2972- loader/dri3: temporarily work around a crash when front is NULL
2973
2974Konstantin Seurer (133):
2975
2976- nir/lower_fp16_casts: Fix SSA dominance
2977- nir/lower_io: Emit less iadd(x, 0)
2978- nir: Make rq_load committed src an index
2979- radv: Stop running constant folding during ray query lowering
2980- radv/ci: Test ray tracing pipelines
2981- gallium/nir: Handle unified atomics in nir_to_tgsi_info
2982- nir/inline_uniforms: Handle num_components > 1
2983- nir/lower_shader_calls: Remat derefs earlier
2984- radv: Stop using radv_get_int_debug_option
2985- treewide: Add a .clang-format file
2986- amd: Use the Mesa base style
2987- asahi: Use the Mesa base style
2988- freedreno: Use the Mesa base style
2989- d3d12: Use the Mesa base style
2990- i915: Use the Mesa base style
2991- r600/sfn: Use the Mesa base style
2992- panfrost: Use the Mesa base style
2993- util/perf: Use the Mesa base style
2994- venus: Use the Mesa base style
2995- asahi: Reformat using the new style
2996- panfrost: Reformat using the new style
2997- gallivm: Fix gather/scatter types for newer llvm
2998- radv/rt: Fix pipeline libraries
2999- gallivm: Fix anisotropic sampling with num_mips=1
3000- gallivm: Cast read_first_invocation source to an int
3001- llvmpipe: refactor out the pipe->lp_jit structure fillers.
3002- llvmpipe: Add lp_storage_image_format_supported
3003- llvmpipe: Add lp_storage_render_image_format_supported
3004- gallivm: Add lp_build_nir_sample_key
3005- gallivm: Add lp_img_op_from_intrinsic
3006- gallivm: Handle invalid image format/op combinations
3007- gallivm: Zero initialize param structs
3008- radv/rt: Do not guard the raygen shader
3009- radv/rt: Clear NIR metadata after lowering the ABI
3010- aco/rt: Do not initialize the next shader addr
3011- radv/ci: Test ray tracing on vkd3d-proton
3012- radv/rt: Stop forcing wave32 by setting compute_subgroup_size
3013- Revert "radv: Enable ray tracing pipelines by default"
3014- radv/rt: Enable RT pipelines on GFX10_3+ excluding vangogh
3015- radv: Move the shader type to radv_shader_info
3016- radv: Adjust the traversal shader description
3017- radv: Use get_shader_from_executable_index for executable properties
3018- radv: Implement executable properties for ray tracing stages
3019- radv: Use _mesa_shader_stage_to_string for executable name
3020- radv/rt: Store the prolog outside the shaders array
3021- radv: Call radv_pipeline_init_scratch per shader
3022- meson: Add a xcb-keysyms dependency
3023- vulkan: Common trace capturing infrastructure
3024- radv: Add radv_trace_mode
3025- vulkan/wsi/x11: Capture traces using a hotkey
3026- radv/rra: Use common trace trigger
3027- radv/rgp: Use common trace trigger
3028- vulkan/rmv,radv: Use common trace trigger
3029- docs: Update envvars used for tracing
3030- amd: Use nir\_ instead of nir_build\_ helpers
3031- microsoft: Use nir\_ instead of nir_build\_ helpers
3032- intel: Use nir\_ instead of nir_build\_ helpers
3033- freedreno: Use nir\_ instead of nir_build\_ helpers
3034- vtn: Use nir\_ instead of nir_build\_ helpers
3035- nir: Use nir\_ instead of nir_build\_ helpers
3036- nir/builder_opcodes: Remove nir_build\_ prefixed helpers
3037- util: Do not include immintrin.h in half_float.h
3038- radv/rt: Fix caching non-recursive stages
3039- radv/rt: Hash stages using radv_hash_shaders
3040- llvmpipe: Add BDA jit type helpers
3041- gallivm: Add missing includes
3042- gallivm: Add lp_descriptor struct
3043- gallivm: Expose lp_build_sample_soa_code
3044- llvmpipe: Add lp_build_sampler_soa_dynamic_state
3045- llvmpipe: Add lp_build_image_soa_dynamic_state
3046- gallivm: Add LP_IMG_OP_COUNT
3047- gallivm: Expose LP_MAX_TEX_FUNC_ARGS
3048- llvmpipe: Add LP_TOTAL_IMAGE_OP_COUNT
3049- gallivm: Expose lp_build_texel_type
3050- gallivm: Propagate vulkan resources
3051- gallivm: Clamp the texel buffer size
3052- llvmpipe: Pre compile sample functions
3053- gallivm: Add a function for loading vulkan descriptors
3054- gallivm: Implement vulkan UBOs
3055- gallivm: Implement vulkan SSBOs
3056- gallivm: Implement vulkan textures
3057- gallivm: Implement vulkan images
3058- llvmpipe: Disable the linear path when running vulkan
3059- lavapipe: Include llvmpipe
3060- lavapipe: Lower more texture OPs
3061- lavapipe: Make pipeline_lock generic for accessing the queue
3062- lavapipe: Rework descriptor handling
3063- lavapipe: Lower non uniform access
3064- lavapipe: EXT_descriptor_indexing
3065- llvmpipe: Use lp_jit_buffer_from_pipe_const in setup
3066- lavapipe: Make shader compilation thread safe
3067- zink: Increase ZINK_FBFETCH_DESCRIPTOR_SIZE to 280
3068- zink/ci: Update lavapipe expectations
3069- venus/ci: Update fails
3070- lavapipe/ci: Update CI expectations for new extensions
3071- llvmpipe/ci: Update expectations
3072- nir: Add nir_builder_at
3073- radv: Use nir_builder_at
3074- asahi: Use nir_builder_at
3075- v3d: Use nir_builder_at
3076- glsl: Use nir_builder_at
3077- nir: Use nir_builder_at
3078- spirv: Use nir_builder_at
3079- freedreno: Use nir_builder_at
3080- gallium,st: Use nir_builder_at
3081- crocus: Use nir_builder_at
3082- etnaviv: Use nir_builder_at
3083- r600: Use nir_builder_at
3084- radeonsi: Use nir_builder_at
3085- vc4: Use nir_builder_at
3086- zink: Use nir_builder_at
3087- lavapipe: Use nir_builder_at
3088- microsoft: Use nir_builder_at
3089- panfrost: Use nir_builder_at
3090- intel: Use nir_builder_at
3091- nir/opt_dead_cf: Handle if statements ending in a jump correctly
3092- nir/builder_opcodes: Do not generate empty intrinsic indices
3093- amd: Move ac_hw_stage to its own file
3094- gallivm: Fix atomic_global types
3095- lavapipe: Set the descriptor count to what vkd3d-proton requires
3096- llvmpipe: Allow comparison sampling for float formats
3097- llvmpipe: Allocate more dummy sample functions for FORMAT_NONE
3098- llvmpipe,lavapipe: Relayout lp_descriptor
3099- lavapipe: Always advertise formatless storage image OPs
3100- nir/lower_shader_calls: Remat derefs after shader calls
3101- nir/opt_dead_cf: Run dead_cf_block while it makes progress
3102- nir/opt_dead_cf: Clarify comment
3103- draw: Do not restart the primitive_id at 0
3104- llvmpipe: Fix compiling with LP_USE_TEXTURE_CACHE
3105- llvmpipe: Zero extend vectors in widen_to_simd_width
3106- vulkan/wsi/x11: Implement capture hotkey using the keymap
3107- radv: Don't use the depth image view for depth bias emission
3108- aco/spill: Make sure that offset stays in bounds
3109
3110Kurt Kartaltepe (1):
3111
3112- drirc: Set limit_trig_input_range option for Nier games
3113
3114Leo Liu (6):
3115
3116- radeonsi: create a new context for transcode with multiple video engines
3117- radeonsi/vcn: AV1 skip the redundant bs resize
3118- radeonsi: Remove redundant vcn_decode from info
3119- amd: Add vcn ip version info
3120- radeonsi: Use vcn version instead of CHIP family for VCNs
3121- radeonsi/vcn: fix the incorrect dt_size
3122
3123Lina Versace (2):
3124
3125- venus: Advertise 1.3 in ICD file
3126- venus: Fix detection of push descriptor set
3127
3128LingMan (4):
3129
3130- rusticl: core: stop using cl_prop from the api module
3131- rusticl: drop CLProp implementation for String
3132- rusticl: drop cl_prop_for_type macro
3133- rusticl: fix UB in CLProp machinery
3134
3135Lionel Landwerlin (185):
3136
3137- docs: add missing MESA_VK_WSI_HEADLESS_SWAPCHAIN variable
3138- vulkan/runtime: discard unused graphics stages in libraries
3139- intel/vec4: force exec_all on float control instruction
3140- anv: enable blorp query reset for performance queries
3141- vulkan/overlay: deal with unknown pNext structures
3142- isl: don't set inconsistent fields for depth when using stencil only
3143- anv: introduce a base graphics pipeline object
3144- anv: move force shading rate writes checks
3145- anv: make input attachments available through bindless
3146- anv: move preprocessing of NIR right before compilation
3147- anv: add dynamic buffer offsets support with independent sets
3148- anv: implement VK_EXT_graphics_pipeline_library
3149- anv: Work around the spec question about pipeline feedback vs GPL.
3150- isl: fix a number of errors on storage format support on Gfx9/12.5
3151- intel/nir: add options to storage image lowering
3152- anv: drop lowered storage images code
3153- anv: enable shaderStorageImageReadWithoutFormat on Gfx12.5+
3154- anv: rework Wa_14017076903 to only apply with occlusion queries
3155- intel/tools: add ability to dump out raw kernels data
3156- nir/divergence: add missing load_global_constant_* intrinsics
3157- anv: fix anv_nir_lower_ubo_loads pass
3158- anv: enable shaderUniformBufferArrayNonUniformIndexing
3159- intel/fs: fix per vertex input clamping
3160- nir/lower_non_uniform_access: add get_ssbo_size handling
3161- intel/compiler: make uses_pos_offset a tri-state
3162- vulkan: bump headers to 1.3.249
3163- spirv: update to latest headers
3164- spirv/nir: wire ray interection triangle position fetch
3165- intel/nir/rt: use a single load for instance leaf loading
3166- intel/nir/rt: wire position fetch intrinsic
3167- anv: implement VK_KHR_ray_tracing_position_fetch
3168- intel/fs: fix scheduling of HALT instructions
3169- anv: remove 48bit address space checks
3170- anv: avoid hardcoding instruction VA constant in shaders
3171- anv: link anv_bo to its VMA heap
3172- anv: make internal address space allocation more dynamic
3173- anv: increase instruction heap to 2Gb
3174- intel/fs: reduce register usage for relocated constants
3175- intel: enable protected context creation along with engines
3176- Revert "intel/compiler: make uses_pos_offset a tri-state"
3177- anv: fixup workaround 16011411144
3178- intel/mi_builder: fixup tests for newer kernel uAPI
3179- intel: switch over to unified atomics
3180- spirv: fix argument to ray query intrinsic
3181- intel/devinfo: printout on stdout
3182- intel/devinfo: allow -p to take a pci-id in hexa
3183- intel/devinfo: call intel_device_info_init_was only once
3184- anv: put private binding BOs into execlists
3185- anv: mark images compressed for untracked layout/access
3186- gitlab-ci: add capture for i915 error state
3187- anv: defer binding table block allocation to when necessary
3188- anv: assume context isolation support
3189- anv: fix push descriptor deferred surface state packing
3190- intel/fs: fix size_read() for LOAD_PAYLOAD
3191- anv: move timestamp vfunc initialization to genX code
3192- anv: use COMPUTE_WALKER post sync field to track compute work
3193- iris: use COMPUTE_WALKER post sync field to track compute work
3194- intel/fs: make tcs input_vertices dynamic
3195- anv: implement EDS2.extendedDynamicState2PatchControlPoints
3196- iris: rework Wa_14017076903 to only apply with occlusion queries
3197- intel: add alignment helper for aux map
3198- iris: add a comment about aux-tt alignment requirements
3199- anv: update aux-tt alignment requirements for MTL
3200- intel: reduce minimum memory alignment on Gfx12.5
3201- anv: further reduce pool alignments
3202- anv: opportunistically align VMA to 2Mb
3203- anv: update internal address space to have 4Gb of dynamic state
3204- anv: fix push range for descriptor offsets
3205- intel/fs: reuse descriptor helper
3206- intel/fs: lower get_buffer_size like other logical sends
3207- nir/lower_shader_calls: add ability to force remat of instructions
3208- nir: add a new intrinsic to describe resources accessed on intel
3209- nir: teach nir_chase_binding about resource_intel
3210- nir/opt_gcm: allow resource_intel to be moved anywhere
3211- intel/fs: add a pass to move resource_intel closer to user
3212- intel/fs: teach ubo range analysis pass about resource_intel
3213- intel/fs: keep track of new resource_intel information
3214- intel/fs: enable SSBO accesses through the bindless heap
3215- intel/fs: enable UBO accesses through bindless heap
3216- intel/fs: enable get_buffer_size on bindless heap
3217- intel/fs: enable extended bindless surface offset
3218- intel/fs: enable bindless sampler state offsets
3219- intel/fs: enable uniform block accesses through bindless heap
3220- intel/fs: try to rematerialize surface computation code
3221- anv: remove unused define
3222- anv: fix null descriptor handling with A64 messages
3223- anv: remove incorrect ifdef
3224- anv: bail flush_gfx_state when not gfx push constant is dirty
3225- anv: track pipeline in anv_cmd_pipeline_state
3226- anv: move pipeline active_stages to common structure
3227- anv: increase workaround BO so that we can hold a full 4Kb page of 0s
3228- anv: toggle extended bindless surface state on Gfx12.5+
3229- docs/anv: some binding table explanations
3230- anv: add an option for using indirect descriptors
3231- anv: introduce a new descriptor set layout type
3232- anv: create a pool for indirect descriptors
3233- anv: reduce push constant size for descriptor sets
3234- anv: new structure to hold surface states
3235- anv: add a pass to partially lower resource_intel
3236- nir: expose a couple of address format add helpers
3237- anv: bound load descriptor mem better
3238- anv: prepare image/buffer views for non indirect descriptors
3239- anv: add support for direct descriptor in allocation/writes
3240- anv: add helpers to build pipeline bindings
3241- anv: handle null surface in the binding table with direct descriptors
3242- anv: factor out dynamic buffer bti emission
3243- anv: implement binding table emission for direct descriptors
3244- anv: simplify ycbcr bti computations
3245- anv: track descriptor data size
3246- anv: add direct descriptor support to apply_layout
3247- anv: bring back the max number of sets to 8
3248- anv: descriptor binding for direct descriptors
3249- anv: ensure descriptor addresses are used with bindless stages
3250- anv: enable direct descriptors on platforms with extended bindless offset
3251- anv: add support for VK_EXT_dynamic_rendering_unused_attachments
3252- anv: remove unused functions
3253- intel/fs: fix a couple of descriptor mistakes
3254- intel/stub_gpu: add an option to launch valgrind
3255- intel/fs: fix pull-constant-load prior to gfx7
3256- anv: allow binding tables allocations on compute only queues
3257- intel/nir: switch ray query state tracking to local variables uint16_t
3258- anv: add query tracepoints
3259- anv: deal with unsupported VkImageFormatListCreateInfo::pViewFormats
3260- anv: report max simd width only once for fragment shaders
3261- anv: always report all pipeline stats regardless of stages
3262- anv: only disable mesh when enabled at the VkDevice level
3263- anv: disable mesh/task for generated draws
3264- anv: fix incorrect batch for 3DSTATE_CONSTANT_ALL emission
3265- anv: limit ANV_PIPE_RENDER_TARGET_BUFFER_WRITES to blorp operations using 3D
3266- anv: factor out generation kernel dispatch into helper
3267- anv: add support for simple internal compute shaders
3268- anv: generalize internal kernel concept
3269- anv: add shaders for copying query results
3270- intel/ds: add query count in query tracepoints
3271- anv: enable CmdCopyQueryPoolResults to use shader for copies
3272- intel/fs: fix bindless/shared surface mistake
3273- intel/fs: print identation for control flow
3274- intel/fs: avoid reusing the VGRF for uniform load_ubo
3275- nir: add a new ubo uniform loading intrinsic for intel
3276- intel/fs: make use of load_ubo_uniform_block_intel
3277- nir: add a load_global_constant uniform intel variant
3278- intel/fs: handle load_global_constant_uniform_block_intel
3279- anv: avoid private buffer allocations in vkGetDeviceImageMemoryRequirementsKHR
3280- anv: add missing query clear flush for acceleration structure queries
3281- anv: track buffer writes from shaders for query results writes
3282- anv: change the way we clear pending query bits
3283- anv: fix pending query bits for compute only command buffers
3284- anv: tracking query buffer writes & query clears separately
3285- anv: switch copy query results with shaders from semaphore waits to flushes
3286- vulkan: registry/headers bump to 1.3.254
3287- vulkan/runtime: add support for EXT_depth_bias_control
3288- anv: add VK_EXT_depth_bias_control support
3289- isl: assert on gfx6 condition that should not be met
3290- isl: assert on gfx7 condition that should not be met
3291- isl: assert on gfx8 condition that should not be met
3292- isl: add surface creation reporting mechanism
3293- anv: align buffers to a cache line
3294- anv: fix utrace batch allocation
3295- genxml: enable decoding on compute engine
3296- intel/aubinator_error_decode: add ccs support
3297- anv: look into batch bo reloc list looking for BOs to decode
3298- anv: implement storage image depth query using descriptor buffer read
3299- Revert "isl: Set Depth to array len for 3D storage images"
3300- docs/features: update anv entries
3301- intel/fs: disable coarse pixel shader with interpolater messages at sample
3302- nir/opt_shrink_vectors: enable sparse intrinsics shrinking
3303- docs/features: add more missing extensions
3304- docs/features: add hasvk entries
3305- zink: update profile vulkan version requirements
3306- zink: drop linear D32_SFLOAT_S8_UINT requirement
3307- anv: fix utrace signaling with Xe
3308- intel/fs: fix missing predicate on SEL instruction
3309- intel/fs: don't try to rebuild sequences of non ssa values
3310- anv: fix 3DSTATE_RASTER::APIMode field setting
3311- hasvk: fix null descriptor handling with A64 messages
3312- anv: don't try to access dynamic buffers from surface states
3313- intel/compiler: disable per-sample interpolation modes with non-per-sample dispatch
3314- anv: add missing ISL storage usage
3315- intel/nir: rerun lower_tex if it lowers something
3316- hasvk: add state cache invalidation back before fast clears
3317- anv: fix utrace timestamp buffer copies
3318- intel: don't assume Linux minor dev node
3319- blorp: switch blorp_update_clear_color to early return
3320- blorp: update and move fast clear PIPE_CONTROLs to drivers
3321- iris: ensure stalling pipe control before fast clear
3322
3323Liviu Prodea (1):
3324
3325- microsoft/clc: Don't build compiler test if build-tests is false
3326
3327Lone_Wolf (3):
3328
3329- compiler/clc: Fix embedded clang headers (microsoft-clc) for LLVM 16+
3330- clc: Add clangASTMatchers to fix static llvm build of microsoft-clc with LLVM 16+
3331- clc: Add clang frontendhlsl module to fix build of microsoft-clc with llvm 16+
3332
3333Luc Ma (1):
3334
3335- meson: keep Mako version checking in accord with build msg
3336
3337Luca Bacci (1):
3338
3339- Add checks for NULL dxil_validator
3340
3341Luca Weiss (1):
3342
3343- freedreno: Enable A506
3344
3345Lucas Fryzek (6):
3346
3347- broadcom: Add support for VK_FORMAT_A2R10G10B10_UNORM_PACK32
3348- broadcom: Fix slice memory allocation logic for compressed textures
3349- v3d: Add support for ASTC texture compression
3350- v3dv: Update texture padding logic to match v3d changes
3351- mailmap: Add Lucas Fryzek to mailmap
3352- gallium: Remove \`PIPE_CAP_RGB_OVERRIDE_DST_ALPHA_BLEND`
3353
3354Lucas Stach (16):
3355
3356- etnaviv: update derived state after forced commandstream flush
3357- etnaviv: don't flush implicit flush resources when forced
3358- etnaviv: rs: flush TS cache before making configuration changes
3359- etnaviv: rs: unconditionally flush color and depth cache before using RS
3360- etnaviv: optimize transfer flushes
3361- etnaviv: query: move sample counter manipulation into query providers
3362- etnaviv: query: reset sample count on begin_query
3363- etnaviv: query: remove incorrect comment
3364- etnaviv: query: correct max number of perfmon samples
3365- etnaviv: query: correct max number of occlusion query samples
3366- etnaviv: query: optimize context flushes
3367- mesa/st: discard whole resource when mapping drawpixels texture
3368- etnaviv: only emit sampler config for changed samplers
3369- etnaviv: move resource level dimension members to make comments line up
3370- etnaviv: rs: fix multisampled blits
3371- etnaviv: blt: fix multisampled blits
3372
3373Luigi Santivetti (13):
3374
3375- pvr: use PVR_DW_TO_BYTES for stream_link_space calculation
3376- pvr: add GUARD_SIZE_DEFAULT for CDM and VDM control stream links 1 and 2
3377- pvr: fixup stack overflow in {start,end}_sub_cmd
3378- pvr: introduce suballocator for internal allocations
3379- pvr: switch pvr_gpu_upload_* to use pvr_bo_suballoc
3380- pvr: switch pvr_cmd_buffer_alloc_mem to use pvr_bo_suballoc
3381- pvr: switch pvr_descriptor_set_create to use pvr_bo_suballoc
3382- pvr: switch pvr_clear to use pvr_bo_suballoc
3383- pvr: switch pvr_spm to use pvr_bo_suballoc
3384- pvr: fixup assert in pvr_cmd_buffer_alloc_mem
3385- pvr: fix division by block size in blit
3386- pvr: fixup transfer primary sub-command list
3387- pvr: do not claim support for ASTC texture compression
3388
3389Luna Nova (5):
3390
3391- device_select_layer: fix inverted strcmp in device_select_find_dri_prime_tag_default (v1)
3392- device_select_layer: apply DRI_PRIME even if default device is > 1 to match opengl behavior
3393- device_select_layer: pick a default device before applying DRI_PRIME
3394- device_select_layer: add MESA_VK_DEVICE_SELECT_DEBUG which logs why default selection was made
3395- device_select_layer: log selectable devices if MESA_VK_DEVICE_SELECT_DEBUG or DRI_PRIME_DEBUG are set
3396
3397Lynne (4):
3398
3399- radv/video: reject general unsupported video formats
3400- radv/video: reject non-8bit H264
3401- radv/video: reject unsupported hevc profiles and bit depths
3402- anv_video: reject decoding of unsupported profiles and formats
3403
3404M Henning (12):
3405
3406- nvc0: Use nir in nvc0_program_init_tcp_empty
3407- nvc0: Use nir in nvc0_blitter_make_vp
3408- nv50,nvc0: Use nir in nv50_blitter_make_fp
3409- nv50,nvc0: Stop advertising TGSI by default
3410- nv50,nvc0: Use ttn for tgsi shaders by default
3411- gallium: Add pipe_shader_state_from_nir
3412- nouveau/codegen: Check nir_dest_num_components
3413- nv50/codegen: Set lower_uniforms_to_ubo
3414- nouveau/nir: Set isSigned on all atomic_imax/imin
3415- nv50,nvc0: Free nir from blitter fp shader
3416- nvc0: Free blitter->vp
3417- nv50: Fix return type of nv50_blit_is_array
3418
3419Marcin Ślusarz (17):
3420
3421- intel: split URB space between task and mesh proportionally to entry sizes
3422- anv: move nir_shader_gather_info to anv_pipeline_nir_preprocess
3423- intel/tools: decode ACTHD printed by newer kernels
3424- nir: extract try_lower_id_to_index_1d
3425- nir: use wg id to wg idx shortcut if two dims of num_workgroups are 1
3426- nir: use constant components of num_workgroups in wg id to wg idx lowering
3427- nir: lower num_workgroups to constants
3428- intel/compiler: pass num_workgroups from task to mesh shaders
3429- nir: add cheap shortcut for wg id to wg idx lowering
3430- anv,intel/compiler: enable shortcut in wg id to wg idx lowering on >= gfx12.5
3431- intel/compiler: simplify reading of gl_NumWorkGroups in task/mesh
3432- anv: fix how NULL buffer_view is handled in anv_descriptor_set_write_buffer_view
3433- anv: pass anv_surface_state using a pointer
3434- anv: limit stack usage for anv_surface_state
3435- intel/compiler/mesh: compactify MUE layout
3436- intel/compiler,anv: put some vertex and primitive data in headers
3437- intel/compiler: load debug mesh compaction options once
3438
3439Marek Olšák (169):
3440
3441- nir: fix 2 bugs in nir_create_passthrough_tcs
3442- nir: lower load_barycentric_at_offset in lower_wpos_ytransform
3443- nir: assign IO bases in nir_lower_io_passes
3444- nir: skip nir_lower_io_passes for compute shaders
3445- nir: extend nir_opt_fragdepth to handle lowered IO
3446- nir: handle more opcodes in nir_lower_io_to_scalar
3447- nir: handle all varying slots in gl_varying_slot_name_for_stage
3448- nir: don't remove dead IO variables in nir_lower_io_passes for st_link_nir
3449- nir: rework nir_lower_color_inputs to work with lowered IO intrinsics
3450- nir: return a status from nir_remove_varying whether it removed the instruction
3451- nir: remove an obsolete comment from nir_gather_xfb_info_from_intrinsics
3452- nir: add next_stage parameter to nir_slot_is_sysval_output to return better info
3453- nir: add next_stage parameter to nir_remove_varying
3454- nir: set uses_wide_subgroup_intrinsics for all shader stages
3455- venus: fix the RHEL8 build by using syscall for gettid
3456- nir: rename ACCESS_STREAM_CACHE_POLICY -> ACCESS_NON_TEMPORAL and document
3457- nir: add/update comments for gl_access_qualifier
3458- ac/surface: don't expose modifiers with DCC retiling if radeon_info forbids it
3459- ac/gpu_info: disable display DCC on Raphael and Mendocino to improve power usage
3460- radeon: add radeon_info parameter into radeon_winsys::surface_init
3461- radeonsi: do AMD_DEBUG=nodisplaydcc differently to also remove modifiers
3462- aco: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT
3463- ac/llvm: don't treat ACCESS_NON_READABLE as ACCESS_COHERENT
3464- ac/llvm: rewrite and unify how GLC, DLC, SLC are set
3465- nir/lower_io: don't renumber VS inputs when not called from a linker
3466- ac/surface: fix address calculation for large images by using uint64_t
3467- radv: fix sparse image address calculation for large images by using uint64_t
3468- radv: fix SDMA image address calculation for large images by using uint64_t
3469- radeonsi: fix SDMA image address calculation for large images by using uint64_t
3470- radeonsi: fix image address calculation for large images by using uint64_t
3471- radeonsi: fix sparse image address calculation for large images by using uint64_t
3472- radeonsi: fix image size calculation in fast clear
3473- ac/surface: clean up and move the PIPE_CONFIG helper to ac_surface.c
3474- ac/surface: define LINEAR_PITCH_ALIGNMENT
3475- ac/surface: validate overridden pitch for all chips
3476- ac/surface: fix overridden linear pitch for CPU access
3477- ac/surface: add ac_surf_config::is_array
3478- amd/registers: update pitch definitions in descriptors
3479- mesa: fix a VBO buffer reference leak in _mesa_bind_vertex_buffer
3480- ac,radeonsi,winsyses: switch to SPDX-License-Identifier: MIT
3481- winsys/radeon: set has_image_opcodes to unbreak gfx6-7
3482- winsys/radeon: fix the scratch buffer on gfx6-7
3483- winsys/radeon: set more radeon_info fields
3484- ac/gpu_info: give has_msaa_sample_loc_bug a more accurate name
3485- ac/surface: move CB format translation helpers here
3486- ac/surface: move determing ADDR_FMT_* into a helper function
3487- ac/llvm: clean up translation of nir_intrinsic_load_invocation_id
3488- ac/llvm: clean up visit_load_local_invocation_index and visit_load_subgroup_id
3489- ac/llvm: use LLVM 0/1 constants from ac_llvm_context instead of LLVMConstInt
3490- radeonsi/gfx11: fix alpha-to-coverage with blending
3491- radeonsi: reorder code in si_texture_create_object as preparation for the future
3492- radeonsi: cosmetic changes in si_shader.h
3493- radeonsi: remove the gl_SampleMask FS output if MSAA is disabled
3494- radeonsi: don't enable WGP_MODE because of high cost of workgroup mem coherency
3495- radeonsi: move emitting draws states out of si_emit_all_states
3496- radeonsi/gfx11: use DISABLE_FOR_AUTO_INDEX to disable non-indexed prim restart
3497- radeonsi: reduce the supported compute grid size
3498- radeonsi: update test results and flakes
3499- radeonsi: re-enable fp16_rtz for compute blits to fix PBO tests on gfx11
3500- amd/addrlib: switch the license to the SPDX identifier MIT
3501- amd/addrlib: add ADDR_FMT_BG_RG_16_16_16_16
3502- ac/surface: fix is_linear for stencil-only surfaces
3503- ac/nir: handle DEPTH as PITCH in ac_nir_lower_resinfo
3504- radeonsi: implement setting a custom pitch to any multiple of 256B on gfx10.3+
3505- radv: implement setting a custom pitch to any multiple of 256B on gfx10.3+
3506- ac/surface: relax custom pitch requirements to any multiple of 256B on gfx10.3+
3507- ac/surface: fix R32G3B32 image format regression for gfx6-8
3508- ac/nir/ngg: always use load_initial_edgeflags_amd, choose the value in drivers
3509- amd: add radeon_info* into ac_llvm_context and radv_nir_compiler_options
3510- radeonsi: define si_shader_io_get_unique_index() values as SI_UNIQUE_SLOT_*
3511- radeonsi: remove gl_BackColor VS outputs on demand if color_two_side is disabled
3512- radeonsi: export non-zero edgeflags for GS and tess
3513- radeonsi/gfx11: extend DB_Z_INFO.NUM_SAMPLES programming to > GFX11
3514- radeonsi: print shader-db stats with AMD_DEBUG=vs,ps,stats
3515- radeonsi: use nir_lower_alu_to_scalar correctly
3516- radeonsi: remove a useless depth texture function call in a fast color clear
3517- radeonsi: add a gfx11 version of si_decompress_textures, add assertions < GFX11
3518- radeonsi: remove RADEON_FLAG_MALL_NOALLOC due to no use
3519- radeonsi: completely rewrite how VGT_SHADER_STAGES_EN is set
3520- radeonsi: unduplicate si_translate_format_to_hw
3521- radeonsi: decompress DCC for SDMA if we're really going to use SDMA
3522- radeonsi: increase SDMA gfx9+ limits
3523- radeonsi: split tracked_regs masks into context registers and other registers
3524- radeonsi: reorder and comment tracked registers
3525- radeonsi: move PA_CL_NGG_CNTL emission into rasterizer state
3526- radeonsi: always set sample locations even for 1x MSAA for simplicity
3527- radeonsi: adjust 16x EQAA sample locs to make PA_SU_PRIM_FILTER_CNTL immutable
3528- radeonsi: move PA_SU_SMALL_PRIM_FILTER_CNTL to the preamble when possible
3529- radeonsi: merge si_emit_msaa_sample_locs with si_emit_sample_locations
3530- radeonsi: rename the msaa_sample_locs state to sample locations
3531- radeonsi: optimize no-op primitive restart index changes thanks to index masking
3532- radeonsi: don't program COMPUTE_MAX_WAVE_ID (GDS register) on gfx6
3533- radeonsi: add helpers to create and clone a sized pm4 state
3534- radeonsi: add a separate gfx10_init_gfx_preamble_state function
3535- radeonsi: don't set registers set by CLEAR_STATE in the preamble for gfx10-11
3536- radeonsi: add a separate cdna_init_compute_preamble_state function
3537- radeonsi/ci: add gfx6 failures
3538- radeonsi: re-indent gfx10_create_sh_query_result_cs
3539- radeonsi: don't use SET_SH_REG_INDEX on gfx7-9
3540- radeonsi: don't use SET_SH_REG_INDEX if the kernel doesn't use CU reservation
3541- amd: remove unused PKT0 definitions
3542- treewide: use uint64_t / (u)intptr_t in image address calculations
3543- amd: drop support for LLVM 11
3544- amd: drop support for LLVM 12
3545- amd: drop support for LLVM 13
3546- amd: drop support for LLVM 14
3547- mesa: fix glBitmap in display lists when width <= 0 || height <= 0
3548- gallium/hud: append results to files instead of overwriting them
3549- radeonsi: don't convert L8A8 to R8A8 when blitting via compute to fix gfx7
3550- amd: update SET_*_REG_PAIRS* documentation and remove radeon_info options
3551- amd: improve the IB parser, parse more packets
3552- amd: rename mid_command_buffer_preemption_enabled -> register_shadowing_required
3553- amd: increase the attribute ring size on gfx1103_r1
3554- amd: don't set PA_RATE_CNTL because it has no effect
3555- amd: fix GPU cache sizes retrieved from the kernel
3556- amd: remove non-shadowed register tables
3557- amd: remove ac_check_shadowed_regs
3558- amd: add a new helper that prints all non-shadowed regs
3559- amd: update shadowed register tables for gfx11
3560- amd: skip redundant PKT3_NUM_INSTANCES even with register shadowing
3561- amd: skip redundant INDEX_TYPE even with register shadowing
3562- radeonsi: set register_shadowing_enabled if AMD_DEBUG=shadowregs is set
3563- radeonsi/ci: add glx@glx-visuals-stencil to skips because it gets stuck often
3564- radeonsi: fix RB+ and gfx11 issues with framebuffer state
3565- radeonsi: change si_emit_derived_tess_state into a state atom
3566- radeonsi: shrink the last field of tcs_offchip_layout due to LDS limit
3567- radeonsi: don't do PFP_SYNC_ME before CP DMA and compute blits
3568- radeonsi: don't needlessly invalidate L0/L1 caches at the beginning of IBs
3569- radeonsi: add more variables into si_pm4_state and rework how it's created
3570- radeonsi: remove sscreen parameter from si_pm4_set_reg_idx3
3571- radeonsi: set non-graphics uconfig registers first in the preamble
3572- radeonsi: handle demoted si_pm4_set_reg_idx3 as si_pm4_set_reg
3573- radeonsi: eliminate redundant compute SH register changes
3574- radeonsi: handle VGT_GS_OUT_PRIM_TYPE like a tracked register
3575- radeonsi: handle VGT_LS_HS_CONFIG like a tracker register
3576- radeonsi: handle GE_CNTL and IA_MULTI_VGT_PARAM as a tracked register
3577- radeonsi: remove gfx10 NGG streamout
3578- ci: remove clang-format testing
3579- intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR
3580- radeonsi: move TCS.gl_PatchVerticesIn into the tcs_offchip_layout SGPR
3581- radeonsi: replace tcs_out_lds_layout with nearly identical tes_offchip_addr
3582- radeonsi: move the only tcs_out_lds_offsets field to vs_state_bits
3583- radeonsi: eliminate redundant TCS user data and RSRC2 register changes
3584- radeonsi/gfx11: use SET_*_REG_PAIRS_PACKED packets for pm4 states
3585- radeonsi: determine si_pm4_state::reg_va_low_idx automatically
3586- radeonsi: keep pipeline statistics disabled when they are not used
3587- radeonsi: don't do BREAK_BATCH for context regs with only 1 context per batch
3588- radeonsi: use si_pm4_create_sized for the shadowing preamble
3589- radeonsi: remove radeon_winsys::cs_set_preamble
3590- radeonsi: remove uses_reg_shadowing parameter from si_init_gfx_preamble_state
3591- radeonsi/gfx11: fix GLCTS with register shadowing by keeping the CS preamble
3592- radeonsi/gfx11: enable register shadowing by default
3593- radeonsi: reorder compute code to prepare for packed SET_SH_REG packets
3594- radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for gfx by buffering reg writes
3595- radeonsi/gfx11: use SET_SH_REG_PAIRS_PACKED for compute by buffering reg writes
3596- radeonsi: clean up query functions, make them static, remove forward decls
3597- radeonsi: declare compiler[] and nir_options as pointers to reduce #includes
3598- radeonsi: clean up #includes
3599- Revert "egl: return correct error for EGL_KHR_image_pixmap"
3600- vbo: correctly restore _VaryingInputs for display list fast path
3601- radeonsi/gfx11: only use SET_*_PAIRS* packets on dGPUs
3602- radeonsi: fix gfx9 regression causing GPU hangs
3603- radeonsi/gfx11: fix a regression with PAIRS packets due to shader changes
3604- Revert "ac/nir/ngg: Follow intrinsic sources when analyzing before culling."
3605- glthread: determine global locking once every 64 batches to fix get_time perf
3606- mesa: fix 38% decrease in display list performance of Viewperf2020/NX8_StudioAA
3607- util/u_queue: fix util_queue_finish deadlock by merging lock and finish_lock
3608- radeonsi: fix a CDNA regression breaking compute
3609- Revert "ac: don't call ac_query_pci_bus_info from ac_query_gpu_info"
3610
3611Mark Collins (1):
3612
3613- ir3/a7xx: Add definitions for (last) src GPR attribute
3614
3615Mark Janes (8):
3616
3617- intel/dev: update mesa_defs.json from defect database
3618- intel/dev: report stepping for TGL systems
3619- intel/dev: switch defect identifiers to use lineage numbers
3620- isl: use generated workaround helpers for Wa_1806565034
3621- iris: convert Wa_14010455700 to use workaround mechanism
3622- anv: convert Wa_14010455700 to use workaround mechanism
3623- intel: use generated helpers for Wa_1508744258
3624- intel/dev: update mesa_defs.json from defect database
3625
3626Martin Roukala (né Peres) (26):
3627
3628- radv/ci: disable the vkcts-navi21-llvm-valve job
3629- radv/ci: document all the flakes we hit while I was away
3630- ci/b2c: allow not specifying a reboot condition
3631- radv/ci: only reboot on hangs for vkcts-navi10-valve
3632- zink/ci: document that some tests no longer fail
3633- zink/ci: mark 77 multisample-related tests as fixed
3634- radv/ci: document another vkcts flake on vega10
3635- radv/ci: document a series of recent regressions
3636- zink/ci: document recent fixes on RADV
3637- zink/ci: document new flakes on RADV
3638- radv/ci: document more flakes for navi21
3639- radv/ci: switch to b2c v0.9.10
3640- ci/b2c: update to mesa-trigger:2023-03-08.1
3641- zink/ci: add more QBO-related fails on RADV
3642- amd/ci: add another test to the vkcts-vega10 flake list
3643- zink/ci: remove spec@nv_shader_atomic_int64@* from the fail lists
3644- ci: bring back the valve farm
3645- ci/b2c: select the DUT to run on by name
3646- radv/ci: use the low-priority runners for vangogh jobs
3647- ci/b2c: change the default first-console-activity timeout to 2 minutes
3648- zink/ci: add more tests to the flake list of vangogh
3649- zink/ci: enable zink-radv-vangogh-valve for pre-merge testing
3650- Revert "ci: mark the valve farm as down"
3651- amd/ci: temporarily disable some manual jobs that take a long time to run
3652- zink/ci: remove 3 tests from the fails list
3653- Revert "amd/ci: temporarily disable some manual jobs that take a long time to run"
3654
3655Martin Stransky (1):
3656
3657- llvmpipe: fix UAF in lp_scene_is_resource_referenced.
3658
3659Matt Coster (57):
3660
3661- pvr: Complete pvr_isp_ctrl_stream()
3662- pvr: Fully declare support for VK_EXT_private_data
3663- pvr: Remove false assumption from pvr_write_draw_indirect_vdm_stream()
3664- pvr: Fixup format features
3665- pvr: Unmap mapped memory on free
3666- pvr: Correctly validate PBE accum format
3667- pvr: Actually check for depth load when setting up load op constants
3668- pvr: Initialize aspect_mask when creating buffer views
3669- pvr: Correctly compile graphics pipelines without a fragment shader
3670- pvr: Fix off-by-one in pvr_cmd_buffer_upload_desc_set_table() assert
3671- pvr: Remove unneeded assert in pvr_get_hw_clear_color()
3672- pvr: Set output_offset correctly in pvr_clear_color_attachment_static()
3673- pvr: Return correct pbe_accum_format size for A2B10G10R10_UINT_PACK32
3674- pvr: Remove bad assert in pvr_clear_attachments()
3675- pvr: Add PVR_DEBUG=vk_desc option to dump descriptor set layouts
3676- pvr: Simplify descriptor set layout dump separators
3677- pvr: Return VkResult from pvr_winsys_create()
3678- pvr: Propagate errors as VkResults from ioctls through winsys
3679- pvr: Fix incorrect error return in pvr_ctx_sr_programs_setup()
3680- pvr: Fix incorrect error handling in pvr_render_ctx_switch_init()
3681- pvr: Squeeze fd handling into winsys layer
3682- pvr: Drop pdevice from pvr_physical_device_get_supported_extensions()
3683- pvr: Rename primary_{device,fd,path} to display_*
3684- pvr: Use common physical device enumeration
3685- pvr: Assorted cleanup
3686- pvr: Return VkResult from winsys buffer_map operation
3687- pvr: Fix allocation scopes in vkCreateRenderPass2() code path
3688- pvr: Fix memory leaks on realloc failure in pvr_pipeline.c
3689- pvr: Correct error flow in pvr_graphics_pipeline_compile()
3690- pvr: Correct error flow in pvr_compute_pipeline_compile()
3691- pvr: Use correct surface for deferred RTA clear
3692- pvr: Rename shadowing loop variable in pvr_add_deferred_rta_clear()
3693- pvr: Do not free deferred pvr_transfer_cmd instances
3694- pvr: Fix out of range stream errors for geometry-only jobs on pvrsrvkm
3695- pvr: Reorder execution in pvr_cmd_buffer_end_sub_cmd()
3696- pvr: Fix page faults in occlusion query tests
3697- pvr: Fix rect splitting logic in pvr_unwind_rects()
3698- pvr: Use correct pbe format for VK_FORMAT_A8B8G8R8_UNORM_PACK32
3699- pvr: Use common vkGetPhysicalDeviceFeatures2() implementation
3700- pvr: Fix segfault in pvr_physical_device_init()
3701- pvr: Move pvr_get_isp_num_tiles_xy() to rogue_hw_utils.h
3702- pvr: Use pvr_sub_cmd_event union members directly
3703- pvr: Add wait_on_previous_transfer flag to graphics subcommand
3704- pvr: Cleanup in pvr_process_cmd_buffer()
3705- pvr: Add pvr_image_view_get_image()
3706- pvr: Publicise some static functions from pvr_blit.c
3707- pvr: Rename ds_{image,iview} in pvr_gfx_sub_cmd_job_init()
3708- pvr: Implement ZLS subtile alignment
3709- pvr: Correct calculations in pvr_unwind_rects()
3710- pvr: Refactor pvr_unwind_rects()
3711- pvr: Allow S8_UINT to be used as a stencil attachment format
3712- pvr: Don't overwrite PDS vertex input flags
3713- pvr: Declare dependency on idep_mesautil
3714- pvr: Add support for sampler border colors
3715- pvr: Correctly read dynamic state setup during blend constant setup
3716- pvr: Advance entry pointer in pvr_setup_vertex_buffers()
3717- pvr: Rename transfer 3D heap to transfer frag heap
3718
3719Matt Turner (13):
3720
3721- intel: Disable shader cache when executing intel_clc during the build
3722- u_format: Use memcpy to avoid unaligned accesses
3723- meson: Remove reference to removed SWR driver
3724- anv: Pipe anv_physical_device to anv_get_image_format_features2
3725- anv: Only expose video decode bits with KHR_video_decode_queue
3726- intel: Rearrange for next commit
3727- intel: Consider with_intel_clc in with_any_intel
3728- intel: Only build blorp if drivers are enabled
3729- intel: Only build ds if drivers are enabled
3730- intel: Only build perf if drivers or tools are enabled
3731- intel: Allow using intel_clc from the system
3732- intel: Limit Intel Vulkan RT to x86_64
3733- Revert "intel/fs: only avoid SIMD32 if strictly inferior in throughput"
3734
3735Matthieu Bouron (1):
3736
3737- lavapipe: honor dst base array layer when resolving color attachments
3738
3739Michael Tretter (2):
3740
3741- panfrost: remove BO from cache before closing GEM
3742- kmsro: assert that scanout refcount is larger than 0
3743
3744Michel Dänzer (17):
3745
3746- ci: Explicitly test for meson feature checks in compiler wrapper
3747- ci: Use set -e in frontend compiler wrapper scripts.
3748- ci: Remove shebang from backend compiler wrapper script
3749- ci: Drop executable permissions from backend compiler wrapper script
3750- tgsi: Make ureg_DECL_output_masked definition match its declaration
3751- llvmpipe: Make lp_build_interp_soa declaration match its definition
3752- mesa/st: Make st_convert_image(_from_unit) declaration match definition
3753- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definition
3754- anv/format: Fix GetPhysicalDeviceSparseImageFormatProperties definition
3755- vulkan: Fix GetPhysicalDeviceSparseImageFormatProperties definitions
3756- svga: Make vmw_svga_winsys_buffer_map definition match declaration
3757- svga: Make declaration of emit_input_declaration match definition
3758- clover/llvm: Use llvm::DataLayout::getABITypeAlign with LLVM >= 16
3759- clover/llvm: Use std::nullopt already with LLVM 16
3760- ci: Drop -Wno-error=array-bounds from fedora-release job
3761- ci: Upgrade fedora-release job to Fedora 38
3762- ci: Enable rusticl in the fedora-release job
3763
3764Michel Zou (4):
3765
3766- vulkan/wsi: fix -Wnarrowing warning
3767- vk/entry_points:: fix mingw build
3768- mesa/draw: fix -Wformat warning
3769- util: reinstate ENUM_PACKED
3770
3771Mihai Preda (1):
3772
3773- nir: update nir->num_inputs, num_outputs in nir_recompute_io_bases()
3774
3775Mike Blumenkrantz (364):
3776
3777- mesa/st/program: don't init xfb info if there are no outputs
3778- zink: remove atomics from zink_query
3779- zink: pass ctx through query destroy paths
3780- zink: always defer query pool deletion
3781- zink: remove screen param from zink_prune_query()
3782- util/cpu: add big.LITTLE cpu detection
3783- driconf: rework glthread enablement
3784- glthread: disable by default with fewer than 4 (big) CPUs
3785- zink: move memoryTypeIndex selection down in general bo allocation
3786- zink: slightly rework memoryTypeIndex selection to pre-determine heap
3787- zink: restore BAR allocation failure demotion
3788- zink: make general bo allocation more robust by iterating
3789- zink: avoid zero-sized memcmp for descriptor layouts
3790- iris: use util_framebuffer_get_num_samples when setting ps dispatch samples
3791- nir/lower_alpha_test: rzalloc state slots
3792- zink: fix non-db bindless texture buffers
3793- util/blitter: fix line wrapping on error to avoid giving wrong line number
3794- glthread: add newline to env override
3795- zink: emit demote cap when using demote
3796- zink: only print copy box warning once per resource
3797- zink: hook up debug callback
3798- zink: use a perf_debug() macro for debug message logging of copy box warning
3799- util/debug: move null checks out of debug message macro
3800- zink: manually re-set framebuffer after msrtss replicate blit
3801- zink: handle 'blitting' flag better in msrtss replication
3802- zink: skip msrtss replicate if the attachment will be full-cleared
3803- zink: avoid recursion during msrtss blits from flushing clears
3804- zink: don't bitcast bool deref loads/stores
3805- zink: zink_shader_free -> zink_gfx_shader_free
3806- zink: split out generic shader destruction for reuse
3807- zink: always wait on precompile fence at start of zink_gfx_shader_free()
3808- zink: call zink_shader_free for compute shaders
3809- zink: add a util function for printing shaders
3810- zink: don't create separate shader dsls if there are no bindings
3811- drisw: don't leak the winsys
3812- zink: check for extendedDynamicState3DepthClipNegativeOneToOne for ds3 support
3813- mesa/st: try to block multisampled texsubimage from doing cpu writes
3814- mesa: fix ms fallback texture creation
3815- draw: fix viewmask iterating
3816- zink: use tes to generate tcs
3817- zink: hook up EXT_shader_object
3818- zink: wrap zink_shader_compile_separate() return
3819- zink: wrap return of compile_module()
3820- zink: make zink_shader_spirv_compile static
3821- zink: more zink_shader_object conversion
3822- zink: use zink_shader_object for precompiled separate shaders
3823- zink: minor whitespace cleanup
3824- zink: move separate shader dsl creation to compiler function
3825- zink: add a 'separate' flag to shader module compile to indicate separate shaders
3826- zink: run bo lowering passes for separate shader compile with uniform inlining
3827- zink: remove redundant compute program batch ref
3828- zink: use EXT_shader_object to (re)implement separate shaders
3829- zink: add validation exceptions for shader object extension enable
3830- zink: don't pin flush queue threads if no threads exist
3831- zink: add z32s8 as mandatory GL3.0 profile attachment format
3832- zink: add a driver workaround to disable background compiles
3833- nir/gs: fix array type copying for passthrough gs
3834- zink: fix array copying in pv lowering
3835- gallivm: break out native vector width calc for reuse
3836- llvmpipe: do late init for llvm builder
3837- zink: print the type of shader when dumping
3838- zink: use intermediate variable for separate shader descriptor update loop
3839- zink: use intermediate variable for separate shader db resize check
3840- zink: simplify separate shader prog init a little
3841- zink: streamline separate shader descriptor update
3842- zink: switch to a regular loop to wait on precompile shader fences
3843- zink: move some shader CSO functions around
3844- zink: assign separate shader prog stages from ctx->shader_stages
3845- zink: use a more standardized loop for initing separate shader program descriptors
3846- zink: move separate shader creation to shader CSO creation
3847- zink: handle all stages in fixup_io_locations()
3848- zink: fix longstanding TODO for generated tcs
3849- zink: use EXT_shader_object to implement generic separate shader precompile
3850- bump VVL to 1.3.248
3851- zink: prune some validation errors from ci
3852- zink: break out VkImageViewUsageCreateInfo applying for reuse
3853- zink: reapply VkImageViewUsageCreateInfo when rebinding a surface
3854- zink: add a workaround for a nir_assign_io_var_locations bug
3855- zink: don't run update_so_info if shader has no outputs
3856- zink: add ZINK_DEBUG=noshobj to disable EXT_shader_object
3857- zink: rename 'separate' param in shader compilation to 'can_shobj'
3858- zink: explicitly block sample shading in the GPL precompile path
3859- zink: add zink_program::uses_shobj for managing shader object binds
3860- zink: use local screen var in zink_gfx_program_update_optimal()
3861- zink: deduplicate separable program replacement handling
3862- zink: delete redundant conditional
3863- zink: use zink_shader_object for zink_shader_module
3864- zink: use zink_destroy_shader_module() for compute to deduplicate code
3865- zink: store spirv onto zink_shader_object structs
3866- zink: allow zink_shader_module to be either a shobj or a mod using a bool
3867- zink: avoid accessing zink_gfx_program::modules during pipeline compile
3868- zink: add a union to zink_gfx_pipeline_cache_entry for gpl
3869- zink: use zink_shader_object for pipeline compiles from zink_gfx_program
3870- zink: make zink_shader_spirv_compile public
3871- zink: enable EXT_shader_object for generic precompiles
3872- draw: fix robust ubo size calc
3873- ci: disable all a306/a530/a630 jobs
3874- llvmpipe: fix native vector width init
3875- zink: update amdpro fails
3876- zink: add extendedDynamicState3DepthClipNegativeOneToOne to profile
3877- zink: only unset a generated tcs if the bound tcs is the generated one
3878- Revert "zink: don't create separate shader dsls if there are no bindings"
3879- zink: disable a630 traces
3880- zink: set depth dynamic state values unconditionally
3881- zink: null some descriptor buffer pointers during destruction
3882- zink: sync queries at the end of cmdbufs
3883- cso: unbind fb state when unbinding the context
3884- i915: use util_copy_framebuffer_state to set fb state
3885- i915: use util_unreference_framebuffer_state to unref fb state
3886- iris: use util_unreference_framebuffer_state to unref fb state
3887- softpipe: use util_unreference_framebuffer_state to unref fb state
3888- v3d: use util_unreference_framebuffer_state to unref fb state
3889- vc4: use util_unreference_framebuffer_state to unref fb state
3890- llvmpipe: use util_unreference_framebuffer_state to unref fb state
3891- svga: use util_unreference_framebuffer_state to unref fb state
3892- zink: move EXT_shader_object check to another place
3893- zink: break out optimal key handling into separate function
3894- zink: disable EXT_shader_object if !optimal_keys
3895- zink: add ZINK_DEBUG=optimal_keys
3896- gallium: pipe_rasterizer_state::point_tri_clip -> point_line_tri_clip
3897- aux/draw: guard_band_points_xy -> guard_band_points_lines_xy
3898- aux/draw: add guardband clipping for lines
3899- zink: don't init mutable resource bit for swapchain images
3900- zink: don't init mutable for swapchain src during blit
3901- tgsi_to_nir: handle PIPE_CAP_NIR_COMPACT_ARRAYS for clipdistance
3902- zink: allow vk 1.2 timelineSemaphore feature if extension isn't supported
3903- zink: stringify unsupported prim restart log error
3904- zink: delete persistent map tracking
3905- zink: add PERSISTENT for db buffer maps
3906- zink: delete unnecessary pipeline stage flags from inference
3907- zink: use an intermediate variable for binding ssbo slots
3908- zink: unbind the ssbo slot being iterated, not the index of the buffer
3909- zink: flush INDIRECT_BUFFER mem barrier for compute
3910- zink: disable batched unordered barries with ZINK_DEBUG=noreorder
3911- zink: block batching of unordered barriers if previous usage was write
3912- zink: fix uncached memory readback
3913- glsl/lower_samplers_as_deref: apply bindings for unused samplers
3914- vulkan/runtime: add VK_DYNAMIC_STATE_ATTACHMENT_FEEDBACK_LOOP_ENABLE_EXT
3915- zink: add ZINK_DEBUG=noopt
3916- zink: add ZINK_DEBUG=nobgc
3917- zink: make mesa_logw separate from perf_debug
3918- zink: add perf_debug for "interesting" shader compiles
3919- zink: set debug callback on context
3920- zink: bind bindless db set when updating separate shader db sets
3921- zink: compare desc set to detect bindless vars in separate shaders
3922- zink: adjust bindless texel buffer handle before indexing
3923- zink: block more flushes during unordered blits
3924- zink: also cache swapchain semaphores
3925- zink: disable always zs feedback loop on radv
3926- zink: add back some anv qbo flakes
3927- zink: disable have_EXT_vertex_input_dynamic_state without EDS2
3928- zink: disable dynamic state exts if the previous ones aren't present
3929- zink: add some ci flakes
3930- zink: don't leak swapchain readback semaphores
3931- zink: destroy current batch state after all other batch states
3932- zink: reorder some native blit code
3933- zink: reject blits where src/dst is 3D and dst/src z!=0
3934- zink: reorder some image copy code
3935- zink: ignore no-op image copies
3936- zink: only add feedback loop usage bit if extension is supported
3937- lavapipe: EXT_attachment_feedback_loop_layout_dynamic_state
3938- zink: slightly simplify bda allocation chaining
3939- zink: hook up some memory extensions
3940- zink: set higher prio on dedicated memory allocations
3941- zink: flag batch usage on swapchain images
3942- vulkan/wsi: add feedback loop usage to swapchain caps if supported
3943- zink: add feedback loop usage for swapchains
3944- vtn: add spirv index to type mismatch error for debugging
3945- vtn: print spirv id for type mismatch error
3946- vtn: print spirv ids for type mismatch in bcsel
3947- vtn: add more info to bitcast bit size error message
3948- zink: try update fb resource refs when starting new renderpass
3949- zink: add special-casing for (not) reordering certain image barriers
3950- zink: use batch usage function for a simple case
3951- zink: move zink_batch_state::submit_count to zink_batch_usage
3952- zink: move batch usage to substruct on zink_bo objects
3953- zink: track/check submit info on resource batch usage
3954- zink: disable unordered blits when swapchain images need aqcuire
3955- zink: explicitly disable reordering after restricted swapchain readback blits
3956- zink: explicitly disable promotion on images that are both unflushed and non-reorderable
3957- zink: flag 'has_work' on batch when promoting a cmd
3958- lavapipe: more correctly handle null pipeline states
3959- anv: more correctly handle null pipeline states
3960- vk/graphics_state: handle null pipeline state structs in creation
3961- zink: promote flushed clears to unordered cmdbuf when possible
3962- zink: also declare int size caps inline with signed int type usage
3963- zink: delete unnecessary bitcast in load_shared/scratch
3964- zink: use void return for store_dest
3965- zink: move get_alu_type() up in file
3966- zink: manually memcpy the spirv instruction buffer
3967- zink: write out register variables to a separate spirv buffer
3968- zink: dynamically emit non-bool register values using local_vars spirv buffer
3969- zink: store and use alu types for ntv defs
3970- zink: infer types from load_const instrs to avoid more bitcasts
3971- lavapipe: bump memory allocation heap to 3GiB
3972- lavapipe: report full memory in heap for 64bit processes
3973- lavapipe: EXT_memory_budget
3974- lavapipe: EXT_memory_priority
3975- lavapipe: store memory allocation size onto lvp_device_memory
3976- lavapipe: VK_EXT_pageable_device_local_memory
3977- zink: don't wait on queue thread if disabled
3978- zink: use the per-context track_renderpasses flag in more places
3979- zink: don't remove psiz from linked shaders if the consumer reads it
3980- zink: don't propagate psiz in quads emulation gs
3981- lavapipe: VK_EXT_dynamic_rendering_unused_attachments
3982- zink: require EXT_dynamic_rendering_unused_attachments for dynamic rendering
3983- zink: explicitly avoid ci errors due to unrecognized extensions in VVL
3984- vulkan: reorder vk_cmd_queue_entry
3985- vulkan/cmd_queue: allocate cmds based on the size of the cmd
3986- vulkan/cmd_queue: expose cmd sizes
3987- vulkan: use cmd size array for queued cmd allocations
3988- ci: uprev VVL to 1.3.251
3989- lavapipe: fix DS3 min sample setting
3990- lavapipe: bump max push constant size
3991- lavapipe: stop setting patch vertices constantly
3992- lavapipe: don't pass indirect info in streamout draws
3993- draw: add (disabled) vertex dumping for non-linear emit
3994- lavapipe: fix memory budget reporting
3995- zink: also disable bg compile for compute with nobgc
3996- zink: hook up VK_EXT_attachment_feedback_loop_dynamic_state
3997- zink: use dynamic state for feedback loops when available
3998- zink: enable EXT_shader_object globally with have_EXT_attachment_feedback_loop_dynamic_state
3999- zink: add a ci flake
4000- lavapipe: pass list to cmdbuf exec, not cmdbuf
4001- lavapipe: add a mapping for BDA
4002- lavapipe: add a zeroed buffer that can be bound in place of an index buffer
4003- lavapipe: handle index buffers with offsets for indirect draws
4004- lavapipe: NV_device_generated_commands
4005- zink: combine some rast state draw conditionals
4006- zink: don't check prog->shaders when creating gfx pipeline
4007- zink: check for cached mem correctly when mapping buffer
4008- zink: remove assert for dt in zink_kopper_update
4009- zink: stop swizzling conditional render during batch flush
4010- zink: update some radv qbo fails
4011- radv: tweak gfx pipeline stage binding
4012- zink: only try to create srgb mutable images if the vk format is supported
4013- vk: make vk_format_map[] public
4014- radv: directly use vk_format_map for vertex input
4015- lavapipe: use PACKAGE_VERSION for cache uuid in release builds
4016- zink: massively shrink qbo size for timestamp queries
4017- zink: assert that ntv image creation isn't clobbering existing images
4018- zink: add some ntv asserts for ms txf
4019- zink: add a dgc debug mode for testing
4020- lavapipe: add version uuid to shader binary validation
4021- egl/dri2: trigger drawable invalidation from surface queries for zink
4022- zink: add some ci flakes
4023- zink: break out vk flag unrolling into util function
4024- zink: add mem debugging
4025- zink: remove redundant conditional in set_sampler_views
4026- zink: wrap format mismatch checks for blit/surface
4027- zink: add srgb mutable for all resources by default
4028- zink: drop dt checks for mutable format init
4029- zink: strip format list when disabling mutable during image creation
4030- dri3: only invalidate drawables on geometry change if geometry has changed
4031- zink: more anv ci flakes
4032- aux/trace: add methods for mesh shaders
4033- lavapipe: more fixes for sample shading
4034- lavapipe: fix shader binary binding with mesh shaders
4035- lavapipe: correctly update shader object per-stage push constant sizes
4036- zink: add COHERENT requirement for CACHED memory
4037- zink: ZINK_HEAP_HOST_VISIBLE_CACHED -> ZINK_HEAP_HOST_VISIBLE_COHERENT_CACHED
4038- zink: fix anv ci flake wildcarding
4039- aux/pipebuffer: add a return to pb_slabs_reclaim()
4040- aux/pipebuffer: add a return to pb_cache_release_all_buffers()
4041- zink: only retry bo allocation after reclaim if reclaims actually happened
4042- zink: fix ubo array sizing in ntv
4043- zink: acquire persistently bound swapchain descriptors before setting usage
4044- zink: recache present semaphores
4045- zink: always clamp NUM_QUERIES to 500
4046- zink: radv vangogh ci updates
4047- radv: remove redundant intermediate variable in radv_is_mrt0_dual_src()
4048- radv: inline radv_can_enable_dual_src()
4049- zink: no-op redundant samplemask changes
4050- zink: force inlining for a bunch of functions
4051- zink: make invalidate_descriptor_state a ctx hook
4052- zink: specialize invalidate_descriptor_state hook for compact mode
4053- zink: clean up rp update tracking on dsa bind
4054- zink: use local screen var in blend state bind
4055- zink: track and apply ds3 states only on change
4056- zink: don't update tc info directly from cso binds
4057- zink: check sampler views pointer before loop
4058- zink: add fastpaths for no-op sampler/view rebinds
4059- nir/lower_tex: ignore saturate for txf ops
4060- radv: pre-init surface info
4061- ci: add a test-dozen-deqp flake
4062- lavapipe: handle multiview queries
4063- zink: fix assert for inline uniform invalidation with generated gs bound
4064- zink: fix unbinding generated gs on real gs bind
4065- zink: get new bda when rebinding invalidated buffers
4066- lavapipe: create a desc set for immutable sampler layouts
4067- lavapipe: split out descriptor stage setting
4068- lavapipe: EXT_descriptor_buffer
4069- lavapipe: VK_EXT_mutable_descriptor_type
4070- llvmpipe: flush/reference fs ubos on bind
4071- zink: do initial program unref during program creation
4072- zink: fix separate shader program refcounting
4073- docs: update lavapipe extensions
4074- zink: don't destroy swapchain on initial CreateSwapchainKHR fail
4075- aux/trace: fix bindless texture dumping
4076- vk/wsi/x11: move surface alpha check from get_caps to creation
4077- vk/wsi/x11: handle geometry updating more asynchronously
4078- vk/wsi/x11: stop roundtripping on presentation
4079- vk/wsi: unify dmabuf exporting
4080- vk/wsi: add error logging for syncfile import/export failures
4081- zink: fix anv ci flakes (for real this time)
4082- zink: fix batch disambiguation on first submit
4083- zink: set pipeline dynamic state count after all dynamic states are set
4084- zink: be even dumber about buffer refs when replacing storage
4085- zink: emit SpvCapabilitySampleMaskPostDepthCoverage with SpvExecutionModePostDepthCoverage
4086- zink: fix the fix for separate shader program refcounting
4087- kopper: handle pixmap creation failure more gracefully
4088- glxsw: check geometry of drawables on creation
4089- zink: don't clobber descriptor mode on multiple screen creation
4090- nir: fix slot calculations for compact variables with location_frac
4091- lavapipe: use the component offset directly for xfb
4092- glsl: only explicitly check GS components in PSIZ injection with output variables
4093- lavapipe: don't check geometry for fb attachments
4094- zink: better handle separate shader dsl creation when no bindings exist
4095- zink: force image barriers after dmabuf import
4096- zink: use VK_WHOLE_SIZE when binding null db buffer descriptors
4097- zink: unset line stipple ds3 state flags when stipple not available
4098- nir/lower_io_to_scalar: fix 64bit io splitting
4099- nir/linking_helpers: force type matching in does_varying_match
4100- zink: add batch refs for transient images
4101- zink: fix zs resolve attachment indexing
4102- zink: don't add VK_IMAGE_USAGE_ATTACHMENT_FEEDBACK_LOOP_BIT_EXT for transient images
4103- zink: don't append msrtss to dynamic render if not supported
4104- zink: set msrtss depth resolve mode when enabled
4105- zink: add more locking for pipeline cache
4106- aux/trace: fix winsys handle dumping
4107- zink: generated tcs is on the tes, not the vs
4108- llvmpipe: block weird uses of subsampled formats in buffers
4109- llvmpipe: fix early depth + alpha2coverage + occlusion query interaction
4110- lavapipe: fix resolves where src image has a layer offset
4111- lavapipe: block yuv formats from getting blit feature flags
4112- zink: explicitly set non-optimal last_vertex_stage shader key on ctx create
4113- zink: fix big tcs output io
4114- zink: fix crash in lower_pv_mode_gs_store
4115- u/draw: skip zero-sized indirect draws
4116- nir/zink: fix gs emulation xfb_info sizing
4117- vk/graphics: fix CWE handling with DS3
4118- Revert "vk/wsi/x11: handle geometry updating more asynchronously"
4119- zink: wait on async fence during ctx program removal
4120- zink: don't start multiple cache jobs for the same program
4121- zink: disable validation
4122- zink: be more precise about flagging rp changes around unordered u_blitter
4123- zink: fix linear modifier dmabuf imports
4124- aux/tc: handle stride mismatch during rp-optimized subdata
4125- zink: always add a per-prog ref for gpl libs
4126- zink: set is_xfb=false for all i/o variables
4127- nir/inline_uniforms: fix oob access with nir_find_inlinable_uniforms
4128- aux/tc: fix staging buffer sizing for texture_subdata
4129- aux/tc: fix address calc for segmented texture subdata
4130- glsl: check for xfb setting xfb info
4131- aux/tc: fix renderpass tracking fb state clobber scenario
4132- aux/tc: fix rp info handling around tc_sync calls
4133- aux/tc: don't use pipe_buffer_create_with_data() for rp-optimized subdata
4134- zink: flag db maps as unsynchronized
4135- lavapipe: clamp cache uuid size
4136- tu: handle unused color attachments without crashing
4137- zink: propagate rp_tc_info_updated across unordered blits
4138- zink: move swapchain fence to swapchain object
4139- zink: avoid UAF on wayland async present with to-be-retired swapchain
4140- zink: always trace_screen_unwrap in acquire
4141
4142MouriNaruto (1):
4143
4144- dzn: Fix segmentation fault when Direct3D 12 user mode driver from at least one of GPUs is not available.
4145
4146MrRobbin (1):
4147
4148- zink: Move the workaround before the EDS setting.
4149
4150Mykhailo Skorokhodov (4):
4151
4152- mesa: Implement GL_CLEAR_TEXTURE flag
4153- mesa: Fallthrough GL_SRB_DECODE_ARB pname
4154- iris: Fix memory size with disabled resizable bar
4155- nir: Rematerialize derefs after opt_dead_cf
4156
4157Mykola Piatykop (1):
4158
4159- mesa: Fix use after free.
4160
4161Nanley Chery (28):
4162
4163- iris: Allocate ZEROED BOs for shared resources
4164- iris/bufmgr: Add and use zero_bo
4165- iris/bufmgr: Handle flat_ccs for BO_ALLOC_ZEROED
4166- intel/isl: Bump the MCS halign value for BDW+
4167- iris: Add a barrier to iris_mcs_partial_resolve
4168- intel: Implement ISL_AUX_OP_AMBIGUATE for MCS
4169- iris: Enable MCS init with ISL_AUX_OP_AMBIGUATE
4170- anv: Drop the MCS initialization performance warning
4171- anv: Enable MCS init with ISL_AUX_OP_AMBIGUATE
4172- intel/blorp: Assert an 8bpp fast clear restriction
4173- iris: Init CCS_E to COMPRESSED_NO_CLEAR for XeHP
4174- intel/blorp: Use the depth copy format more on BDW+
4175- intel/blorp: Add depth usage check for copy format
4176- intel/blorp: Change condition for CCS_E copy formats
4177- intel/blorp: Add and use blorp_copy_get_formats
4178- iris: Use known formats for tex_cache_flush_hack
4179- iris: Drop a GFX12_CCS_E check in can_fast_clear_color
4180- intel: Rename the GFX12_CCS_E aux-usage to FCV_CCS_E
4181- iris: Avoid extra CCS_E flushes for aux mode changes
4182- iris: Avoid FCV_CCS_E for shader image accesses
4183- iris: Assert against FCV_CCS_E for blitter writes
4184- intel/blorp: Avoid 32bpc fast clear sampling issue
4185- Revert "iris: Add missed tile flush flag"
4186- iris: Drop the RT flush for PIPE_BARRIER_TEXTURE
4187- iris: Drop GPGPU Tex Invalidate restriction for TGL+
4188- isl: Add and use size and alignment calculators
4189- anv: Don't support ASTC images with modifiers
4190- intel/blorp: Ambiguate after CCS resolves on gfx7-8
4191
4192Oskar Rundgren (20):
4193
4194- pvr: Allow block compressed source blit
4195- pvr: Transfer PBE source snorm format should be signed
4196- pvr: Transfer PBE gamma is unset
4197- pvr: Transfer fix blit with multiple emits
4198- pvr: Transfer multiple emits clip rectangle
4199- pvr: Add back S8_UINT support
4200- pvr: Add PBE packmode for depth stencil formats
4201- pvr: Transfer add depth merge support for X8_D24
4202- pvr: Transfer add s8_uint support
4203- pvr: PBE fix mesa pipe swizzle conversion
4204- pvr: Transfer ignore non zero stride for twiddled surface
4205- pvr: Transfer block compressed with 3d twiddled layout
4206- pvr: Transfer support flipped rectangle mapping
4207- pvr: Transfer remove byte unwind workaround
4208- pvr: fix texel unwind workaround mappings
4209- pvr: Transfer check valid source address mask
4210- pvr: Transfer optimisation remove unused features from API
4211- pvr: Transfer image to buffer dest rect
4212- pvr: Fix transfer image clearing PBE packmodes
4213- pvr: add block compressed formats blit support
4214
4215Patrick Lerda (22):
4216
4217- r600: fix refcnt imbalance related to r600_set_vertex_buffers()
4218- r600: fix refcnt imbalance related to evergreen_set_shader_images()
4219- lima: fix refcnt imbalance related to framebuffer
4220- r600/sfn: fix memory leak related to sh_info->arrays
4221- aux/draw: fix memory leak related to ureg_get_tokens()
4222- crocus: fix refcnt imbalance related to framebuffer
4223- crocus: fix refcnt imbalance related to crocus_create_surface()
4224- r600: fix refcnt imbalance related to atomic_buffer_state
4225- radeonsi: set proper drm_amdgpu_cs_chunk_fence alignment
4226- crocus: fix scratch_bos memory leak
4227- mesa: fix refcnt imbalance related to egl_image_target_texture()
4228- glthread: fix typo related to upload_vertices()
4229- mesa: fix refcnt imbalance related to _mesa_delete_semaphore_object()
4230- mesa/st: fix refcnt imbalance related to st_feedback_draw_vbo()
4231- mesa/st: fix buffer overflow related to set_program_string()
4232- r600: fix r600_draw_vbo() buffer overflow
4233- nouveau: fix nouveau_heap_destroy() memory leak
4234- r600: fix cayman_convert_border_color() swizzle behavior
4235- util/blitter: fix util_blitter_clear_buffer() refcnt imbalance
4236- util/blitter: revert util_blitter_clear_buffer()
4237- radeonsi: fix refcnt imbalance related to util_blitter_save_fragment_constant_buffer_slot()
4238- panfrost: fix refcnt imbalance related to blitter
4239
4240Paul Gofman (2):
4241
4242- driconf: add a workaround for Captain Lycop: Invasion of the Heters
4243- driconf: add a workaround for Rainbow Six Extraction
4244
4245Paulo Zanoni (9):
4246
4247- iris: Store prime fd of external bos for Xe KMD
4248- iris: Add functions to import and export implicit sync state
4249- iris: Extend iris_bo_wait_syncobj() to wait on external implicit syncobj
4250- iris: Add iris_implicit_sync struct and functions to do implicit synchronization for Xe kmd
4251- iris: also avoid isl_memcpy_linear_to_tiled for Tile64
4252- intel/isl: tile 64 calculations work with 1D surfaces
4253- iris: assert bufmgr->bo_deps_lock is held
4254- iris: avoid stack overflow in iris_bo_wait_syncobj()
4255- iris: assert(bo->deps) after realloc()
4256
4257Pavel Ondračka (33):
4258
4259- r300: fix unconditional KIL on R300/R400
4260- r300: add CI list of known rv370 dEQP failures
4261- r300: remove simple duplicate ARL instructions
4262- r300: fuse ROUND and ARL to ARR
4263- r300: remove nir round lowering
4264- r300: enable PIPE_CAP_TGSI_TEXCOORD
4265- r300: fail linking instead of using dummy shaders
4266- CODEOWNERS: add r300 driver
4267- r300: move nir stuff to r300_nir file
4268- r300: move the ARL merging pass up in the opt loop
4269- r300: move the ROUND+ARL->ARR fusing to main optimization loop
4270- r300: optimize the load A0 pattern from wined3d
4271- r300: remove duplicate ARRs
4272- r300: be more agressive when merging A0 loads
4273- r300: remove unused SIN/COS lowering
4274- r300: remove unused SSG lowering
4275- r300: move CEIL lowering to NIR
4276- r300: remove unused FLR lowering
4277- r300: remove unused POW lowering
4278- r300: remove unused DST lowering
4279- r300: remove unused ROUND lowering
4280- r300: remove unused LIT lowering
4281- r300: remove unused opcodes from r300_tgsi_to_rc
4282- nir_opt_algebraic: don't use i32csel without native integer support
4283- r300: add partial CMP support on R5xx
4284- r300: properly count maximum used register index
4285- r300: lower undefs to zero
4286- r300: add some early safe bool lowering
4287- r300: remove most of backend contant folding
4288- r300: disable ntt regalloc for vertex shaders
4289- r300: assert that every writer has a reader
4290- r300: update RV370 failures
4291- r300: don't abort on flow control when using draw for vs
4292
4293Philipp Zabel (1):
4294
4295- etnaviv: fix segfault after compile failure
4296
4297Pierre-Eric Pelloux-Prayer (13):
4298
4299- amd: update amdgpu_drm.h
4300- amd: determine info->has_fw_based_shadowing
4301- radeonsi: implement fw based mcbp
4302- amd: update amdgpu_drm.h
4303- radeonsi: stop reporting reset to app once gpu recovery is done
4304- winsys/amdgpu: add a helper function to submit a no-op job
4305- winsys/amdgpu: use the no-op helper to detect if reset completion
4306- mesa: don't share reset status across contexts
4307- mesa: remove unused bools
4308- llvmpipe: only include old Transform includes when needed
4309- Revert "gallium/u_threaded: buffer sharedness tracking"
4310- st/mesa: check renderbuffer before using it
4311- radeonsi: emit framebuffer state after allocating cmask
4312
4313Qiang Yu (119):
4314
4315- nir: add nir_load_barycentric_optimize_amd intrinsic
4316- radeonsi: implement nir_load_barycentric_optimize_amd
4317- ac/nir/ps: lower barycentric load when bc_optimize
4318- ac/nir/ps: add force lower barycentric load options
4319- ac/nir/ps: lower sample mask input when needed
4320- ac/llvm,radeonsi: lower ps color load in nir
4321- radeonsi: add si_nir_lower_ps_color_input
4322- radeonsi: add si_nir_emit_polygon_stipple
4323- radeonsi: handle lowered ps in scan_io_usage
4324- radeonsi: monolithic ps emit prolog in nir directly
4325- radeonsi: restructure mono merged shader build
4326- radeonsi: remove separate_prolog parameter
4327- radeonsi: add si_mark_divergent_texture_non_uniform
4328- ac/llvm,radeonsi: use texture non-uniform flag as waterfall switch
4329- nir,ac/llvm,radeonsi: replace nir_load_smem_buffer_amd with nir_load_ubo
4330- ac/llvm,radeonsi: lower nir_load_point_coord_maybe_flipped in nir
4331- ac,radv: move ps arg compation to common place
4332- aco: support 32bit address in nir_load_smem_amd
4333- nir: add missing image atomic_inc/dec_wrap intrinsic
4334- aco: implement nir_bindless_image_atomic_inc/dec_wrap
4335- aco: skip scratch buffer init when its arg is not used
4336- aco: fix nir_f2u64 translation
4337- nir: add nir_export_dual_src_blend_amd intrinsic
4338- aco: move create_fs_dual_src_export_gfx11 above
4339- aco: implement nir_export_dual_src_blend_amd
4340- ac/nir/ps: use nir_export_dual_src_blend_amd when aco
4341- ac/nir/ps: add no_color_export option
4342- aco: support nir_export_amd with ps targets
4343- aco,radv: lower outputs to exports when nir for monolithic ps
4344- ac/llvm: remove output variable declaration for radv ps
4345- radv: implement nir_load_barycentric_optimize_amd
4346- ac/nir/ps: remove used nir_variable if created
4347- aco,ac/llvm,radv,radeonsi: handle ps bc optimization in nir for radv
4348- aco,radv: remove unused aco compile options
4349- aco,radv: support symbol relocation in aco
4350- aco: get scratch addr from symbol for radeonsi
4351- aco: allow no export instruction for gfx10+ fs
4352- ac/nir/cull: fix line position w culling
4353- meson: build radeonsi with aco
4354- radeonsi: add aco debug option
4355- radeonsi: add use_aco field for struct si_shader
4356- radeonsi: add shader info for frag coord and sample pos read
4357- radeonsi: add shader info uses_sampleid
4358- radeonsi: pack spi ps input fixup to a function
4359- radeonsi: init spi ps input shader config when aco
4360- radeonsi: add a raw shader binary type
4361- ac/binary: pack prefech align code to a function
4362- radeonsi: support raw shader binary upload
4363- radeonsi: support print raw shader binary
4364- radeonsi: remove ps vgpr index save when args init
4365- tgsi_to_nir: call nir_lower_int64 when required
4366- ac/llvm,radeonsi: lower idiv in nir
4367- ac/llvm,radeonsi: lower fsin/fcos in nir
4368- ac/llvm,radeonsi: lower txf offset in nir
4369- ac/llvm,radeonsi: lower ineg in nir
4370- ac/llvm,radeonsi: lower some pack/unpack ops not supported by aco
4371- ac/llvm,radeonsi: lower nir_fpow for aco and llvm
4372- radeonsi: lower some 64bit ops aco does not support
4373- radeonsi: lower vector const to scalar at last for aco
4374- radeonsi: add has_non_uniform_tex_access shader info
4375- radeonsi: lower non uniform texture access when aco
4376- radeonsi: add initial aco compile code
4377- radeonsi: add symbols to si_shader_binary
4378- radeonsi: resolve aco scratch addr symbols
4379- radeonsi: adjust ps args for aco
4380- radeonsi: pass use_aco to ac_nir_lower_ps
4381- radeonsi: clamp shadow texture reference in nir for aco
4382- ac/llvm,radeonsi: enable lower_array_layer_round_even
4383- radeonsi: fixup sampler desc for tg4 in nir
4384- radeonsi: be able to use aco compiler for mono ps
4385- ac/llvm: remove the double frcp special handling
4386- radeonsi: fix aco compile for atomic ops
4387- ac/llvm: remove redundant nir_lower_legacy_atomics
4388- radeonsi: fix uses_instanceid for merged mono shader stage
4389- aco: implement two load lds ngg intrininsic for radeonsi
4390- aco,radv: remove unused aco_shader_info fields
4391- ac/nir/ngg: don't use 8bit alu ops
4392- aco: implement load buffer with ACCESS_USES_FORMAT_AMD
4393- aco/assembler: handle ds_(add|sub)_gs_reg_rtn encoding
4394- aco: use gds reg when ordered xfb counter add
4395- aco: implement nir_xfb_counter_sub_amd
4396- aco: implement nir_bindless_image_fragment_mask_load_amd
4397- aco: use ac_get_image_dim for array check when image intrinsic
4398- radeonsi: resolve lds ngg aco symbols
4399- radeonsi: add scratch offset vs args explicitly for aco
4400- ac/llvm,radeonsi: lower nir_load_gs_vertex_offset_amd in abi
4401- ac/llvm,radeonsi: lower nir_load_merged_wave_info_amd in abi
4402- ac/llvm,radeonsi: lower load_workgroup_num_input_(vertices|primitives) in abi
4403- ac/llvm,radeonsi: lower nir_load_initial_edgeflags_amd in abi
4404- ac/llvm,radeonsi: lower nir_load_packed_passthrough_primitive_amd in abi
4405- ac/llvm,radeonsi: lower nir_load_ordered_id_amd in abi
4406- ac/llvm,radeonsi: lower nir_load_ring_esgs_amd in abi
4407- nir,ac/llvm,radeonsi: replace nir_buffer_atomic_add_amd with ssbo atomic
4408- radeonsi: fill aco shader info for mono standalone vs
4409- radeonsi: calculate needed lds size when upload raw binary for vs
4410- radeonsi: use nir_umul_high for fast udiv
4411- radeonsi: always use scoped barrier
4412- ac/llvm: remove unused barrier implementation
4413- radeonsi: enable aco for mono standalone vs
4414- aco,radv: remove unused gs aco shader info
4415- ac/nir,radv: add 1 dword to LS/HS vertex stride
4416- ac/nir,radv: add 1 dword to ES/GS item size
4417- radeonsi: add scratch_offset arg for aco tcs
4418- radeonsi: lower nir_load_tess_rel_patch_id_amd in abi for aco
4419- ac/llvm,radeonsi: lower nir_load_ring_tess_offchip_amd in abi
4420- radeonsi: enable aco support for mono standalone tcs
4421- radeonsi: add scratch_offset arg for aco tes
4422- radeonsi: init tes aco shader info fields
4423- radeonsi: update lds size for tes
4424- radeonsi: enable aco support for standalone tes
4425- radeonsi: add scratch_offset arg for aco gs
4426- ac/llvm,radeonsi: lower nir_load_ring_gsvs_amd in abi
4427- radeonsi: enable aco for standalone gs
4428- radeonsi: enable aco support for gs copy shader
4429- radeonsi: add scratch_offset arg for aco cs
4430- ac/llvm,radeonsi: lower nir_load_user_data_amd in abi
4431- radeonsi: fix crash when AMD_DEBUG=cs,initnir
4432- radeonsi: enable aco support for compute shader
4433- ac/nir/ngg: fix ngg_gs_clear_primflags crash
4434
4435QwertyChouskie (1):
4436
4437- docs/features.txt(fix): mark VK_EXT_pipeline_robustness as supported on radv
4438
4439Rajnesh Kanwal (9):
4440
4441- pvr: Add support to process transfer and blit cmds
4442- pvr: Implement vkCmdCopyBufferToImage API.
4443- pvr: Implement vkCmdCopyImage2KHR API.
4444- pvr: Implement vkCmdBlitImage API.
4445- pvr: Implement vkCmdClearColorImage API.
4446- pvr: Implement vkCmdCopyImageToBuffer2 API.
4447- pvr: Implement vkCmdFillBuffer API.
4448- pvr: Implement vkCmdResolveImage2KHR API.
4449- pvr: Implement vkCmdClearDepthStencilImage API.
4450
4451Rhys Perry (92):
4452
4453- nir: add is_gather_implicit_lod
4454- vtn: set is_gather_implicit_lod
4455- aco: support implicit LOD for nir_texop_tg4
4456- ac/llvm: support implicit LOD for nir_texop_tg4
4457- aco: remove SMEM_instruction::prevent_overflow
4458- aco: use apply_nuw_to_ssa() with load_smem_amd
4459- ac/nir/ps: fix null export write mask miss set to 0xf
4460- aco: don't move exec reads around exec writes
4461- aco: don't move exec writes around exec writes
4462- radv: fix bc optimization with POS_W_FLOAT_ENA(1)
4463- aco/ra: create M0-affinities for s_sendmsg
4464- aco/gfx11: fix VMEM/DS->VALU WaW/RaW hazard
4465- amd/drm-shim: move device list to external file
4466- amd/drm-shim: add polaris10
4467- amd/drm-shim: add vega10
4468- amd/drm-shim: add navi10
4469- aco: add get_op_fixed_to_def() helper
4470- aco: consider how definitions fixed to operands can change register demand
4471- nir/fold_16bit_tex_image: skip tex instructions with backend1
4472- nir,vtn,aco,ac/llvm: make cube_face_coord_amd more direct
4473- ac/nir: add pass for lowering 1d/cube coordinates
4474- ac/nir: round layer in ac_nir_lower_tex
4475- radv,radeonsi: use ac_nir_lower_tex
4476- nir/lower_tex: remove lower_array_layer_round_even
4477- ac/nir: add fix_derivs_in_divergent_cf
4478- aco: remove unused RegType
4479- aco: let p_start_linear_vgpr take an operand
4480- aco: add MIMG_instruction::strict_wqm
4481- aco: implement strict_wqm_coord_amd
4482- aco: implement texture samples with strict WQM coordinates
4483- radv: use fix_derivs_in_divergent_cf
4484- aco/tests: improve performance of declaration parsing
4485- aco/tests: add fix_derivs_in_divergent_cf tests
4486- aco: fix update_alu(clear=true) for exports
4487- aco: use pass_flags to recover s_delay_alu cycles
4488- aco: insert s_delay_alu on the linear CFG
4489- aco: improve printing of s_delay_alu
4490- radv: allow wave32 for geometry shaders
4491- aco: fix has_color_exports=true for mrtz exports
4492- aco/tests: add discard export target tests
4493- aco: fix ds_sub_gs_reg_rtn validation
4494- radv: initialize aco_compiler_options::is_opengl
4495- radv: correctly skip vertex loads with packed formats
4496- aco: consider position/primitive exports around memory barriers
4497- ac/nir: use scoped barriers to finish stores before exports
4498- aco: remove memory_barrier_buffer implementation
4499- aco: mask bits source of s_bfe
4500- aco/tests: test that s_bfe bits is masked
4501- util: fix gc_alloc_size alignment
4502- util/tests: add gc_alloc_size alignment tests
4503- aco: run nir_lower_int64 after nir_opt_uniform_atomics
4504- ac: fix PIPE_FORMAT_R11G11B10_FLOAT DST_SEL_W
4505- radv: refactor CS subgroup size determination
4506- radv: use wave32 for small workgroups
4507- aco: don't try to form load+store clauses
4508- aco/gfx11: use s_clause with stores
4509- aco/gfx11: schedule for VMEM store clauses
4510- aco: don't set exec_hi for wave32 scan reductions
4511- amd/drm-shim: use fixed-width types
4512- nir/peephole_select: allow some invocation broadcast intrinsics
4513- aco: include helpers in emit_uniform_{reduce,scan}
4514- nir,aco: add INCLUDE_HELPERS index to reduce intrinsic
4515- nir/opt_intrinsic: optimize quad vote
4516- radv: use nir_opt_intrinsics
4517- aco,ac/llvm,ac/nir,vtn: unify cube opcodes
4518- nir: split nir_lower_mov64
4519- radv: use nir_lower_conv64
4520- radv: call nir_lower_int64 later
4521- radeonsi: use nir_lower_conv64
4522- aco: remove 64-bit integer conversion opcodes
4523- ac/llvm: fix AC_TM_CHECK_IR
4524- radv: fix radv_get_ballot_bit_size with CS
4525- ac/llvm: fix wave32 ac_build_mbcnt_add with 64-bit mask
4526- ac/llvm: skip ballot zext for 32-bit dest with wave32-as-wave64
4527- radv: add conformant_trunc_coord to cache UUID
4528- ac/nir: always round cube array layers
4529- nir/unsigned_upper_bound: fix phi(bcsel)
4530- nir/opt_dead_cf: remove nodes after a jump earlier
4531- aco: insert s_nop before VGPR deallocation
4532- radv: workaround WWZ exporting index=1 through location=1
4533- radv: correctly skip MRT output NaN fixup for meta shaders
4534- aco: summarize register demand after handling branches
4535- aco: don't create sendmsg(dealloc_vgprs) if scratch is used
4536- radv: disable 64-bit color attachments
4537- aco: fix p_bpermute_gfx6 with input at non-zero byte
4538- radv: fix 128bpp comp-to-single clears
4539- aco/spill: skip p_branch in process_block
4540- aco/spill: add all live-in to merge block spill candidates
4541- aco/optimizer_postRA: check overwritten_subdword in is_overwritten_since()
4542- aco: check logical_phi_info at p_logical_end when eliminating exec writes
4543- aco: remove unused p_logical_end check when optimizing branching sequence
4544- aco: reset prefetch in the correct block after removing the exit
4545
4546Rob Clark (58):
4547
4548- freedreno/a6xx: Fix valid_format_cast logic for newer a6xx
4549- freedreno: Remove unused fd_batch_reset()
4550- freedreno: Inline single-caller helpers
4551- freedreno: Extra casting to make C++ happy
4552- freedreno/registers: C++ struct casting
4553- util/log: Add missing "const"
4554- freedreno/ir3: More perfetto tracing
4555- mesa/nir: Add some perfetto traces
4556- freedreno/perfetto: Add shader_id for compute stages
4557- freedreno: Add dirty state logging
4558- freedreno/a6xx: Pass ring to __ONE_REG()
4559- freedreno: Add more tracepoint fields
4560- freedreno: Fix resource tracking vs rebind/invalidate
4561- freedreno/a6xx: Change a618 tile_align_h back to 32
4562- dri/android: Fix MSAA resolve
4563- Revert "ci: disable all a306/a530/a630 jobs"
4564- freedreno/a6xx: Rework set_bin_size()
4565- freedreno/a6xx+: Use template to handle a6xx vs a7xx differences
4566- freedreno/batch: Add helper to set fb state
4567- freedreno/a6xx: Move LRZ clear to blitter
4568- freedreno/a6xx: Add ctx->emit_sysmem()
4569- freedreno/a6xx: Simplify per-tile conditional IBs
4570- freedreno/a6xx: Switch to batch->cleared
4571- freedreno/a6xx: Split tile loads and clears
4572- freedreno/a6xx: Introduce batch subpasses
4573- freedreno/a6xx: Per-subpass LRZ
4574- freedreno/a6xx: New subpass on mid-frame clears
4575- freedreno/a6xx: Move LRZ clears to gmem
4576- freedreno/a6xx: Actually use LRZ for ms
4577- freedreno/a5xx+a6xx: Don't allocate LRZ for z32
4578- tu: Move queue deletion to last
4579- mesa: Skip update_gl_clamp() if samplers need clamp
4580- freedreno/a6xx: Template specialization for draw type
4581- freedreno/a6xx: Template specialization for pipeline type
4582- freedreno/a6xx: Optimize max_indices calculation
4583- freedreno/batch: Move submit bo tracking to batch
4584- freedreno/drm: Don't try to export suballoc bo
4585- freedreno: Handle export error handling
4586- freedreno: Add aux-context support
4587- freedreno: Reallocate on unshared export
4588- freedreno/a6xx: Clean up open coded flushes
4589- freedreno/a6xx: Stop using fd_wfi()
4590- freedreno/a6xx: Add missing cap
4591- freedreno/a6xx: Fix xfb stream configuration
4592- freedreno/a6xx: Remove primitives_relocw()
4593- freedreno/a6xx: GL_ARB_transform_feedback_overflow_query
4594- freedreno/a6xx: Split primitives and pipeline-stats queries
4595- freedreno/a6xx: Handle nested pipeline stats queries
4596- freedreno: Handle compute queries
4597- freedreno/a6xx: GL_ARB_pipeline_statistics_query
4598- freedreno/a6xx: Enable gl46
4599- freedreno: Add extra assert
4600- freedreno/batch: Add driver-thread assert
4601- freedreno/a6xx: Directly invalidate on samp view update
4602- freedreno/a6xx: Use idalloc for samp/view seqno's
4603- freedreno/fdperf: Use common device info helpers
4604- freedreno/drm/virtio: Trigger host side wait boost
4605- tu/drm: Add missing error path cleanup
4606
4607Robert Beckett (1):
4608
4609- winsys/panfrost: Fix a scanout resource leak
4610
4611Robert Mader (1):
4612
4613- egl/wayland: wait for compositor to release shm buffers
4614
4615Rohan Garg (41):
4616
4617- anv: use the workaround framework for WA 14013111325
4618- hasvk: drop dead code
4619- iris: use the workaround framework for WA 14013111325
4620- anv: use the common vulkan runtime to do the heavy lifting
4621- anv: drop duplicated nir_opt_dce passes
4622- intel: infer scalar'ness locally for brw_postprocess_nir
4623- intel: drop unused is_scalar function parameter in brw_nir_apply_key
4624- intel: update comments about non-existent function parameter
4625- intel: infer scalar'ness locally for brw_vectorize_lower_mem_access
4626- anv: drop duplicate checks when setting the compressed bit
4627- iris: correctly set alignment to next power of two for struct size
4628- ac/surface: make sure alignment is a POT
4629- freedreno: set alignment to next POT
4630- util: fix ROUND_DOWN_TO alignment type
4631- util: migrate alignment functions and macros to use ALIGN_POT
4632- util: revert back to ALIGN since it moved to util
4633- util: move pot functions to use existing macros
4634- anv: enable single texel alignment
4635- isl: add helper to check if aux usage is CCS_E
4636- anv: set aux usage to GFX12_CCS_E if a platform needs WA 14010672564
4637- anv: limit non zero fast clear check to GFX12_CCS_E
4638- anv: fix incorrect asserts when combining CPS and per sample interpolation
4639- hasvk: enable single texel alignment
4640- anv: split ANV_PIPE_RENDER_TARGET_BUFFER_WRITES for finer grained flushing
4641- anv: move WA 1607854226 to use the WA infrastructure
4642- intel/compiler: construct masks instead of using magic values
4643- intel/compiler: reuse previously computed bitsize
4644- anv: retry batchbuffer submission with i915
4645- iris: migrate WA 14013910100 to use the WA framework
4646- iris: migrate WA 14016118574 to use the WA framework
4647- iris: fix iris for WA 16013000631
4648- intel/perf: add perf query support for Intel Raptorlake
4649- anv: use the correct GFX_VERx10 macro for WA
4650- anv,iris: program the maximum number of threads on compute queue init
4651- anv: partially revert 2e8b1f6d
4652- anv: drop dead ifdef
4653- iris: use the correct WA macros and lineage numbers
4654- anv: use the lineage number for WA
4655- crocus: fix GFX_VERx10 macro
4656- blorp: drop undefined macro
4657- iris: migrate preemption streamwout wa to WA infra
4658
4659Roland Scheidegger (2):
4660
4661- llvmpipe: minor cleanups in line rendering code
4662- llvmpipe: fix some corner cases with line rendering
4663
4664Romain Failliot (1):
4665
4666- docs(fix): remove last ref to i965 in features.txt
4667
4668Ruijing Dong (19):
4669
4670- radeonsi/vcn: add macros used in av1 encoding
4671- radeonsi/vcn: enable 2 pass search center map
4672- radeonsi/vcn: enable swizzle mode in encoding ref frames.
4673- radeonsi/vcn: merge get_output_format_param function
4674- radeonsi/vcn: remove extra zero bytes from bitstream
4675- radeonsi/vcn: add av1 dpb variables and cdf table
4676- gallium/pipe: add av1 encoding data structure in pipe
4677- radeonsi/vcn: add av1 enc data structure
4678- radeonsi/vcn: add some av1 encoding function
4679- radeonsi/vcn: add av1 encoding ib packages and get_info
4680- frontends/va: adding va av1 encoding functions
4681- radeonsi/vcn: use PIPE_ENC_FEATURE enum
4682- frontends/va: define va av1 encoding caps
4683- radeonsi/vcn: correct cropping for hevc case
4684- radeonsi/vcn: fix decoding bs buffer alignement issue.
4685- gallium/pipe: add interface update_decoder_target
4686- radeonsi/vcn: apply update_decoder_target logic
4687- frontends/va: remove private member and update target buffer
4688- radeonsi/vcn: change max_poc to fixed value for hevc encoder.
4689
4690Ryan Houdek (1):
4691
4692- util: move check for AVX512
4693
4694Ryan Neph (2):
4695
4696- virgl: add debug flag to force synchronous GL shader compilation
4697- virgl: check a debug option again at context creation
4698
4699Sagar Ghuge (20):
4700
4701- anv: Factor out code from anv_image_hiz_clear
4702- anv: Move and make anv_can_hiz_clear_ds_view non-static
4703- anv: Fast clear depth/stencil surface in vkCmdClearAttachments
4704- anv: Set CS stall bit during HIZ_CCS_WT surface fast clear
4705- iris: Set CS stall bit during HIZ_CCS_WT surface fast clear
4706- intel/genxml: Add CCS cache flush field to PIPE_CONTROL
4707- intel/genxml: Add Compute/Blitter CCS aux invalidation register
4708- anv: Add CCS cache flush bits to anv_pipe_bits
4709- anv: Fix AUX-TT invalidation
4710- anv: implement recommended flush/wait of AUX-TT invalidation on compute
4711- iris: Add CCS cache flush bits
4712- iris: Fix AUX-TT invalidation
4713- iris: implement recommended flush/wait of AUX-TT invalidation
4714- intel/ds: Track CCS cache flush bit
4715- iris: Use correct CCS0 aux-map register offset
4716- intel/genxml: Fix typo in CCS cache flush enable
4717- intel/genxml: Drop incorrect compute aux-inv register entry
4718- anv: Drop depth cache flush requirement after depth clear/resolve
4719- iris: Drop depth cache flush requirement after depth clear/resolve
4720- blorp: Drop unnecessary assertions in blorp_can_hiz_clear_depth
4721
4722Samuel Holland (3):
4723
4724- Android.mk: Allow building only Vulkan drivers
4725- Android.mk: Explicitly enable/disable LLVM support
4726- Android.mk: Only link LLVM for radeonsi, not amd_vk
4727
4728Samuel Pitoiset (203):
4729
4730- radv: fix detecting FMASK_DECOMPRESS/DCC_DECOMPRESS meta pipelines
4731- vulkan: ignore rasterizationSamples when the state is dynamic
4732- radv: try to keep HTILE compressed for READ_ONLY_OPTIMAL layout
4733- radv: re-emit the guardband state when related PSO are bound
4734- radv: tidy up dirtying RBPLUS state in radv_bind_dynamic_state()
4735- radv: disable fast-clears with CMASK for 128-bit formats
4736- radv: require DRM 3.27
4737- radv/amdgpu: remove legacy code path for creating the BO list
4738- radv/amdgpu: remove legacy code for querying context status
4739- radv: do not allow 1D block-compressed images with (extended) storage on GFX6
4740- radv: fix usage flag for 3D compressed 128 bpp images on GFX9
4741- radv: wait for occlusion queries in the resolve query shader
4742- radv: delay enabling/disabling occlusion queries at draw time
4743- radv: track DB_COUNT_CONTROL changes to avoid context rolls
4744- radv: emit PIXEL_PIPE_STAT_CONTROL in the gfx preamble for GFX11
4745- radv: use gfx_level in radv_flush_occlusion_query_state()
4746- radv: update binning settings to work around GPU hangs
4747- radv/ci: remove one expected test failure on PITCAIRN
4748- radv/amdgpu: fix adding continue preambles and postambles BOs to the list
4749- Revert "ci/radv: Demote navi21 to manual until recent flakiness resolves."
4750- radv: add the perf counters BO to the preambles BO list
4751- radv: do not overallocate the CS array during submissions
4752- ac/sqtt: add rgp_sqtt_marker_cb_id definition
4753- ac/sqtt: add a helper to get cmdbuf IDs per queue
4754- radv: reserve command buffer index for SQTT
4755- docs: rename ACO_DEBUG=noscheduling to ACO_DEBUG=nosched
4756- docs: add missing ACO_DEBUG=force-waitdeps
4757- radv: only enable extendedDynamicState3ConservativeRasterizationMode on GFX9+
4758- ac/spm: introduce ac_spm_trace and ac_spm_get_trace()
4759- ac/spm: rename ac_spm_trace_data to ac_spm
4760- ac/sqtt: add a helper for adding clock calibration records
4761- ac/sqtt: add helpers for initializing ac_thread_trace_data
4762- ac/sqtt: initialize clock calibration/queue info/queue event records
4763- radv/sqtt: sample CPU/GPU clocks before starting the trace
4764- radv/sqtt: add support for queue info
4765- ac/sqtt: add new bits to rgp_sqtt_marker_barrier_end
4766- ac/sqtt: add missing EventUnknown to rgp_sqtt_marker_event_type
4767- ac/rgp: update SQTT_FILE_CHUNK_TYPE_API_INFO to minor version 2
4768- ac/rgp: update SQTT_FILE_CHUNK_TYPE_ASIC_INFO to minor version 5
4769- ac/sqtt: add ac_sqtt_se_is_disabled() helper
4770- ac/sqtt: add ac_sqtt_get_trace() helper
4771- radv: do not abort when the SQTT buffer resize failed
4772- ac/rgp: remove ac_thread_trace_data from ac_thread_trace
4773- ac,radv,radeonsi: rename thread_trace to sqtt everywhere
4774- ac/nir: fix 8-bit/10-bit PS exports clamping
4775- radv: enable RADV_THREAD_TRACE_CACHE_COUNTERS by default
4776- radv: fix dynamic depth clamp enable support
4777- radv: fix invalid type for usage in radv_get_buffer_memory_requirements()
4778- radv: fix fast-clearing images with VK_REMAINING_{ARRAY_LAYERS,MIP_LEVELS}
4779- radv: replace radv_get_layerCount by vk_image_subresource_layer_count()
4780- radv: replace radv_get_levelCount() by vk_image_subresource_level_count()
4781- radv/meta: rename dest to dst
4782- radv: disable RB+ blend optimizations on GFX11 when a2c is enabled
4783- radv: use vk_image::mip_levels instead of radv_image::info::levels
4784- radv: use vk_image::array_layers instead of radv_image::info::array_size
4785- radv: use vk_image::samples instead of radv_image::info::storage_samples
4786- radv: use vk_image::samples instead of radv_image::info::samples
4787- radv: use vk_image::extent instead of radv_image::info::{width,height,depth}
4788- radv: remove ac_surf_info from radv_image
4789- ac/spm: switch to SPM version 2.0
4790- vulkan: Update XML and headers to 1.3.250
4791- radv: implement VK_EXT_attachment_feedback_loop_dynamic_state
4792- radv: advertise VK_EXT_attachment_feedback_loop_dynamic_state
4793- spirv: ignore SpvDecorationInvariant warning on struct members
4794- radv/ci: stop setting MESA_SPIRV_LOG_LEVEL
4795- radv: reset the emitted VS prolog when a new vertex shader is bound
4796- radv: dirty the dynamic vertex input state only when needed
4797- radv: re-emit fragment shading rate state when PA_CL_VRS_CNTL changes
4798- radv: configure PA_CL_VRS_CNTL entirely from the cmd buffer
4799- radv: implement dynamic sample locations enable
4800- radv: handle NULL fragment shaders when recording cmdbuf
4801- radv: handle NULL fragment shaders when creating graphics pipelines
4802- radv: rework the checks for implicit exports with GPL
4803- radv: allow to determine NGG settings with a NULL fragment shader
4804- radv: stop compiling a noop FS when the application doesn't provide a FS
4805- radv: advertise VK_EXT_tooling_info
4806- radv: reset the emitted PS epilog when a new fragment shader is bound
4807- radv: remove unused pipeline param in radv_generate_ps_epilog_key()
4808- radv: stop using the pipeline for determining the null export workaround
4809- radv: fix emitting VRS state with a null fragment shader
4810- radv: fix resetting VRS if the graphics pipeline doesn't enable it
4811- radv: fix a sync issue with primitives generated query and NGG/legacy
4812- amd/drm-shim: add navi21
4813- amd/drm-shim: add pitcairn
4814- amd/drm-shim: add bonaire
4815- amd/drm-shim: update README about which file to modify
4816- ci: build drm-shim in debian-testing
4817- ci,radv: use drm-shim instead of the null winsys for radv-fossils
4818- ci: stop using the hang-detection tool for vkd3d-proton
4819- ci: rework vkd3d-proton runner and fix detecting failures
4820- radv: reserve cmdbuf space in radv_flush_gfx2ace_semaphore()
4821- radv: bump the global VRS image size to maximum supported FB dimensions
4822- radv: disable IMAGE_USAGE_STORAGE with depth-only and stencil-only formats
4823- radv: remove useless check about USAGE_STORAGE for TC-compat HTILE
4824- nir: add nir_intrinsic_load_poly_line_smooth_enabled
4825- radeonsi: lower nir_intrinsic_load_poly_line_smooth_enabled_amd
4826- nir: lower smooth lines conditionally using the new intrinsic
4827- radv: track if the smoothLines features is enabled in the device
4828- radv: determine if smooth lines can be used in the pipeline key
4829- radv: declare a new user SGPR for the dynamic line rasterization mode
4830- radv: lower nir_intrinsic_load_poly_line_smooth_enabled_amd
4831- radv: add support for smooth lines
4832- radv: enable smoothLines
4833- radv: apply a bug workaround for smoothing on GFX6
4834- radv: do not enable VRS flat shading if the VRS builtin is read
4835- zink/ci: update VANGOGH expected list of failures
4836- vulkan/pipeline_cache: remove a bogus assert when inserting objects
4837- zink/ci: skip arb_texture_buffer_object@texture-buffer-size-clamp* with RADV
4838- radv: fix copying 2D to 3D images
4839- ci: uprev vkd3d-proton to 2.9
4840- amd: fix 64-bit integer color image clears
4841- radv: rework configuring VGT_SHADER_STAGES_EN
4842- radv/ci: update list of expected failures since Vulkan loader 1.3
4843- radv/ci: skip tests that timeout since Vulkan loader 1.3
4844- vulkan: Update XML and headers to 1.3.251
4845- radv: advertise VK_EXT_dynamic_rendering_unused_attachments
4846- aco: remove nir_intrinsic_load_barycentric_at_sample occurences
4847- radv/ci: removed expected failures that are skipped now
4848- radv/nir: use ac_nir_unpack_arg() for packed shader input user SGPRS
4849- radv: introduce SHIFT/MASK for unpacking shader input args
4850- radv: regroup fragment shader user SGPRs emission
4851- radv: merge all FS user SGPRs into one using packed arguments
4852- spirv: add support for SpvCapabilityFragmentBarycentricKHR
4853- spirv,nir: add support for BaryCoord{NoPersp}KHR builtins
4854- spirv,nir: add support for SpvDecorationPerVertexKHR
4855- nir/lower_io: add nir_intrinsic_load_input_vertex to is_input()
4856- nir: print locations for per-vertex fragment shader inputs
4857- zink/ci: remove useless RADV_PERFTEST=gpl
4858- radv: initialize the device cache UUID even if on-disk cache is disabled
4859- nir: add nir_intrinsic_load_provoking_vtx_amd
4860- radv: add support for nir_intrinsic_load_provoking_vtx_amd
4861- radv: track if the rasterization primitive is known at compile time
4862- nir: add nir_intrinsic_load_rasterization_primitive_amd
4863- radv: add support for nir_intrinsic_load_rasterization_primitive_amd
4864- radv: handle per_vertex variables when gathering FS inputs
4865- radv: set ROTATE_PC_PTR for custom interpolations
4866- radv: configure RSRC1.LOAD_PROVOKING_VTX for the fragment shader
4867- radv: add a NIR pass that lower fragment shader barycentric intrinsics
4868- radv: gather info about nir_intrinsic_load_sample_positions_amd
4869- radv: advertise VK_KHR_fragment_shader_barycentric on GFX10.3+
4870- radv: add a helper for emitting a null depth/stencil target
4871- radv: reset more DB registers when emitting a null ds target
4872- radv: emit DB_RENDER_CONTROL as part of the framebuffer
4873- radv: disable HTILE compression only when layouts are compressed
4874- radv/ci: update the list of expected failures on STONEY
4875- radv: gather info about load_poly_line_smooth_enabled
4876- radv: add a helper for forcing VRS 1x1 in some situations
4877- radv: do not force VRS 1x1 when smooth lines are enabled
4878- radv: fix smooth lines with graphics pipeline library
4879- radv: fix re-emitting some dynamic states when the previous FS is NULL
4880- radv: fix re-emitting early_z/late_z when the bound PS changes
4881- radv: reset some dynamic states when the fragment shader stage is unbound
4882- radv: remove unused radv_dgc_token struct
4883- radv: add dgc_emit_state() helper
4884- radv: add dgc_emit_push_constant() helper
4885- radv: add dgc_emit_vertex_buffer() helper
4886- radv: add dgc_emit_draw() helper
4887- radv: add dgc_emit_draw_indexed() helper
4888- radv: add dgc_emit_index_buffer()
4889- radv: do not use IB for the GFX preamble with RADV_DEBUG=noibs
4890- radv: use IB for the GFX preamble on GFX6
4891- radv: reserve space for shadowed regs
4892- radv/amdgpu: fix a buffer overflow for submissions with RADV_DEBUG=noibs
4893- radv/amdgpu: remove useless assert in radv_amdgpu_winsys_cs_submit_internal()
4894- radv/amdgpu: add cs_execute_ib() for executing IBs
4895- radv: use cs_execute_ib() for GFX, MBCP and DGC IBs
4896- vulkan/runtime: call CmdSetDepthBias2EXT() from CmdSetDepthBias()
4897- radv: implement VK_EXT_depth_bias_control
4898- radv: advertise VK_EXT_depth_bias_control
4899- radv: implement padding cmdbuffer for DGC on GFX6
4900- radv: enable NV_device_generated_commands on GFX6
4901- radv: reserve more space in CS for SQTT
4902- radv/amdgpu: fix dumping cs with RADV_DEBUG=noibs
4903- radv/amdgpu: dump all cs with RADV_DEBUG=noibs
4904- radv: only dirty the index type when necessary with DGC
4905- radv: only dirty the active push constant stages with DGC
4906- radv: adjust alignment of the preprocess buffer with DGC
4907- radv/amdgpu: use the correct IB size when growing a CS with RADV_DEBUG=noibs
4908- radv/amdgpu: rework growing a CS with the chained IB path slightly
4909- radv/amdgpu: do not set the IB size when ending a CS with RADV_DEBUG=noibs
4910- radv/amdgpu: use the array of IB buffers for the chained IB path
4911- radv/amdgpu: use cs_finalize() when growing a CS
4912- radv/amdgpu: rename old_ib_buffers to ib_buffers
4913- radv/amdgpu: add a helper to get a new IB
4914- radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build
4915- radv/amdgpu: workaround a kernel bug when replacing sparse mappings
4916- radv/amdgpu: add more small helpers for managing CS
4917- radv/amdgpu: add support for executing DGC cmdbuf with RADV_DEBUG=noibs
4918- radv: allow NV_device_generated_commands with RADV_DEBUG=noibs
4919- radv: stop emitting TILE_SURFACE_ENABLE for the ZRANGE_PRECISION workaround
4920- radv: inline more values in radv_emit_fb_ds_state()
4921- radv: emit PA_SC_SCREEN_SCISSOR_BR with the actual fb extent
4922- zink/ci: update list of expected failures for NAVI10
4923- zink: fix setting VkShaderCreateInfoEXT::nextStage
4924- radv/rt: fix capture/replay support
4925- vulkan: ignore VkPipelineColorWriteCreateInfoEXT if the state is dynamic
4926- Revert "radv/amdgpu: workaround a kernel bug when replacing sparse mappings"
4927- Revert "radv/amdgpu: skip adding per VM BOs for sparse during CS BO list build"
4928- radv/amdgpu: fix executing secondaries without IB2
4929- radv/amdgpu: do not copy the original chain link for IBs
4930- radv: fix emitting SQTT userdata when CAM is needed
4931- radv: fix capturing RGP on RDNA3 with more than one Shader Engine
4932- radv: set THREAD_TRACE_MARKER_ENABLE for mesh/task draws
4933
4934Sarah Walker (15):
4935
4936- pvr: Support single core transfer queue commands on multicore GPUs
4937- pvr: Implement pvr_pbe_setup_modify_defaults()
4938- pvr: Complete pvr_modify_command()
4939- pvr: Complete pvr_unwind_rects()
4940- pvr: Complete pvr_double_stride()
4941- pvr: Implement pvr_isp_scan_direction()
4942- pvr: Implement pvr_reroute_to_clip()
4943- pvr: Support ipf_creq_pf in pvr_isp_ctrl_stream()
4944- pvr: Complete pvr_3d_validate_addr()
4945- pvr: Support multiple sources per pass in TQ job submission
4946- pvr: Complete pvr_generate_custom_mapping()
4947- pvr: Fragment register fb_cdc_zls is feature dependent
4948- pvr: use pvr_csb_pack() to setup CR_FB_CDC_ZLS
4949- pvr: Rename heap reserved area to static data carveout
4950- pvr: Merge main and extension command streams
4951
4952Sathishkumar S (5):
4953
4954- util/format: add planar3 r8_g8_b8_unorm pipe format
4955- frontends/va: add support for RGBP rt_format
4956- radeonsi/vcn: enable RGBP format on gfx940 jpeg
4957- radeonsi/vcn: engage all jpeg engines on gfx940 for mjpeg decode
4958- frontends/va: return matching drm format for yuyv pipe format
4959
4960Semjon Kravtsenko (1):
4961
4962- glx: Assign unique serial number to GLXBadFBConfig error
4963
4964Sergi Blanch Torne (8):
4965
4966- ci: Move Vulkan CTS patches to their own directory
4967- ci: disable Collabora's LAVA lab for maintance
4968- Revert "ci: disable Collabora's LAVA lab for maintance"
4969- ci: Allow zink-radv jobs to be manual when uprev piglit
4970- ci: disable Collabora's LAVA lab for maintance
4971- Revert "ci: disable Collabora's LAVA lab for maintance"
4972- ci: disable Collabora's LAVA lab for maintance
4973- Revert "ci: disable Collabora's LAVA lab for maintance"
4974
4975Shan-Min Chao (1):
4976
4977- tu/kgsl: Fix memory overwrite with vkFlushMappedMemoryRanges when more than 1 range
4978
4979Sil Vilerino (30):
4980
4981- d3d12: Do not fail d3d12_screen creation if D3D12_FEATURE_D3D12_OPTIONS14 not available
4982- frontend/va: Support QVBR rate control mode
4983- frontend/va: Allow distinction for HRD params sent from app and frontend defaults
4984- frontend/va: Allow distinction for Min/MaxQP params sent from app and frontend defaults
4985- d3d12: Support QVBR rate control mode
4986- d3d12: Support rate control HRD and MaxFrameSize app params
4987- d3d12: Support QPMin/QPMax app params
4988- d3d12: Support PIPE_VIDEO_CAP_MIN_WIDTH/HEIGHT caps
4989- d3d12: Support PIPE_VIDEO_CAP_ENC_QUALITY_LEVEL
4990- frontend/va: Add VAProfileH264High10
4991- frontend/va: Add H264 decode slice data
4992- d3d12: Use frontend H264 decode slice offsets and sizes instead of parsing buffer
4993- d3d12: Clean unused code for parsing slices
4994- frontends/va: Extend AV1 Encode params
4995- d3d12: AV1 Encode
4996- CI/windows: Update headers and Agility redist to 1.711.3-preview
4997- d3d12: Correct tx_mode_support reporting as specified in libva spec
4998- d3d12: Only set reduced_tx_set when supported by D3D12 caps (no libva caps for reduced_tx_set to map to)
4999- d3d12: Fix usage of D3D12_VIDEO_ENCODER_RATE_CONTROL_FLAG, was using D3D12_VIDEO_ENCODER_SUPPORT_FLAG wrongly instead
5000- frontend/va: Pass surf->fence in PIPE_VIDEO_ENTRYPOINT_ENCODE contexts for driver to wait on input surface pending work
5001- frontend/va: Add video processing async fence support
5002- d3d12: Video Decode - Implement get_decoder_fence and async queing
5003- d3d12: Apply style format to d3d12_video_dec.cpp
5004- d3d12: Video Decode - Sync 3D context copy with decode work for texture array case
5005- d3d12: Video Encode - GPU wait on input surface fence
5006- d3d12: Video Process - Implement get_processor_fence and async queing
5007- d3d12: Video Decode - Refactor and style fixes
5008- frontend/va: Fix vaSyncSurface and vaQuerySurface status for drivers not implementing get_processor_fence
5009- frontend/va: Remove fence_server_sync for surface in vlVaHandleVAProcPipelineParameterBufferType
5010- aux/tc: Add ASSERTED to unreferenced release build variable
5011
5012Simon Perretta (5):
5013
5014- pvr: Amend validation when checking multiple supported types
5015- pvr: Use movc for reading special registers
5016- pvr: Add support for generating transfer fragment programs
5017- pvr: Add support for generating transfer EOT programs
5018- pvr: Use driver vertex input data in the compiler
5019
5020Simon Ser (4):
5021
5022- wayland: generalize wayland-protocols code generation
5023- radv: advertise LINEAR filter support for multiplanar/subsampled
5024- vulkan/wsi/wayland: add 16-bit formats
5025- Update OpenGL headers
5026
5027SoroushIMG (8):
5028
5029- zink: do not emit line stipple dynamic state when emulating
5030- zink: take location_frac into account in lower_line_smooth_gs
5031- zink: fix incorrect line mode check for bresenham
5032- zink: refcount the correct query pool
5033- pvr: fix sync waiting while using pvrsrvkm
5034- pvr: fix infinite recursion in pvr_cmd_buffer_{start,end}_sub_cmd
5035- pvr: add missing frag to geom dependency for jobs targetting same render target
5036- pvr: Fix barrier insertion on merged subpasses
5037
5038SureshGuttula (2):
5039
5040- va/surface : Add Nv12 support for PRIME_2 imports
5041- radeonsi/vcn: update luma and chroma size
5042
5043Sviatoslav Peleshko (7):
5044
5045- isl: Check all channels in isl_formats_have_same_bits_per_channel
5046- anv: Handle UNDEFINED format in image format list
5047- anv: Improve image/view usage bits verification
5048- nir/lower_shader_calls: Fix cursor if broken after nir_cf_extract() call
5049- glsl: Fix yylloc.source propagation in YYLLOC_DEFAULT
5050- dri: Use RGB internal formats for RGBX formats
5051- intel/fs: Check if the whole ubo load range is in the push const range
5052
5053Sylvain Munaut (1):
5054
5055- egl/dri2: Add a couple of missing mutex release in error path
5056
5057Tapani Pälli (33):
5058
5059- isl: disable mcs (and mcs+ccs) for color msaa on gfxver 125
5060- iris: implement state cache invalidate for Wa_16013063087
5061- anv: cleanup bitmask construction for PIPELINE_SELECT
5062- anv: implement state cache invalidate for Wa_16013063087
5063- isl: fix layout for comparing surf and view properties
5064- egl/loader: move crtc resource infrastructure as common helper
5065- anv: handle missing astc for gfx125 in CreateImageView
5066- mesa: set a type for depth fallback texture
5067- intel/dev: provide helper to check if devinfo is ATS-M
5068- anv: add required invalidate/flush for Wa_14014427904
5069- iris: add required invalidate/flush for Wa_14014427904
5070- mesa: validate shader binary format in _mesa_spirv_shader_binary
5071- iris: make Wa_16013994831 to use intel_needs_workaround
5072- anv: make Wa_16013994831 to use intel_needs_workaround
5073- anv: remove BDW specific WA for CS stall enable
5074- intel/dev: add parentheses around intel_needs_workaround macro
5075- iris: use workaround framework for 1408224581, 14014097488
5076- anv: use workaround framework for 1408224581, 14014097488
5077- anv: wrap pipe control emission to a set of helper functions
5078- anv: implement flush part of emit_apply_pipe_flushes with helper
5079- anv: implement invalidate part of emit_apply_pipe_flushes with helper
5080- anv: convert genX_query pipe controls to use pc helper
5081- anv: change pipe controls in genX_state to use pc helper
5082- anv: change pipe control in genX_pipeline to use pc helper
5083- anv: change pipe controls in genX_gpu_memcpy to use pc helper
5084- anv: change pipe control in indirect draw gen to use pc helper
5085- anv: change most pipe controls in gfx8_cmd_buffer to use pc helper
5086- anv: convert most pc in genX_cmd_buffer to use pc helper
5087- isl: handle DRM_FORMAT_MOD_INVALID in isl_drm_modifier_has_aux
5088- intel/compiler: add more validation for acc register usage
5089- mesa: fix some TexParameter and SamplerParameter cases
5090- iris: avoid issues with undefined clip distance
5091- crocus: avoid issues with undefined clip distance
5092
5093Tatsuyuki Ishi (18):
5094
5095- util: Add dedicated hex conversion functions and use it.
5096- util: Call mesa_bytes_to_hex directly instead of disk_cache_format_hex_id.
5097- util: Add a copy of BLAKE3 hash library.
5098- util/blake3: Patch with hidden visibility for asm symbols.
5099- util: Add mesa_blake3 wrappers.
5100- nir: Fix serializing pointer initializers.
5101- radv: Make shader related destruction happen before hw_ctx.
5102- radv: Add RGP barrier markers for render pass transition and copy.
5103- radv: Guard against misplaced RGP barrier markers.
5104- util/blake3: Add blake3_hash typedef.
5105- vulkan: Migrate shader module hash to BLAKE3.
5106- vulkan/pipeline_cache: Do not consume object passed into remove_object.
5107- vulkan/pipeline_cache: Move locking outside of remove_object.
5108- vulkan/pipeline_cache: Move cache_object_unref out of header.
5109- vulkan/pipeline_cache: Introduce weak reference mode.
5110- radv: Enable weak reference cache for device->mem_cache.
5111- zink/ci: Add ext_transform_feedback@api-errors to fail list.
5112- radv/amdgpu: Do not pass in a BO handle when clearing PRT VA region.
5113
5114Teng, Jin Chung (1):
5115
5116- d3d12: HEVC Encode - Fix num_subregions_per_scanline rounding
5117
5118Thomas H.P. Andersen (30):
5119
5120- nir/nir_lower_wpos_center: Use the nir_shader_instructions_pass() helper
5121- nir/nir_lower_wpos_ytransform: Use the nir_shader_instructions_pass() helper
5122- nir/nir_lower_viewport_transform: Use the nir_shader_instructions_pass() helper
5123- nir/nir_lower_var_copies: Use the nir_shader_instructions_pass() helper
5124- nir/nir_lower_uniforms_to_ubo: Use the nir_shader_instructions_pass() helper
5125- nir/nir_lower_two_sided_color: Use the nir_shader_instructions_pass() helper
5126- nir/nir_lower_to_source_mods: Use the nir_shader_instructions_pass() helper
5127- nir/nir_lower_vec3_to_vec4: Use the nir_shader_instructions_pass() helper
5128- r600: remove unused code
5129- tgsi: delete unused functions
5130- aux: remove unused tgsi includes
5131- d3d12: remove unused tgsi includes
5132- etnaviv: remove unused tgsi includes
5133- freedreno: remove unused tgsi includes
5134- i915: remove unused tgsi includes
5135- llvmpipe: remove unused tgsi includes
5136- nouveau: remove unused tgsi includes
5137- r300: remove unused tgsi includes
5138- r600: remove unused tgsi includes
5139- radeonsi: remove unused tgsi includes
5140- softpipe: remove unused tgsi includes
5141- svga: remove unused tgsi includes
5142- v3d: remove unused tgsi includes
5143- vc4: remove unused tgsi includes
5144- virgl: remove unused tgsi includes
5145- zink: remove unused tgsi includes
5146- lavapipe: remove unused tgsi includes
5147- st: remove unused tgsi includes
5148- r600: tgsi cleanup
5149- tgsi: remove unused functions and structs
5150
5151Thong Thai (11):
5152
5153- gallium/pipe: add min width and min height video cap enums
5154- radeonsi: return min width and min height video cap values
5155- frontends/va: report min width and min height values if available
5156- mesa/main: rework locale setup/teardown
5157- util: check and initialize locale before using it
5158- tgsi: use locale independent float and double parsing
5159- frontends/va/config: add disable packed headers as valid config
5160- frontends/va/context: check min supported resolution when creating
5161- frontends/va/config: check for QVBR support when creating
5162- frontends/va/context: return error if context_id == 0
5163- frontends/va: fix some coverity scan reported issues
5164
5165Tim Pambor (1):
5166
5167- virgl: Fix stack overflow in virgl_bind_sampler_states
5168
5169Timothy Arceri (32):
5170
5171- util: add Pixel Game Maker MV workaround
5172- util: add Jamestown+ workaround
5173- st/glsl: move linking code to the same st file
5174- glsl: call nir_opt_find_array_copies() when linking
5175- glsl: port lower_blend_equation_advanced() to nir
5176- glsl: call nir version of lower_blend_equation_advanced()
5177- glsl: remove old lower_blend_equation_advanced() code
5178- glsl: add some more c wrappers for string_to_uint_map
5179- mesa: add some new constants
5180- glsl: move some compiler code out of st
5181- glsl: move lowering linker code out of st
5182- glsl: port assign location code for VS inputs or FS outputs
5183- glsl: call assign_attribute_or_color_locations() in NIR linker
5184- glsl: remove unused buffer objects with packed layout
5185- glsl: remove unused system vars
5186- glsl: drop the dce of global vars from GLSL IR linker
5187- nir/glsl: add nir_var_declared_implicitly enum
5188- glsl: move disable_varying_optimizations_for_sso() to NIR linker
5189- glsl: remove the always_active_io flag from GLSL IR
5190- glsl: inline link_varyings()
5191- glsl: set last_vert_prog in the nir linker
5192- glsl: drop link_invalidate_variable_locations()
5193- glsl: move store_fragdepth_layout() to nir linker
5194- glsl: remove glsl ir optimisation loop from linker
5195- st/glsl: merge link_shader() into st_link_nir()
5196- st/glsl: merge st_link_glsl_to_nir() into st_link_nir()
5197- st/glsl: merge st_glsl_to_ir.cpp with st_glsl_to_nir.cpp
5198- glsl: remove dead varyings before assigning attr locations
5199- glsl: do vs attribute validation in NIR linker
5200- glsl: fix validation of ES vertex attribs
5201- glsl: fix spirv sso validation
5202- util: add radeonsi workaround for Nowhere Patrol
5203
5204Timur Kristóf (61):
5205
5206- radv/amdgpu: Remove unnecessary assertions from chaining.
5207- radv: Disallow IB2 on GFX6 when using draw_indirect_multi.
5208- radv: Use IB BOs (chaining) by default on GFX6.
5209- radv: Chain command buffers on GFX6 in radv_queue.
5210- amd: Rename INDIRECT_BUFFER_CIK to just INDIRECT_BUFFER.
5211- radv: Simplify IB2 workaround.
5212- radv: Remove IB2 workaround from mesh shader draws.
5213- radv: Enable IB2 workaround on all indirect draws.
5214- radv: Fix dword alignment in SDMA buffer copy.
5215- aco: Disallow constant propagation on SOPP and fixed operands.
5216- amd: Add and implement sendmsg_amd intrinsic.
5217- amd: Add and implement gs_wave_id sysval.
5218- amd: Move sendmsg defines to ac_shader_util.
5219- ac/llvm: Clarify arguments of ac_build_sendmsg.
5220- ac/nir: Use sendmsg in legacy GS lowering.
5221- ac/nir: Emit legacy GS DONE signal in NIR.
5222- ac/nir/ngg: Use sendmsg in NGG lowering.
5223- amd: Cleanup old GS intrinsics code.
5224- aco: Don't allow any VALU instruction to write m0.
5225- aco: Initialize vcmpx field in get_cmp_info.
5226- radv/amdgpu: Remove unused extra BO array.
5227- radv/amdgpu: Split radv_amdgpu_get_bo_list to smaller functions.
5228- radv/amdgpu: Pass preambles to get_bo_list.
5229- radv/amdgpu: Use STACK_ARRAY for IB array to reduce stack usage.
5230- radv: Move perf counter CS creation to where it's used.
5231- ac: Use const keyword for some function arguments.
5232- radv: Use const keyword more.
5233- radv: Emit primitive reset index with primitive restart enable.
5234- radv: Compute tess info when emitting patch control points.
5235- radv: Move ignore forced VRS code to more optimal place.
5236- radv: Set last_index_type in radv_before_draw.
5237- radv: Slight refactor to late_scissor_emission.
5238- radv: Move indirect check from index buffer emission to caller.
5239- radv: Move empty dynamic states check to caller.
5240- radv: Clear query dirty flags when flushing them.
5241- radv: Clarify gang submit terminology.
5242- radv: Use RESET_FILTER_CAM for some mesh shading draws.
5243- aco: Mark exec write used when it writes other registers.
5244- radv: Remove primitive reset index from late scissor workaround.
5245- radv: Leave primitive reset index at max on GFX8+.
5246- ac: Add ac_hw_stage enum.
5247- aco: Use ac_hw_stage instead of aco-specific HWStage.
5248- aco: Add hw_stage field to aco_shader_info.
5249- radeonsi: Set aco_shader_info::hw_stage
5250- radv: Set aco_shader_info::hw_stage
5251- aco: Use aco_shader_info::hw_stage instead of guessing.
5252- aco: Remove unneeded stage related info fields.
5253- ac/nir/ngg: Call nir_convert_to_lcssa before divergence analysis.
5254- ac/nir/ngg: Add upper limit to reusable uniforms.
5255- ac/nir/ngg: Follow intrinsic sources when analyzing before culling.
5256- ac/nir/ngg: Follow tex sources when analyzing before culling.
5257- radv: Refactor required subgroup size in pipeline key.
5258- radv: Use required subgroup info for graphics shaders.
5259- radv: Enable required subgroup size on mesh/task.
5260- aco: Add MESA_SHADER_KERNEL to instruction selection setup.
5261- aco: Fix subgroup_id intrinsic on GFX10.3+.
5262- ac/nir: Add done arg to ac_nir_export_position.
5263- ac/nir: Slightly refactor how pos0 exports are added when missing.
5264- ac/nir/ngg: Wait for attribute stores before VS/TES/GS pos0 export.
5265- ac/nir/ngg: Refactor mesh shader primitive export.
5266- ac/nir/ngg: Wait for attribute ring stores in mesh shaders.
5267
5268Tony Wasserka (2):
5269
5270- aco/spill: Use arena allocator for next use distances
5271- aco/spill: Use arena allocator for spills
5272
5273Veerabadhran Gopalakrishnan (2):
5274
5275- radeonsi: return kernel queried video capability for HEVC and JPEG
5276- radeonsi: return kernel queried video capability for HEVC and JPEG
5277
5278Viktoriia Palianytsia (1):
5279
5280- iris,crocus: Add proper way of assigning num_levels value
5281
5282Vinson Lee (10):
5283
5284- r600/sfn: Initialize BlockScheduler member m_chip_family.
5285- freedreno/a6xx: Fix memory leak on error path.
5286- nv50: Fix memory leak in error path
5287- pvr: Fix signed comparison
5288- dzn: Fix qpool->queries_lock double lock
5289- tu: Fix missing unlock
5290- vulkan/wsi: Remove duplicate NULL check
5291- frontends/va: Fix missing unlock
5292- r600/sfn: Remove duplicate assignment
5293- vk/wsi/x11: Remove dead code
5294
5295Vitaliy Triang3l Kuzmin (27):
5296
5297- lavapipe: Fix vk_instance_init vk_error instance use-after-free
5298- radv: Fix vk_instance_init vk_error instance use-after-free
5299- radv: Move most of DB_SHADER_CONTROL to PS, more precise GFX11 blend WA
5300- docs/amd: Document Primitive Ordered Pixel Shading
5301- ac/nir: Support Primitive Ordered Pixel Shading in lower_ps
5302- aco: Support pops_exiting_wave_id PhysReg usage
5303- ac: Define POPS collision wave ID argument SGPR
5304- aco: Add s_wait_event argument bit definitions
5305- aco: Add Primitive Ordered Pixel Shading pseudo-instructions
5306- aco: Skip waitcnt insertion in the discard early exit block
5307- aco: Add Primitive Ordered Pixel Shading scheduling rules
5308- aco: Send MSG_ORDERED_PS_DONE where necessary
5309- aco: Add Primitive Ordered Pixel Shading waitcnt rules
5310- aco: Implement fragment shader interlock intrinsics
5311- radeonsi: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting
5312- radv: Remove unconditional POPS_DRAIN_PS_ON_OVERLAP setting
5313- radv: Detect the use of Primitive Ordered Pixel Shading
5314- radv: Ensure 1x1 shading rate on GFX10.3 with interlock execution mode
5315- radv: Declare POPS collision wave ID shader argument
5316- radv: Enable POPS collision wave ID shader argument
5317- radv: Enable the null export workaround with POPS
5318- radv: Handle Primitive Ordered Pixel Shading in DB_SHADER_CONTROL
5319- ac/gpu_info: Check whether the device has the POPS missed overlap bug
5320- radv: Apply the POPS missed overlap hardware bug workaround
5321- radv: Disable VRS forcing with Primitive Ordered Pixel Shading
5322- zink/ci: Add broken fragment shader interlock test to RADV flakes
5323- radv: Enable VK_EXT_fragment_shader_interlock
5324
5325Víctor Manuel Jáquez Leal (1):
5326
5327- vulkan: complete the usage flags for video layouts
5328
5329Weibin Wu (1):
5330
5331- winsys/gdi: GDI B5G6R5 display target support
5332
5333Xaver Hugl (1):
5334
5335- vulkan wsi: add support for PresentOptionAsyncMayTear
5336
5337Xi Ruoyao (1):
5338
5339- Revert "glx: Remove pointless GLX_INTEL_swap_event paranoia"
5340
5341Yiwei Zhang (46):
5342
5343- radv: respect VK_QUERY_RESULT_WAIT_BIT in GetQueryPoolResults
5344- venus: stop query experimental features
5345- venus: adopt venus protocol release
5346- meson/ci: promote virtio-experimental to virtio
5347- docs: update Virtio-GPU Venus driver page
5348- ci: carry venus-protocol 1.0 release patches in virglrenderer
5349- ci: uprev virglrenderer to drop venus release patches
5350- anv: apply ANV_BO_ALLOC_IMPLICIT_SYNC for external memory
5351- pipe-loader: avoid undefined memcpy behavior
5352- lvp: avoid accessing member of NULL ptr for global entries
5353- venus: bump ring space to 128K
5354- docs/venus: update vtest instructions
5355- radv: fix radv_emit_userdata_vertex for vertex offset -1
5356- venus: silence -Wuninitialized
5357- venus: sync to latest protocol from header v1.3.248
5358- venus: sync protocol for VK_EXT_image_2d_view_of_3d
5359- venus: enable VK_EXT_image_2d_view_of_3d
5360- docs/venus: advertise VK_EXT_image_2d_view_of_3d
5361- venus: temporarily disable VK_EXT_memory_budget
5362- venus: refactor vn_device_memory to track VkMemoryType
5363- venus: handle device memory report requests
5364- venus: emit device memory report for device memory events
5365- venus: enable VK_EXT_device_memory_report
5366- docs: update venus VK_EXT_device_memory_report support
5367- anv: avoid requiring ordered memory planes for explicit import
5368- venus: suballocate feedback slot with feedback buffer alignment
5369- venus: refactor ahb buffer mem type bits cache to be lazy
5370- venus: refactor buffer cache related bits
5371- venus: extend VkBuffer cache to cover concurrent sharing
5372- venus: fix a cmd tmp storage leak
5373- venus: fix leaks from tracked present src images
5374- venus: track pool in cmd and track device in pool
5375- venus: cmd to reuse alloc copy from cmd pool
5376- venus: refactor vn_cmd_add_query_feedback and miscs
5377- venus: cache query batches at cmd pool
5378- venus: refactor query batch handling
5379- venus: recheck valid bit after acquiring lock to init ahb mem type bits
5380- venus: handle query feedback creation failure
5381- venus: ensure consistency of query overflow behavior
5382- venus: add a missing barrier before copying query feedback
5383- turnip: flush cache for dstBuffer in vkCmdCopyQueryPoolResults
5384- lvp: avoid reading immutable sampler from desc write info
5385- venus: fix a cmd builder render_pass state leak across reset
5386- venus: fix cmd state leak across implicit reset
5387- venus: fix a device memory report leak
5388- vulkan/android: add missing AHARDWAREBUFFER_USAGE_GPU_DATA_BUFFER usage
5389
5390Yogesh Mohan Marimuthu (2):
5391
5392- ac/gpu_info: num_cu = 4 and gfx11 enable dcc with retile
5393- ac/gpu_info: rearrange if checks for dcc config
5394
5395Yonggang Luo (121):
5396
5397- loader: Replace usage of mtx_t with simple_mtx_t in loader/loader_dri3_helper.c
5398- v3d: Replace usage of mtx_t with simple_mtx_t in v3d_simulator.c
5399- vc4: Replace usage of mtx_t with simple_mtx_t in vc4/vc4_simulator.c
5400- drm-shim: Replace usage of mtx_t with simple_mtx_t in drm_shim.c
5401- drm: Replace usage of mtx_t with simple_mtx_t in virgl/drm/virgl_drm_winsys.c
5402- drm: Replace usage of mtx_t with simple_mtx_t in drm/radeon_drm_winsys.c
5403- drm: Replace usage of mtx_t with simple_mtx_t in nouveau_drm_winsys.c
5404- hud: Replace usage of mtx_t with simple_mtx_t in hud_cpufreq.c
5405- hud: Replace usage of mtx_t with simple_mtx_t in hud_diskstat.c
5406- hud: Replace usage of mtx_t with simple_mtx_t in hud_nic.c
5407- hud: Replace usage of mtx_t with simple_mtx_t in hud_sensors_temp.c
5408- xlib: Replace usage of mtx_t with simple_mtx_t in xm_api.c
5409- rtasm: Trim trailing spaces and replace tab with 3 space
5410- rtasm: Replace usage of mtx_t with simple_mtx_t in rtasm_execmem.c
5411- nine: Replace usage of mtx_t with simple_mtx_t in nine_lock.c
5412- omx: Replace usage of mtx_t with simple_mtx_t in vid_omx_common.c
5413- vdpau: Replace usage of mtx_t with simple_mtx_t in htab.c
5414- c11: Remove _MTX_INITIALIZER_NP as it's not used anymore
5415- microsoft/compiler: Getting function impl to be consistence with decl in dxil_enums.*
5416- compiler: Getting shader_prim to be PACKED that consistence with pipe_prim_type
5417- compiler: Add SHADER_PRIM_COUNT to be SHADER_PRIM_MAX + 1
5418- compiler: Rename shader_prim to mesa_prim and replace all usage of pipe_prim_type with mesa_prim
5419- docs: Update document about pipe_prim_type with mesa_prim
5420- util: Replace all usage of PIPE_TIMEOUT_INFINITE with OS_TIMEOUT_INFINITE
5421- r300: Replace usage of os_get_process_name with util_get_process_name in r300_chipset.c
5422- virgl: Array cmdline on stack should initialized to 0
5423- virgl: Replace the usage of os_get_process_name with util_get_process_name
5424- compiler: Combine duplicated implementation of is_gl_identifier into glsl_types.h
5425- compiler: Move can_implicitly_convert_to helper to glsl module from glsl_types.h
5426- mesa, compiler: Move gl_texture_index to glsl_types.h
5427- compiler: Remove the need include "util/glheader.h" and "util/ralloc.h" in glsl_types.h
5428- compiler: Remove redundant struct glsl_type in nir_types.h
5429- vulkan: move nir_convert_ycbcr into vulkan runtime
5430- util: Remove redundant type cast in function align64
5431- util: use uint32_t as the parameter of align function
5432- util: Do not use align as variable name
5433- compiler: use align instead glsl_align and remove glsl_align
5434- panfrost: Replace the usage of PIPE_BIND_* with PAN_BIND_*
5435- ac: Replace the usage of pipe_compare_func with compare_func
5436- dri: Replace usage of boolean/TRUE/FALSE with bool/true/false
5437- freedreno: Fixes error: passing argument 1 of ‘pthread_mutex_unlock’ from incompatible pointer type in tu_pipeline.c
5438- wsi: Fixes passing argument 1 of ‘mtx_unlock’ from incompatible pointer type
5439- c11: Improve timespec_get to support TIME_MONOTONIC TIME_ACTIVE TIME_THREAD_ACTIVE TIME_MONOTONIC_RAW
5440- c11: Improve mtx_timedlock to use timespec_get instead of time(NULL)
5441- c11: Implement os_time_get_nano with timespec_get(&ts, TIME_MONOTONIC)
5442- zink: Replace the usage of os_get_process_name with util_get_process_name
5443- dd: Replace the usage of os_get_process_name with util_get_process_name in dd_draw.c
5444- gallium: Remove unused os_process.h in gallium/auxiliary
5445- util: Fixes prototype of threads_timespec_compare
5446- mapi: Fixes check_table.cpp for DrawArraysInstancedARB and DrawElementsInstancedARB
5447- meson: Use consistence disabled/enabled comment for shared-glapi option
5448- mapi: Fixes non-constant-expression cannot be narrowed from type 'unsigned long' to 'unsigned int' in initializer list with clang
5449- meson: Guard the glsl tests that only working when OpenGL ES2 is enabled
5450- draw: Replace usage of boolean/TRUE/FALSE with bool/true/false in draw_pt_vsplit*
5451- draw: Replace usage of ubyte/ushort/uint with uint8_t/uint16_t/uint32_t in draw_pt_vsplit.c
5452- draw: Update the comment and function name to match the type
5453- vtn: Do not assign main_entry_point->impl twice
5454- nir: Add function nir_function_set_impl
5455- hud: Use bool/true/false to replace boolean/TRUE/FALSE in hud/hud_context.c
5456- gallium/draw: Replace the usage of ushort to uint16_t in files that can not found by tools
5457- llvmpipe: altivec.h inclusion in -std=c++98..11 causes bool to be redefined
5458- treewide: replace usage of boolean to bool
5459- treewide: style fixes after replace usage of boolean to bool
5460- treewide: Replace the usage of TRUE/FALSE with true/false
5461- treewide: Replace the usage of ubyte/ushort with uint8_t/uint16_t
5462- treewide: style fixes after replace the usage of ubyte/ushort with uint8_t/uint16_t
5463- util: Merge p_compiler.h into src/util/compiler.h
5464- util: include "util/compiler.h" instead of "pipe/p_compiler.h"
5465- mapi: Fixes compile error with build option "-D shared-glapi=disabled"
5466- mapi: Now _glapi_get_dispatch_table_size always equal to sizeof(struct _glapi_table) / sizeof(void \*)
5467- mapi: Hide OpenGL functions to be exported when shared-glapi is disabled
5468- ci: Testing -D shared-glapi=disabled with debian-clang-release
5469- d3d12: Fixes unused-variable compile error
5470- compiler: set alignment=1 by default for handling empty struct/interface in glsl_types.cpp
5471- util: Add function util_is_power_of_two_nonzero64 in bitscan.h
5472- util: use uint32_t instead of unsigned in bitscan.h
5473- util: Getting align and align64 consistence with ALIGN
5474- util: Replace the usage of redundant u_align_u32 with align and remove u_align_u32
5475- util: Do not use align64 over unsigned int in register_allocate.c
5476- util: sizeof bucket are always 32bit width, use align instead align64
5477- mapi: Style fixes in glapi/glapi_getproc.c
5478- mapi: Merge get_static_proc_address into _glapi_get_proc_address
5479- mapi: Remove dead struct _glapi_function in glapi/glapi_getproc.c
5480- nir: Split macro nir_foreach_function_with_impl out of nir_foreach_function_impl
5481- clang-format: Add nir_foreach_function_with_impl into src/.clang-format
5482- treewide: Switch to use nir_foreach_function_with_impl when possible
5483- clang-format: Add nir_foreach_function_impl into src/.clang-format
5484- gallium/auxiliary: Switch to use nir_foreach_function_impl
5485- asahi: Use nir_foreach_function_impl instead nir_foreach_function in function agx_nir_lower_zs_emit
5486- d3d12: Switch to use nir_foreach_function_impl
5487- glsl: Switch to use nir_foreach_function_impl from nir_foreach_function
5488- glsl: Remove the extra scope in gl_nir_link_uniforms.c
5489- crocus: Switch to use nir_foreach_function_impl
5490- intel/compiler: Switch to use nir_foreach_function_impl
5491- broadcom: replace redefined ALIGN() macro with common util functions
5492- util: Remove redundant defined(_WIN32) in u_string.h
5493- util: Remove redundant #if !defined(XF86_LIBC_H) in u_string.h
5494- nir: Strip the const modifier on nir_function * in nir_foreach_function_with_impl
5495- panfrost: Convert to use nir_foreach_function_with_impl in function midgard_compile_shader_nir
5496- panfrost: Convert to use nir_foreach_function_impl when possible
5497- mesa: Convert to use nir_foreach_function_impl
5498- llvmpipe: Convert to use nir_foreach_function_impl
5499- sfn: Convert to use nir_foreach_function_impl
5500- sfn: indent fixes after switch to use nir_foreach_function_impl
5501- compiler/clc: Switch to use nir_foreach_function_impl in function nir_lower_libclc
5502- dxil: Use nir_remove_non_entrypoints
5503- nir: Update the comment to call nir_remove_non_entrypoints directly
5504- glsl: Use nir_remove_non_entrypoints to simplify the code
5505- radv: Use nir_remove_non_entrypoints in radv_shader.c
5506- nir: Add nir_foreach_function_safe and use it
5507- pvr: Use alignas instead of ALIGN_ATTR and remove ALIGN_ATTR
5508- vc4: Convert to use nir_foreach_function_impl when possible
5509- v3d: Switch to use nir_foreach_function_impl
5510- broadcom: Switch to use nir_foreach_function_impl
5511- radeonsi: Use ALIGN_POT instead ALIGN_TO
5512- etnaviv: Convert to use nir_foreach_function_impl
5513- intel/vulkan: Convert to use nir_foreach_function_impl when possible
5514- iris: Convert to use nir_foreach_function_impl
5515- treewide: Remove all usage of nir_builder_init with nir_builder_create and nir_builder_at
5516- treewide: remove unused nir_builder
5517- nir: Remove nir_builder_init, it's not used anymore
5518
5519Zhang Ning (2):
5520
5521- lima: use u_pipe_screen_lookup_or_create in the renderonly path too
5522- Revert "intel/ci: disable iris-jsl-deqp because it always fails for an AMD MR"
5523
5524Zhang, Jianxun (3):
5525
5526- intel/isl: Fix map between sRGB and linear formats
5527- anv: Support 1MB AUX mapping (MTL)
5528- anv: Remove alignment to aux ratio on size of main surface
5529
5530antonino (29):
5531
5532- zink: don't emulate edgeflags for patches
5533- zink: use correct primitives for passthrough gs with tess
5534- zink: add \`single_sample` to fs key
5535- zink: add to multisample field to \`zink_gfx_pipeline_state`
5536- zink: don't render with multisampling when it is disabled
5537- zink/ci: remove xt_framebuffer_multisample-interpolation fail
5538- zink: fix pv mode lowring index calculation
5539- zink: use ring buffer to preserve last element
5540- zink: fix exit condition on pv emulation loop
5541- zink: fix line strip offsets in pv mode emulation
5542- nir/zink: use sysvals in \`nir_create_passthrough_gs`
5543- zink: fix store subsitution in \`lower_pv_mode_gs_store`
5544- zink: set when pipeline dirty flag when multisample changes
5545- Revert "zink: set when pipeline dirty flag when multisample changes"
5546- Revert "zink/ci: remove xt_framebuffer_multisample-interpolation fail"
5547- Revert "zink: don't render with multisampling when it is disabled"
5548- Revert "zink: add to multisample field to \`zink_gfx_pipeline_state`"
5549- Revert "zink: add \`single_sample` to fs key"
5550- zink: take location_frac into account in pv emulation
5551- nir: use \`nir_variable_clone` in \`nir_create_passthrough_gs`
5552- nir: don't create invalid inputs in \`nir_create_passthrough_gs`
5553- zink: don't replace non generated gs
5554- nir: handle interface blocks in \`copy_vars`
5555- zink: handle interface blocks in \`copy_vars`
5556- nir: make var arrays large enough in \`nir_create_passthrough_gs`
5557- zink: don't create invalid inputs in \`zink_create_quads_emulation_gs`
5558- vulkan/wsi: add \`vk_wsi_force_swapchain_to_current_extent` driconf
5559- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "The Talos Principle"
5560- drirc: enable \`vk_wsi_force_swapchain_to_current_extent` for "Serious Sam Fusion"
5561
5562i509VCB (1):
5563
5564- docs/asahi: Add hardware glossary
5565
5566lorn10 (1):
5567
5568- docs: Update Clover's env variable documentation
5569
5570nihui (1):
5571
5572- panvk: port panvk_logi to vk_logi
5573
5574norablackcat (24):
5575
5576- rusticl: implement cl_khr_pci_bus_info
5577- docs/rusticl: add Contributing section
5578- rusticl/types add ::new for cl_dev_idp_accel_props
5579- rusticl/api: add integer_dot_product api
5580- rusticl/clc add integer_dot_prod feature macros
5581- rusticl/kernel: remove nir_lower_pack pass
5582- rusticl/device: add cl_khr_integer_dot_product ext
5583- rusticl/program: fix clippy cast to the same type
5584- rusticl/types: fix clippy new() not returning Self
5585- rusticl/screen: implement uuid wrapper funcs
5586- rusticl/device: implement cl_khr_device_uuid
5587- rusticl/screen: fix driver_uuid on non x86
5588- rusticl: add cl_khr_create_command_queue
5589- docs/features update opencl extensions add rusticl
5590- docs: rusticl envvars list supported drivers
5591- rusticl/memory: fix clippy errors
5592- gallium: add PIPE_CAP_TIMER_RESOLUTION
5593- llvmpipe/screen: add PIPE_CAP_TIMER_RESOLUTION
5594- sofpipe/screen: add PIPE_CAP_TIMER_RESOLUTION
5595- crocus/screen: add PIPE_CAP_TIMER_RESOLUTION
5596- iris/screen: add PIPE_CAP_TIMER_RESOLUTION
5597- r600/pipe: add PIPE_CAP_TIMER_RESOLUTION
5598- radeonsi/get: add PIPE_CAP_TIMER_RESOLUTION
5599- zink/screen: add PIPE_CAP_TIMER_RESOLUTION
5600
5601timmac-qmc (1):
5602
5603- glsl: fix potential crash with DisableUniformArrayResize
5604
5605xurui (6):
5606
5607- zink: Some return values of malloc should be checked
5608- zink: Use malloc instead of ralloc
5609- zink: Use malloc to allocate libs
5610- zink: Add some printfs when initialization fails
5611- zink: Free the cdt when an error occurs
5612- zink: The result should be assigned a value when returned
注意: 瀏覽 TracBrowser 來幫助您使用儲存庫瀏覽器

© 2024 Oracle Support Privacy / Do Not Sell My Info Terms of Use Trademark Policy Automated Access Etiquette