awesome 3D gaussian splatting下载 - awesome 3D gaussian splatting源代码下载

很棒的 3D 高斯溅射资源

专注于 3D 高斯泼溅的论文和开源资源精选列表，旨在跟上未来几个月预期的研究激增。如果您有任何补充或建议，请随时贡献。也欢迎其他资源，如博客文章、视频等。

添加了 18 篇论文：Z-Splat、Dual-Camera、StylizedGS、Hash3D、Revisiting Densification、Gaussian Pancakes、3D-aware Deformable Gaussians、SpikeNVS、零样本 PC 完成、SplatPose、DreamScene360、RealmDreamer、Gaussian-ILC、Reinforcement Learning with GGS 、GoMAvatar、OccGaussian、 LoopGaussian，回顾

2024 年 4 月 11 日

LatentSplat代码发布

2024 年 4 月 9 日

添加了 1 篇论文：EgoLifter

2024 年 4 月 8 日

添加了 3 篇论文：Robust Gaussian Splatting、SC4D 和 MM-Gaussian

2024 年 4 月 5 日

添加了 5 篇论文：Surface Reconstruction、TCLC-GS、GaSpCT、OmniGS 和 Per-Gaussian Embedding，
修复

2024 年 4 月 2 日

添加了 11 篇论文：HO、SGD、HGS、Snap-it、InstantSplat、3DGSR、MM3DGS、HAHA、CityGaussain、Mirror-3DGS 和 Feature Splatting

2024 年 3 月 30 日

添加了 8 篇论文：建模不确定性、GRM、Gamba、CoherentGS、TOGS、SA-GS 和 GaussianCube

2024 年 3 月 27 日

添加了其他实现：360-gaussian-splatting
添加了 CVPR '24 标签
添加了 5 篇论文：Comp4D、DreamPolisher、DN-Splatter、2D GS 和 Octree-GS

2024 年 3 月 26 日

添加了 13 篇论文：latentSplat、GS on the Move、RadSplat、Mini-Splatting、SyncTweedies、HAC、STAG4D、EndoGSLAM、Pixel-GS、Semantic Gaussians、Gaussian in the Wild、CG-SLAM 和 GSDF

2024 年 3 月 24 日：

添加纸张：高斯蒙砂

2024 年 3 月 20 日：

添加了 4 篇论文：GVGEN、HUGS、RGBD GS-ICP SLAM 和 High-Fidelity SLAM

2024 年 3 月 19 日：

添加了点面
添加了原作者的 3DGS 教程
添加了GauStudio
添加了 23 篇论文：Touch-GS、GGRt、FDGaussian、SWAG、Den-SOFT、Gaussian-Flow、View-Consistent 3D Editing、BAGS、GeoGaussian、GS-Pose、Analytic-Splatting、Seamless 3D Maps、Texture-GS、Recent Advances 3DGS、用于密集视觉 SLAM 的紧凑型 3DGS、BrightDreamer、3DGS-Reloc、Beyond不确定性、运动感知 3DGS、Fed3DGS、GaussNav、3DGS-Calib 和 NEDS-SLAM

2024 年 3 月 17 日：

更新 3DGS.cpp 的存储库名称和链接（最初为 VulkanSplatting）

2024 年 3 月 16 日：

斯普拉特电视
添加了 6 篇论文：GaussianGrasper、新的分割算法、Controllable Text-to-3D Generation、Spring-Mass 3DGS、Hyper-3DGS 和 DreamScene

2024 年 3 月 14 日：

添加了 6 篇论文：SemGauss、StyleGaussian、Gaussian Splatting in Style、GaussCtrl、GaussianImage 和 RAIN-GS

2024 年 3 月 8 日：

教程：如何捕获 3DGS 图像
添加了 6 篇论文：SplattingAvatar、DNGaussian、Radiative Gaussians、BAGS、GSEdit 和 ManiGaussian

2024 年 3 月 8 日：

添加了 3DGStream 查看器

2024 年 3 月 6 日：

添加了 1 篇论文：Splat-Nav

2024 年 3 月 5 日：

添加了 1 篇论文：3DGStream
代码发布
添加了新查看器

2024 年 3 月 2 日：

添加了 1 篇论文：动画和纹理的 3D 高斯模型
新部分：同时教授 3DGS 的课程。

2024 年 2 月 28 日：

广大高斯

2024 年 2 月 27 日：

添加了 2 篇论文：Spec-Gaussian 和 GEA
SC-GS 代码发布

2024 年 2 月 24 日：

添加了 2 篇论文：识别不必要的高斯和 Gaussian Pro

2024 年 2 月 23 日：

更正了 EndoGS 的作者并更新了摘要：利用高斯溅射进行可变形内窥镜组织重建

2024 年 2 月 21 日：

添加了一篇论文：重塑 SLAM：一项调查

2024 年 2 月 20 日：

GaussianObject代码发布
添加了一篇论文：GaussianHair

2024 年 2 月 19 日：

添加了博客文章：NeRFs 与 3DGS。

2024 年 2 月 16 日：

添加了 2 篇论文：IM-3D 和 GES
GaMeS代码发布

2024 年 2 月 14 日：

添加了查看器：VulkanSplatting - C++ 和 Vulkan Compute 中的跨平台高性能 3DGS 渲染器

2024 年 2 月 13 日：

代码发布：（2024 年 1 月 16 日）使用 4D 高斯泼溅进行实时真实感动态场景表示和渲染
添加了 3 篇论文：3DGala、ImplicitDeepFake 和 3D Gaussians as a New Vision Era。

2024 年 2 月 9 日：

添加了 1 篇论文：HeadStudio

2024 年 2 月 8 日：

添加了 3 篇论文：Rig3DGS、Mesh-based GS 和 LGM 2024 年 2 月 6 日：
添加了 2 篇论文：SGS-SLAM 和 4D Gaussian Splatting

2024 年 2 月 5 日：

将 SWAGS 移至动力学和变形部分
添加了 2 篇论文：GaussianObject 和 GaMeSh
GS++ 更名为最佳投影

2024 年 2 月 2 日：

添加了 6 篇论文：VR-GS、Segment Anything、Gaussian Splashing、GS++、360-GS 和 StopThePop
TRIPS 代码发布

2024 年 1 月 30 日：

代码更改：GaussianAvatars 代码更改为私有

2024 年 1 月 29 日：

添加了 2 篇论文：LIV-GaussMap 和 TIP-Editor

2024 年 1 月 26 日：

删除撤回论文：用于高保真人体运动合成的可动画 3D 高斯
添加了 3 篇论文：EndoGaussians、PSAvatar 和 GauU-Scene

2024 年 1 月 25 日：

添加了查看器：Splatapult - C++ 和 OpenGL 中的 3d 高斯喷射渲染器，可与 OpenXR 配合使用以实现联机 VR

2024 年 1 月 24 日：

添加实用程序：SideFX Houdini 的 GSOP（高斯 Splat 运算符）
代码发布：GaussianAvatars

2024 年 1 月 23 日：

添加了 3 篇论文：Amortized Gen3D、Deformable Endooscopy Tissues、Fastdynamic 3D Object Generation
代码发布：动画化身、压缩 3D 高斯、GaussianAvatar

2024 年 1 月 13 日：

添加了 4 篇论文：CoSSegGaussians、TRIPS、Gaussian Shadow Casting for Neural Characters 和 DISTWAR

2024 年 1 月 9 日：

新增 1 篇论文：A Survey on 3D Gaussian Splatting（第一次调查）

2024 年 1 月 8 日：

添加了 4 篇论文：SWAGS（添加了 2023 年的论文，我之前忘记添加了）、第一篇评论论文、压缩的 3DGS 以及表征卫星几何的应用论文。

2024 年 1 月 7 日：

1 开源实现：taichi-splatting - 工作最初源自 Taichi 3D Gaussian Splatting，并进行了重大的重新组织和更改。

2024 年 1 月 5 日：

添加了 3 篇论文：FMGS、PEGASUS 和 Repaint123。

2024 年 1 月 2 日：

添加了 1 篇论文：街头高斯。

2024 年 1 月 2 日：

更新了去模糊高斯论文链接。
SAGA代码发布。
添加了 2023 年的 2 篇论文：Text2Immersion 和 2D-Guided 3DG Segmentation。
gsplat lib 的数学补充。
在类别中添加年份。
GSM 代码发布。

2023 年 12 月 29 日：

添加了 1 篇论文（显然之前漏掉了一篇）：Gaussian-Head-Avatar。
添加了博客文章头像。

2023 年 12 月 29 日：

添加了 3 篇论文：DreamGaussian4D、4DGen 和 Spacetime Gaussian。

2023 年 12 月 27 日：

添加了 3 篇论文：LangSplat、Deformable 3DGS 和 Human101。
添加了博客文章：3DGS 的综合回顾。

2023 年 12 月 25 日：

发布了单目/多视图动态场景代码的高效 3D 高斯表示。
GPS-高斯代码发布。

2023 年 12 月 24 日：

添加了 2 篇论文：自组织高斯网格和高斯分裂。
添加了用于增强高斯渲染以建模更复杂场景的存储库。

2023 年 12 月 21 日：

添加了 3 篇论文：Splatter Image、pixelSplat 和align your gaussians。
高斯分组代码发布。

2023 年 12 月 19 日：

添加了 2 篇论文：GAvatar 和 GauFRe。

2023 年 12 月 18 日：

添加了实用程序：SpectacularAI - 不同 3DGS 约定的转换脚本。
SuGaR 代码发布。

2023 年 12 月 16 日：

添加了 WebGL 查看器 3：Gauzilla。

2023 年 12 月 15 日：

添加了 4 篇论文：DrivingGaussian、iComMa、Triplane 和 3DGS-Avatar。
Relightable 高斯代码发布。

2023 年 12 月 13 日：

添加了 5 篇论文：Gaussian-SLAM、CoGS、ASH、CF-GS 和 Photo-SLAM。

2023 年 12 月 11 日：

添加了 2 篇论文：Gaussian Splatting SLAM 和 3D Generation 的去噪分数。
ScaffoldGS 代码已发布。

2023 年 12 月 8 日：

添加了 2 篇论文：EAGLES 和 MonoGaussianAvatar。

2023 年 12 月 7 日：

LucidDreamer 代码已发布。
添加了 9 篇论文：GauHuman、HeadGaS、HiFi4G、Gaussian-Flow、Feature-3DGS、Gaussian-Avatar、FlashAvatar、Relightable 和 Deblurring Gaussians。

2023 年 12 月 5 日：

添加了 9 篇论文：NeuSG、GaussianHead、GaussianAvatars、GPS-Gaussian、用于单眼非刚性对象重建的神经参数高斯、SplaTAM、MANUS、Segment Any 和语言嵌入 3D 高斯。

2023 年 12 月 4 日：

添加了 8 篇论文：Gaussian Grouping、MD Splatting、DynMF、Scaffold-GS、SparseGS、FSGS、Control4D 和 SC-GS。

2023 年 12 月 1 日：

添加了 4 篇论文：Compact3D、GaussianShader、Periodic Vibration Gaussian 和 Gaussian Shell Maps for Efficient 3D Human Generation。
为每个类别创建了目录并添加了换行符。

2023 年 11 月 30 日：

添加了虚幻游戏引擎实现。
添加了 5 篇论文：LightGaussian、FisherRF、HUGS、HumanGaussian、CG3D 和 Multi Scale 3DGS。

2023 年 11 月 29 日：

添加了两篇论文：Point and Move 和 IR-GS。

2023 年 11 月 28 日：

添加了五篇论文：GaussinEditor、Relightable Gaussians、GART、Mip-Splatting、HumanGaussian。

2023 年 11 月 27 日：

添加了两篇论文：Gaussian Editing 和 Compact 3D Gaussians。

2023 年 11 月 25 日：

添加了可动画高斯项目（论文尚未发布）。

2023 年 11 月 22 日：

添加了 3 篇新的 GS 论文：Animatable、Depth-Regularized 和单目/多视图 3DGS。
添加了一些经典论文。
添加了另一篇 GS 论文，也称为 LucidDreamer。

2023 年 11 月 21 日：

添加了 3 篇新的 GS 论文：GaussianDiffusion、LucidDreamer、PhysGaussian。
新增 2 篇 GS 论文：SuGaR、PhysGaussian。

2023 年 11 月 21 日：

添加论文GS-SLAM

2023 年 11 月 17 日：

将 PlayCanvas 实现添加到游戏引擎部分。

2023 年 11 月 16 日：

发布可变形 3D 高斯代码。
添加了可驾驶的 3D 高斯头像纸。

2023 年 11 月 8 日：

关于 3DGS 实现和 unsive/rsal 格式讨论的一些注释。

2023 年 11 月 4 日：

添加了 2D 高斯泼溅。
添加了非常详细的（技术）博客文章，解释 3D 高斯泼溅。

2023 年 10 月 28 日：

添加了实用程序部分。
添加了 3DGS 转换器，用于在 Cloud Compare to Utilities 中编辑 3DGS .ply 文件。
添加了 Kapture（用于捆绑器到 colmap 模型转换）和 Kapture 图像裁剪器脚本，以及实用程序的转换说明。

2023 年 10 月 23 日：

添加了 python WebGL 查看器 2。
添加了高斯泼溅（和 Unity 查看器）视频博客的介绍。

2023 年 10 月 21 日：

添加了 python OpenGL 查看器。
添加了 typescript WebGPU 查看器。

2023 年 10 月 20 日：

使摘要可读（删除连字符）。
添加了 Windows 教程。
其他小的文本修复。
添加了 Jupyter 笔记本查看器。

2023 年 10 月 19 日：

添加了用于实时真实感动态场景表示的 Github 页面链接。
重新排列标题。
添加了其他非官方实现。
将 Nerfstudio gsplat 和 fast: C++/CUDA 移至非官方实现。
添加了 Nerfstudio、Blender、WebRTC、iOS 和 Metal 查看器。

2023 年 10 月 17 日：

GaussianDreamer 代码发布。
添加了实时真实感动态场景表示。

2023 年 10 月 16 日：

添加了可变形 3D 高斯纸。
动态 3D 高斯代码发布。 2023 年 10 月 15 日：包含前 6 篇论文的初始列表。

介绍 3D 高斯分布的开创性论文：

用于实时辐射场渲染的 3D 高斯喷射

作者：Bernhard Kerbl、Georgios Kopanas、Thomas Leimkühler、George Drettakis

抽象的

辐射场方法最近彻底改变了用多张照片或视频捕获的场景的新颖视图合成。然而，实现高视觉质量仍然需要训练和渲染成本高昂的神经网络，而最近更快的方法不可避免地会牺牲速度来换取质量。对于无界且完整的场景（而不是孤立的物体）和1080p分辨率渲染，当前没有方法可以实现实时显示速率。我们引入了三个关键要素，使我们能够在保持有竞争力的训练时间的同时实现最先进的视觉质量，并且重要的是允许在 1080p 分辨率下进行高质量实时（≥ 30 fps）新视图合成。首先，从相机校准期间产生的稀疏点开始，我们用 3D 高斯表示场景，保留连续体积辐射场的所需属性以进行场景优化，同时避免在空白空间中进行不必要的计算；其次，我们对 3D 高斯进行交错优化/密度控制，特别是优化各向异性协方差以实现场景的准确表示；第三，我们开发了一种快速可见性感知渲染算法，该算法支持各向异性泼溅，既加速训练又允许实时渲染。我们在几个已建立的数据集上展示了最先进的视觉质量和实时渲染。

3D 物体检测

2024年

1. 3DGS-DET：通过边界引导和框聚焦采样增强 3D 高斯泼溅，以实现 3D 物体检测

作者：曹阳、吉元良、徐丹

抽象的

神经辐射场 (NeRF) 广泛用于新颖视图合成，并已适用于 3D 对象检测 (3DOD)，为通过视图合成表示进行 3D 对象检测提供了一种有前途的方法。然而，NeRF 面临着固有的局限性：(i) 由于其隐式性质，它对 3DOD 的表示能力有限；(ii) 渲染速度慢。最近，3D 高斯分布 (3DGS) 作为一种显式 3D 表示形式出现，它通过更快的渲染功能解决了这些限制。受这些优点的启发，本文首次将 3DGS 引入 3DOD，确定了两个主要挑战：（i）高斯斑点的空间分布不明确 - 3DGS 主要依赖于 2D 像素级监督，导致高斯斑点的 3D 空间分布不清晰物体和背景的区分度差，阻碍了 3DOD； (ii) 过多的背景斑点——2D 图像通常包含大量背景像素，导致密集重建的 3DGS 中含有许多代表背景的噪声高斯斑点，对检测产生负面影响。为了应对挑战 (i)，我们利用 3DGS 重建源自 2D 图像的事实，并通过结合 2D 边界引导提出了一种优雅而有效的解决方案，以显着增强高斯斑点的空间分布，从而使物体和物体之间的区分更加清晰。他们的背景（见图1）。为了解决挑战 (ii)，我们提出了一种以框为中心的采样策略，使用 2D 框生成 3D 空间中的对象概率分布，从而允许在 3D 中进行有效的概率采样以保留更多对象斑点并减少嘈杂的背景斑点。受益于所提出的边界引导和框聚焦采样，我们的最终方法 3DGS-DET 比我们的基本管道版本实现了显着改进（[email protected] 上 +5.6，[email protected] 上 +3.7），而无需引入任何额外的可学习参数。此外，3DGS-DET 显着优于最先进的基于 NeRF 的方法 NeRF-Det，在 ScanNet 数据集的 [email protected] 上实现了 +6.6 的改进，在 [email protected] 上实现了 +8.1 的改进，并且在 ScanNet 数据集上实现了令人印象深刻的 +31.5 的改进。 ARKITScenes 数据集的 [email protected]。代码和模型可公开获取：https://github.com/yangcaoai/3DGS-DET。

？纸|代码（还没有）

自动驾驶：

Despite recent advancements in high-fidelity human reconstruction techniques, the requirements for densely captured images or time-consuming per-instance optimization significantly hinder their applications in broader scenarios. To tackle these issues, we present HumanSplat that predicts the 3D Gaussian Splatting properties of any human from a single input image in a generalizable manner. In particular, HumanSplat comprises a 2D multi-view diffusion model and a latent reconstruction transformer with human structure priors that adeptly integrate geometric priors and semantic features within a unified framework. A hierarchical loss that incorporates human semantic information is further designed to achieve high-fidelity texture modeling and better constrain the estimated multiple views. Comprehensive experiments on standard benchmarks and in-the-wild images demonstrate that HumanSplat surpasses existing state-of-the-art methods in achieving photorealistic novel-view synthesis. Project page: https://humansplat.github.io/.

？纸|项目页面

Classic work:

1. A Generalization of Algebraic Surface Drawing

Authors : James F. Blinn

Comment: : First paper rendering 3D gaussians.

抽象的

The mathematical description of three-dimensional surfaces usually falls into one of two classifications: parametric and implicit. An implicit surface is defined to be all points which satisfy some equation F (x, y, z) = 0. This form is ideally suited for image space shaded picture drawing; the pixel coordinates are substituted for x and y, and the equation is solved for z. Algorithms for drawing such objects have been developed primarily for first- and second-order polynomial functions, a subcategory known as algebraic surfaces. This paper presents a new algorithm applicable to other functional forms, in particular to the summation of several Gaussian density distributions. The algorithm was created to model electron density maps of molecular structures, but it can be used for other artistically interesting shapes.

？纸

2. Approximate Differentiable Rendering with Algebraic Surfaces

Authors : Leonid Keselman and Martial Hebert

Comment: : First paper to do differentiable rendering optimization of 3D gaussians.

抽象的

Differentiable renderers provide a direct mathematical link between an object's 3D representation and images of that object. In this work, we develop an approximate differentiable renderer for a compact, interpretable representation, which we call Fuzzy Metaballs. Our approximate renderer focuses on rendering shapes via depth maps and silhouettes. It sacrifices fidelity for utility, producing fast runtimes and high-quality gradient information that can be used to solve vision tasks. Compared to mesh-based differentiable renderers, our method has forward passes that are 5x faster and backwards passes that are 30x faster. The depth maps and silhouette images generated by our method are smooth and defined everywhere. In our evaluation of differentiable renderers for pose estimation, we show that our method is the only one comparable to classic techniques. In shape from silhouette, our method performs well using only gradient descent and a per-pixel loss, without any surrogate losses or regularization. These reconstructions work well even on natural video sequences with segmentation artifacts.

？纸|项目页面|代码| ？ Short Presentation

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

Authors : Jan U. Müller, Michael Weinmann, Reinhard Klein

Comment: Builds 2D screen-space gaussians from underlying 3D representations.

抽象的

We propose an efficient and GPU-accelerated sampling framework which enables unbiased gradient approximation for differentiable point cloud rendering based on surface splatting. Our framework models the contribution of a point to the rendered image as a probability distribution. We derive an unbiased approximative gradient for the rendering function within this model. To efficiently evaluate the proposed sample estimate, we introduce a tree-based data-structure which employs multi-pole methods to draw samples in near linear time. Our gradient estimator allows us to avoid regularization required by previous methods, leading to a more faithful shape recovery from images. Furthermore, we validate that these improvements are applicable to real-world applications by refining the camera poses and point cloud obtained from a real-time SLAM system. Finally, employing our framework in a neural rendering setting optimizes both the point cloud and network parameters, highlighting the framework's ability to enhance data driven approaches.

？纸质代码

4. Generating and Real-Time Rendering of Clouds

Authors : Petr Man

Comment: Splatting of anisotropic gaussians. Basically a non-differentiable implementation of 3DGS.

抽象的

This paper presents a method for generation and real-time rendering of static clouds. Perlin noise function generates three dimensional map of a cloud. We also present a twopass rendering algorithm that performs physically based approximation. In the first preprocessed phase it computes multiple forward scattering. In the second phase first order anisotropic scattering at runtime is evaluated. The generated map is stored as voxels and is unsuitable for the real-time rendering. We introduce a more suitable inner representation of cloud that approximates the original map and contains much less information. The cloud is then represented by a set of metaballs (spheres) with parameters such as center positions, radii and density values. The main contribution of this paper is to propose a method, that transforms the original cloud map to the inner representation. This method uses the Radial Basis Function (RBF) neural network.

？纸

压缩：

3D Gaussian Splatting has recently emerged as a highly promising technique for modeling of static 3D scenes. In contrast to Neural Radiance Fields, it utilizes efficient rasterization allowing for very fast rendering at high-quality. However, the storage size is significantly higher, which hinders practical deployment, eg on resource constrained devices. In this paper, we introduce a compact scene representation organizing the parameters of 3D Gaussian Splatting (3DGS) into a 2D grid with local homogeneity, ensuring a drastic reduction in storage requirements without compromising visual quality during rendering. Central to our idea is the explicit exploitation of perceptual redundancies present in natural scenes. In essence, the inherent nature of a scene allows for numerous permutations of Gaussian parameters to equivalently represent it. To this end, we propose a novel highly parallel algorithm that regularly arranges the high-dimensional Gaussian parameters into a 2D grid while preserving their neighborhood structure. During training, we further enforce local smoothness between the sorted parameters in the grid. The uncompressed Gaussians use the same structure as 3DGS, ensuring a seamless integration with established renderers. Our method achieves a reduction factor of 17x to 42x in size for complex scenes with no increase in training time, marking a substantial leap forward in the domain of 3D scene distribution and consumption.

？纸|项目页面|代码

扩散：

2024 年：

1. AGG: Amortized Generative 3D Gaussians for Single Image to 3D

Authors : Dejia Xu, Ye Yuan, Morteza Mardani, Sifei Liu, Jiaming Song, Zhangyang Wang, Arash Vahdat

抽象的

Given the growing need for automatic 3D content creation pipelines, various 3D representations have been studied to generate 3D objects from a single image. Due to its superior rendering efficiency, 3D Gaussian splatting-based models have recently excelled in both 3D reconstruction and generation. 3D Gaussian splatting approaches for image to 3D generation are often optimization-based, requiring many computationally expensive score-distillation steps. To overcome these challenges, we introduce an Amortized Generative 3D Gaussian framework (AGG) that instantly produces 3D Gaussians from a single image, eliminating the need for per-instance optimization. Utilizing an intermediate hybrid representation, AGG decomposes the generation of 3D Gaussian locations and other appearance attributes for joint optimization. Moreover, we propose a cascaded pipeline that first generates a coarse representation of the 3D data and later upsamples it with a 3D Gaussian super-resolution module. Our method is evaluated against existing optimization-based 3D Gaussian frameworks and sampling-based pipelines utilizing other 3D representations, where AGG showcases competitive generation abilities both qualitatively and quantitatively while being several orders of magnitude faster.

？纸| Project Page| ？ Short Presentation

2. Fast Dynamic 3D Object Generation from a Single-view Video

Authors : Zijie Pan, Zeyu Yang, Xiatian Zhu, Li Zhang

抽象的

Generating dynamic three-dimensional (3D) object from a single-view video is challenging due to the lack of 4D labeled data. Existing methods extend text-to-3D pipelines by transferring off-the-shelf image generation models such as score distillation sampling, but they are slow and expensive to scale (eg, 150 minutes per object) due to the need for back-propagating the information-limited supervision signals through a large pretrained model. To address this limitation, we propose an efficient video-to-4D object generation framework called Efficient4D. It generates high-quality spacetime-consistent images under different camera views, and then uses them as labeled data to directly train a novel 4D Gaussian splatting model with explicit point cloud geometry, enabling real-time rendering under continuous camera trajectories. Extensive experiments on synthetic and real videos show that Efficient4D offers a remarkable 10-fold increase in speed when compared to prior art alternatives while preserving the same level of innovative view synthesis quality. For example, Efficient4D takes only 14 minutes to model a dynamic object.

？纸|项目页面|代码| ？ Short Presentation

3. GaussianObject: Just Taking Four Images to Get A High-Quality 3D Object with Gaussian Splatting

Authors : Chen Yang, Sikuang Li, Jiemin Fang, Ruofan Liang, Lingxi Xie, Xiaopeng Zhang, Wei Shen, Qi Tian

抽象的

Reconstructing and rendering 3D objects from highly sparse views is of critical importance for promoting applications of 3D vision techniques and improving user experience. However, images from sparse views only contain very limited 3D information, leading to two significant challenges: 1) Difficulty in building multi-view consistency as images for matching are too few; 2）由于视图覆盖不足，部分省略或高度压缩对象信息。 To tackle these challenges, we propose GaussianObject, a framework to represent and render the 3D object with Gaussian splatting, that achieves high rendering quality with only 4 input images. We first introduce techniques of visual hull and floater elimination which explicitly inject structure priors into the initial optimization process for helping build multi-view consistency, yielding a coarse 3D Gaussian representation.然后，我们基于扩散模型构建高斯修复模型来补充遗漏的对象信息，其中高斯被进一步细化。我们设计了一种自生成策略来获取图像对来训练修复模型。 Our GaussianObject is evaluated on several challenging datasets, including MipNeRF360, OmniObject3D, and OpenIllumination, achieving strong reconstruction results from only 4 views and significantly outperforming previous state-of-the-art methods.

？纸|项目页面|代码| ？ Short Presentation

Authors : Heng Yu, Chaoyang Wang, Peiye Zhuang, Willi Menapace, Aliaksandr Siarohin, Junli Cao, Laszlo A Jeni, Sergey Tulyakov, Hsin-Ying Lee

Authors : Junwu Zhang, Zhenyu Tang, Yatian Pang, Xinhua Cheng, Peng Jin, Yida Wei, Munan Ning, Li Yuan

抽象的

Recent one image to 3D generation methods commonly adopt Score Distillation Sampling (SDS). Despite the impressive results, there are multiple deficiencies including multi-view inconsistency, over-saturated and over-smoothed textures, as well as the slow generation speed. To address these deficiencies, we present Repaint123 to alleviate multi-view bias as well as texture degradation and speed up the generation process. The core idea is to combine the powerful image generation capability of the 2D diffusion model and the texture alignment ability of the repainting strategy for generating high-quality multi-view images with consistency. We further propose visibility-aware adaptive repainting strength for overlap regions to enhance the generated image quality in the repainting process. The generated high-quality and multi-view consistent images enable the use of simple Mean Square Error (MSE) loss for fast 3D content generation. We conduct extensive experiments and show that our method has a superior ability to generate high-quality 3D content with multi-view consistency and fine textures in 2 minutes from scratch.

？纸|项目页面| Code (not yet)

Dynamics and Deformation:

Recently, 3D Gaussian, as an explicit 3D representation method, has demonstrated strong competitiveness over NeRF (Neural Radiance Fields) in terms of expressing complex scenes and training duration. These advantages signal a wide range of applications for 3D Gaussians in 3D understanding and editing. Meanwhile, the segmentation of 3D Gaussians is still in its infancy. The existing segmentation methods are not only cumbersome but also incapable of segmenting multiple objects simultaneously in a short amount of time. In response, this paper introduces a 3D Gaussian segmentation method implemented with 2D segmentation as supervision. This approach uses input 2D segmentation maps to guide the learning of the added 3D Gaussian semantic information, while nearest neighbor clustering and statistical filtering refine the segmentation results. Experiments show that our concise method can achieve comparable performances on mIOU and mAcc for multi-object segmentation as previous single-object segmentation methods.

？纸

Language Embedding:

3D Gaussian Splatting (3DGS) has recently revolutionized radiance field reconstruction, achieving high quality novel view synthesis and fast rendering speed without baking. However, 3DGS fails to accurately represent surfaces due to the multi-view inconsistent nature of 3D Gaussians. We present 2D Gaussian Splatting (2DGS), a novel approach to model and reconstruct geometrically accurate radiance fields from multi-view images. Our key idea is to collapse the 3D volume into a set of 2D oriented planar Gaussian disks. Unlike 3D Gaussians, 2D Gaussians provide view-consistent geometry while modeling surfaces intrinsically. To accurately recover thin surfaces and achieve stable optimization, we introduce a perspective-accurate 2D splatting process utilizing ray-splat intersection and rasterization. Additionally, we incorporate depth distortion and normal consistency terms to further enhance the quality of the reconstructions. We demonstrate that our differentiable renderer allows for noise-free and detailed geometry reconstruction while maintaining competitive appearance quality, fast training speed, and real-time rendering.

1. [CVPR '24] PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics

Authors : Tianyi Xie, Zeshun Zong, Yuxin Qiu, Xuan Li, Yutao Feng, Yin Yang, Chenfanfu Jiang

抽象的

We introduce PhysGaussian, a new method that seamlessly integrates physically grounded Newtonian dynamics within 3D Gaussians to achieve high-quality novel motion synthesis. Employing a custom Material Point Method (MPM), our approach enriches 3D Gaussian kernels with physically meaningful kinematic deformation and mechanical stress attributes, all evolved in line with continuum mechanics principles. A defining characteristic of our method is the seamless integration between physical simulation and visual rendering: both components utilize the same 3D Gaussian kernels as their discrete representations. This negates the necessity for triangle/tetrahedron meshing, marching cubes, "cage meshes," or any other geometry embedding, highlighting the principle of "what you see is what you simulate (WS2)." Our method demonstrates exceptional versatility across a wide variety of materials--including elastic entities, metals, non-Newtonian fluids, and granular materials--showcasing its strong capabilities in creating diverse visual content with novel viewpoints and movements.

？纸|项目页面|代码| ？ Short Presentation

2. [CVPR '24] SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering

Authors : Antoine Guédon, Vincent Lepetit

抽象的

We propose a method to allow precise and extremely fast mesh extraction from 3D Gaussian Splatting. Gaussian Splatting 最近变得非常流行，因为它可以产生逼真的渲染，同时训练速度比 NeRF 快得多。 It is however challenging to extract a mesh from the millions of tiny 3D gaussians as these gaussians tend to be unorganized after optimization and no method has been proposed so far. Our first key contribution is a regularization term that encourages the gaussians to align well with the surface of the scene.然后，我们介绍一种方法，利用这种对齐方式对场景真实表面上的样本点进行对齐，并使用泊松重建从高斯中提取网格，与通常应用于从神经 SDF 中提取网格。 Finally, we introduce an optional refinement strategy that binds gaussians to the surface of the mesh, and jointly optimizes these Gaussians and the mesh through Gaussian splatting rendering. This enables easy editing, sculpting, rigging, animating, compositing and relighting of the Gaussians using traditional softwares by manipulating the mesh instead of the gaussians themselves. Retrieving such an editable mesh for realistic rendering is done within minutes with our method, compared to hours with the state-of-the-art methods on neural SDFs, while providing a better rendering quality.

？ Paper |项目页面|代码| ？ Short Presentation

3. NeuSG: Neural Implicit Surface Reconstruction with 3D Gaussian Splatting Guidance

Authors : Hanlin Chen, Chen Li, Gim Hee Lee

抽象的

Existing neural implicit surface reconstruction methods have achieved impressive performance in multi-view 3D reconstruction by leveraging explicit geometry priors such as depth maps or point clouds as regularization. However, the reconstruction results still lack fine details because of the over-smoothed depth map or sparse point cloud. In this work, we propose a neural implicit surface reconstruction pipeline with guidance from 3D Gaussian Splatting to recover highly detailed surfaces. The advantage of 3D Gaussian Splatting is that it can generate dense point clouds with detailed structure. Nonetheless, a naive adoption of 3D Gaussian Splatting can fail since the generated points are the centers of 3D Gaussians that do not necessarily lie on the surface. We thus introduce a scale regularizer to pull the centers close to the surface by enforcing the 3D Gaussians to be extremely thin. Moreover, we propose to refine the point cloud from 3D Gaussians Splatting with the normal priors from the surface predicted by neural implicit models instead of using a fixed set of points as guidance. Consequently, the quality of surface reconstruction improves from the guidance of the more accurate 3D Gaussian splatting. By jointly optimizing the 3D Gaussian Splatting and the neural implicit model, our approach benefits from both representations and generates complete surfaces with intricate details. Experiments on Tanks and Temples verify the effectiveness of our proposed method.

？纸

杂项：

In this paper, we address the limitations of Adaptive Density Control (ADC) in 3D Gaussian Splatting (3DGS), a scene representation method achieving high-quality, photorealistic results for novel view synthesis. ADC has been introduced for automatic 3D point primitive management, controlling densification and pruning, however, with certain limitations in the densification logic. Our main contribution is a more principled, pixel-error driven formulation for density control in 3DGS, leveraging an auxiliary, per-pixel error function as the criterion for densification. We further introduce a mechanism to control the total number of primitives generated per scene and correct a bias in the current opacity handling strategy of ADC during cloning operations. Our approach leads to consistent quality improvements across a variety of benchmark scenes, without sacrificing the method's efficiency.

？纸

2023 年：

1. [CVPRW '24] Depth-Regularized Optimization for 3D Gaussian Splatting in Few-Shot Images

Authors : Jaeyoung Chung, Jeongtaek Oh, Kyoung Mu Lee

抽象的

In this paper, we present a method to optimize Gaussian splatting with a limited number of images while avoiding overfitting. Representing a 3D scene by combining numerous Gaussian splats has yielded outstanding visual quality. However, it tends to overfit the training views when only a small number of images are available. To address this issue, we introduce a dense depth map as a geometry guide to mitigate overfitting. We obtained the depth map using a pre-trained monocular depth estimation model and aligning the scale and offset using sparse COLMAP feature points. The adjusted depth aids in the color-based optimization of 3D Gaussian splatting, mitigating floating artifacts, and ensuring adherence to geometric constraints. We verify the proposed method on the NeRF-LLFF dataset with varying numbers of few images. Our approach demonstrates robust geometry compared to the original method that relies solely on images.

？ Paper |项目页面|代码

2. EAGLES: Efficient Accelerated 3D Gaussians with Lightweight EncodingS

Authors : Sharath Girish, Kamal Gupta, Abhinav Shrivastava

抽象的

Recently, 3D Gaussian splatting (3D-GS) has gained popularity in novel-view scene synthesis. It addresses the challenges of lengthy training times and slow rendering speeds associated with Neural Radiance Fields (NeRFs). Through rapid, differentiable rasterization of 3D Gaussians, 3D-GS achieves real-time rendering and accelerated training. They, however, demand substantial memory resources for both training and storage, as they require millions of Gaussians in their point cloud representation for each scene. We present a technique utilizing quantized embeddings to significantly reduce memory storage requirements and a coarse-to-fine training strategy for a faster and more stable optimization of the Gaussian point clouds. Our approach results in scene representations with fewer Gaussians and quantized representations, leading to faster training times and rendering speeds for real-time rendering of high resolution scenes. We reduce memory by more than an order of magnitude all while maintaining the reconstruction quality. We validate the effectiveness of our approach on a variety of datasets and scenes preserving the visual quality while consuming 10-20x less memory and faster training/inference speed.

？ Paper |项目页面|代码

3. [CVPR '24] COLMAP-Free 3D Gaussian Splatting

Authors : Yang Fu, Sifei Liu, Amey Kulkarni, Jan Kautz, Alexei A. Efros, Xiaolong Wang

抽象的

While neural rendering has led to impressive advances in scene reconstruction and novel view synthesis, it relies heavily on accurately pre-computed camera poses. To relax this constraint, multiple efforts have been made to train Neural Radiance Fields (NeRFs) without pre-processed camera poses. However, the implicit representations of NeRFs provide extra challenges to optimize the 3D structure and camera poses at the same time. On the other hand, the recently proposed 3D Gaussian Splatting provides new opportunities given its explicit point cloud representations. This paper leverages both the explicit geometric representation and the continuity of the input video stream to perform novel view synthesis without any SfM preprocessing. We process the input frames in a sequential manner and progressively grow the 3D Gaussians set by taking one input frame at a time, without the need to pre-compute the camera poses. Our method significantly improves over previous approaches in view synthesis and camera pose estimation under large motion changes.

？ Paper |项目页面| Code (not yet) | ？ Short Presentation

4. iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching

Authors : Yuan Sun, Xuan Wang, Yunfan Zhang, Jie Zhang, Caigui Jiang, Yu Guo, Fei Wang

抽象的

We present a method named iComMa to address the 6D pose estimation problem in computer vision. The conventional pose estimation methods typically rely on the target's CAD model or necessitate specific network training tailored to particular object classes. Some existing methods address mesh-free 6D pose estimation by employing the inversion of a Neural Radiance Field (NeRF), aiming to overcome the aforementioned constraints. However, it still suffers from adverse initializations. By contrast, we model the pose estimation as the problem of inverting the 3D Gaussian Splatting (3DGS) with both the comparing and matching loss. In detail, a render-and-compare strategy is adopted for the precise estimation of poses. Additionally, a matching module is designed to enhance the model's robustness against adverse initializations by minimizing the distances between 2D keypoints. This framework systematically incorporates the distinctive characteristics and inherent rationale of render-and-compare and matching-based approaches. This comprehensive consideration equips the framework to effectively address a broader range of intricate and challenging scenarios, including instances with substantial angular deviations, all while maintaining a high level of prediction accuracy. Experimental results demonstrate the superior precision and robustness of our proposed jointly optimized framework when evaluated on synthetic and complex real-world data in challenging scenarios.

？纸|代码

渲染：

抽象的

Image-based 3D reconstruction is a challenging task that involves inferring the 3D shape of an object or scene from a set of input images. Learning-based methods have gained attention for their ability to directly estimate 3D shapes. This review paper focuses on state-of-the-art techniques for 3D reconstruction, including the generation of novel, unseen views. An overview of recent developments in the Gaussian Splatting method is provided, covering input types, model structures, output representations, and training strategies. Unresolved challenges and future directions are also discussed. Given the rapid progress in this domain and the numerous opportunities for enhancing 3D reconstruction methods, a comprehensive examination of algorithms appears essential. Consequently, this study offers a thorough overview of the latest advancements in Gaussian Splatting.

？纸

满贯：

The integration of neural rendering and the SLAM system recently showed promising results in joint localization and photorealistic view reconstruction. However, existing methods, fully relying on implicit representations, are so resource-hungry that they cannot run on portable devices, which deviates from the original intention of SLAM. In this paper, we present Photo-SLAM, a novel SLAM framework with a hyper primitives map. Specifically, we simultaneously exploit explicit geometric features for localization and learn implicit photometric features to represent the texture information of the observed environment. In addition to actively densifying hyper primitives based on geometric features, we further introduce a Gaussian-Pyramid-based training method to progressively learn multi-level features, enhancing photorealistic mapping performance. The extensive experiments with monocular, stereo, and RGB-D datasets prove that our proposed system Photo-SLAM significantly outperforms current state-of-the-art SLAM systems for online photorealistic mapping, eg, PSNR is 30% higher and rendering speed is hundreds of times faster in the Replica dataset. Moreover, the Photo-SLAM can run at real-time speed using an embedded platform such as Jetson AGX Orin, showing the potential of robotics applications.

？ Paper | Project Page |代码

疏：

抽象的

Novel View Synthesis (NVS) for street scenes play a critical role in the autonomous driving simulation. The current mainstream technique to achieve it is neural rendering, such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting (3DGS). Although thrilling progress has been made, when handling street scenes, current methods struggle to maintain rendering quality at the viewpoint that deviates significantly from the training viewpoints. This issue stems from the sparse training views captured by a fixed camera on a moving vehicle. To tackle this problem, we propose a novel approach that enhances the capacity of 3DGS by leveraging prior from a Diffusion Model along with complementary multi-modal data. Specifically, we first fine-tune a Diffusion Model by adding images from adjacent frames as condition, meanwhile exploiting depth data from LiDAR point clouds to supply additional spatial information. Then we apply the Diffusion Model to regularize the 3DGS at unseen views during training. Experimental results validate the effectiveness of our method compared with current state-of-the-art models, and demonstrate its advance in rendering images from broader views.

？纸

7. BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting

Authors : Wugang Meng, Tianfu Wu, Huan Yin, Fumin Zhang

抽象的

图像目标导航使机器人能够使用视觉提示进行引导，到达捕获目标图像的位置。然而，当前的方法要么严重依赖数据和计算成本昂贵的基于学习的方法，要么由于探索策略不足而在复杂环境中缺乏效率。 To address these limitations, we propose

展开

awesome 3D gaussian splatting

很棒的 3D 高斯溅射资源

目录

介绍 3D 高斯分布的开创性论文：

用于实时辐射场渲染的 3D 高斯喷射

3D 物体检测

2024年

1. 3DGS-DET：通过边界引导和框聚焦采样增强 3D 高斯泼溅，以实现 3D 物体检测

自动驾驶：

2024 年：

1. 用于动态城市场景建模的街道高斯

2. TCLC-GS：用于周围自动驾驶场景的紧耦合激光雷达相机高斯泼溅

3. OmniRe：全方位城市场景重建

2023 年：

1. [CVPR '24] DrivingGaussian：用于周围动态自动驾驶场景的复合高斯泼溅

2. [CVPR '24] HUGS：通过高斯泼溅理解整体城市 3D 场景

头像：

2024 年：

1. GaussianBody：通过 3d 高斯泼溅重建穿着衣服的人体

2. PSAvatar：基于点的可变形形状模型，用于通过 3D 高斯泼溅创建实时头部头像

3. Rig3DGS：从休闲单目视频创建可控肖像

4. HeadStudio：使用 3D 高斯泼溅将文本发送到可动画化的头部头像

5. ImplicitDeepfake：使用 NeRF 和高斯泼溅通过隐式 Deepfake 生成进行合理的换脸

6. GaussianHair：使用光感知高斯进行头发建模和渲染

7. GVA：从单目视频重建生动的 3D 高斯头像

8。[CVPR '24] Splattingavatar：现实的实时人体化身，带网状的高斯裂口

9. splatface：高斯splat脸重建利用优化的表面

10。哈哈：高度铰接的高斯人化身，带有纹理网状

11。[CVPRW '24] 3D感知生成的对抗网络的高斯脱离解码器

12. Gomavatar：使用网格高斯的单眼视频从单眼视频中有效的动画人类建模

13。occgaussian：3d高斯分裂以遮挡人类渲染

14。[CVPR '24]猜测看不见的：Dynamic 3D场景重建局部2D瞥见

15。[Neurips '24]可推广和动画的高斯头像

16。[siggraph asia'24]双重：鲁棒的双高斯分裂以沉浸式人体以人为中心的体积视频

17。[Siggraph Asia'24] V^3：通过流式2D动态高斯观看手机上的体积视频

2023 年：

1。可驱动的3D高斯化身

2. splatarmor：来自单眼RGB视频的动画人类的铰接高斯碎片

3。[CVPR '24]动画高斯人：学习姿势依赖的高斯地图

4。[CVPR '24] GART：高斯铰接模板模型

5。[CVPR '24]人类高斯碎片：动画化身的实时渲染

6。[CVPR '24]拥抱：人类高斯碎片

7。[CVPR '24]高斯壳图3D人类一代

8。高斯黑德：具有可学习高斯派生的高保真头像

9。[CVPR '24]高斯瓦塔塔：逼真的头像带有3D高斯

10。[CVPR '24] GPS-GAUSSIAN：可推广的像素3D高斯分裂，用于实时人类小说综合

11。高曼：从单眼人类视频中铰接的高斯分裂

12。

13。[CVPR '24] HIFI4G：高保真人类的性能通过紧凑的高斯脱落

14。[CVPR '24] Gaussianavatar：通过动画3D高斯人从单个视频中迈向现实的人类头像建模

15。[CVPR '24] Flashavatar：高保真头像具有高效的高斯嵌入

16. [CVPR '24] Relightable Gaussian Codec Avatars

17. MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar

18. [CVPR '24] ASH: Animatable Gaussian Splats for Efficient and Photoreal Human Rendering

19. [CVPR '24] 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting

20. [CVPR '24] GAvatar: Animatable 3D Gaussian Avatars with Implicit Mesh Learning

21. Deformable 3D Gaussian Splatting for Animatable Human Avatars

22. Human101: Training 100+FPS Human Gaussians in 100s from 1 View

23. [CVPR '24] Gaussian Head Avatar: Ultra High-fidelity Head Avatar via Dynamic Gaussians

24. HumanSplat: Generalizable Single-Image Human Gaussian Splatting with Structure Priors

Classic work:

1. A Generalization of Algebraic Surface Drawing

2. Approximate Differentiable Rendering with Algebraic Surfaces

3. Unbiased Gradient Estimation for Differentiable Surface Splatting via Poisson Sampling

4. Generating and Real-Time Rendering of Clouds

压缩：

2024 年：

1. [I3D '24] Reducing the Memory Footprint of 3D Gaussian Splatting

2. [CVPR '24] Compressed 3D Gaussian Splatting for Accelerated Novel View Synthesis

3. HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression

4. [ECCV '24] End-to-End Rate-Distortion Optimized 3D Gaussian Representation

5. 3DGS.zip: A survey on 3D Gaussian Splatting Compression Methods

6. LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming

7. Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation

2023 年：

1. LightGaussian: Unbounded 3D Gaussian Compression with 15x Reduction and 200+ FPS

2. Compact3D: Compressing Gaussian Splat Radiance Field Models with Vector Quantization

3. [CVPR '24] Compact 3D Gaussian Representation for Radiance Field

4. [ECCV '24] Compact 3D Scene Representation via Self-Organizing Gaussian Grids

扩散：