voxelgpt下载voxelgpt源代码下载

voxelgpt

其他源码

1.0.0

下载

voxelgpt

希望您可以在不编写代码行的情况下搜索图像或视频？是否想通过以简单的英语询问从数据中提取见解？现在可以！？

VoxelGpt是一个五十一个插件，结合了大语言模型（LLMS）和大型多模型模型（LMM）的功能，以及FiftyOne的计算机视觉查询语言，使您能够使用自然语言过滤，分类，分类，切片，切成语义，并提出有关数据的问题。它甚至可以在您的数据集上为您执行计算 - 当然可以通过批准！

现场演示

您可以在gpt.fiftyone.ai上尝试voxelgpt！

功能

VoxelGpt能够处理以下任何类型的查询：

数据集查询
计算查询
五十一个库查询
fiftyone工作区查询
通用机器学习查询

当您向Voxelgpt提出问题时，它将解释您的意图并确定您要问的查询类型。如果不确定Voxelgpt，它将要求您澄清。

查询数据集

voxelgpt_dataset.mp4

VoxelGpt可以处理有关数据集的以下查询类型：

回答有关您的数据集，字段和运行的架构的问题
通过构建和串联查看阶段来创建数据过滤的视图。
在fiftyone应用中设置视图
通过整个数据集执行聚合或在数据集中的视图

您可以要求Voxelpt为您搜索数据集。这是您可以问的一些示例：

告诉我10个随机样品
向我展示高信心假阳性预测
我有多个人的图像吗？
我的图像中包含猫的平均亮度是多少？

在引擎盖下，voxelgpt解释您的查询并将其转换为相应的数据集视图。 VoxelGpt了解数据集的模式，以及评估运行和相似性索引之类的内容。

它还可以自动检查数据集的内容以检索特定实体。

数据模式查询

VoxelGpt可以回答有关数据集，大脑运行，评估运行等方案的问题。这里有一些例子：

我的数据集中有哪些字段？
我有任何评估吗？
我用哪种型号用于相似性索引我的数据集？

对象检测查询

如果您的数据集包含一个或多个fo.Detections字段，则VoxelGpt可以根据边界框的大小（相对和绝对）和检测次数进行过滤或匹配。

将视图限制为大于图像区域一半的地面真相检测
告诉我所有的预测< $ 96^2 $像素
我每个图像的人检测平均数量是多少？

地理位置查询

如果您的数据集具有GeoLocation字段，则可以在数据集上运行地理查询。 VoxelGpt可以执行从位置名称（或文本描述）到(lon, lat)对的地理编码，或定义边界区域的(lon, lat)点的列表。这里有几个例子：

按靠近自由女神像进行排序
向我展示大中央400m之内的样品
过滤到巴黎的图像
我在地狱的厨房里拍了几张图像？

时间查询

如果您的数据集具有Date或DateTime字段，则VoxelGpt可以执行诸如：

周二拍摄的图片过滤器
2023年6月1日之后添加了多少张图像？
向我展示样本，其中event字段在晚上8点至晚上11点之间阅读一天中的时间

聚合

VoxelGpt可以访问FiftyOne的聚合阶段，因此可以执行整个数据集中的count ， mean ，Sum， sum ， std ， min ，MAX， max ， values和DICTICT的聚合，或者在整个数据集中的字段或表达式distinct ，或在数据集中的视图中进行视图。这里有一些例子：

我的图像的平均亮度是什么？
cat标签我有几张图像？
我的预测中confidence领域的标准偏差是什么？

计算查询

VoxelGpt可以在数据集上执行计算，例如：

亮度：使用FiftyOne的图像质量问题插件为数据集中的每个样本分配亮度得分
熵：使用FiftyOne的图像质量问题插件量化数据集中每个示例中的信息量
独特性：使用fiftyone大脑为数据集中的每个样本分配一个唯一性评分
重复：使用fiftyone大脑在数据集中识别和删除重复样本
相似性：在数据集上生成矢量相似性索引，该索引可用于使用fiftyone脑，用于比较数据集中的样本
降低尺寸降低：使用UMAP，PCA或T-SNE使用fiftyone大脑降低每个样品的特征向量的维度，以便可以在2D或3D中可视化它们
聚类：使用KMeans，dbscan和其他聚类算法中的数据集中的群集样品，使用fiftyOne的聚类插件

以下是一些计算查询的示例，您可以询问VoxelGpt：

计算我的数据集上图像的亮度
在我的数据集中为每个图像的唯一性评分
为我的数据集生成相似性索引
使用Kmeans聚集我的数据集
帮助我使用UMAP在2D中可视化数据集

如果您不想允许VoxelGpt运行计算，请设置环境变量：

 export VOXELGPT_ALLOW_COMPUTATIONS=false

您还可以设置VoxelGPT需要在该数据集中的最小数据集大小来要求运行计算的权限：

 export VOXELGPT_APPROVAL_THRESHOLD=1000

默认值是100个样本。

五十一个库查询

Voxelgpt不仅是配对的程序员；它也是一种教育工具。 VoxelGpt可以访问整个FiftyOne文档，以及Voxel51博客上的所有博客文章，以及Voxel51 YouTube频道上的视频中的成绩单。它可以使用所有这些资源来回答与五十一个有关的问题。

以下是一些文档查询的示例，您可以询问VoxelGpt：

如何从FiftyOne动物园加载数据集？
匹配（）阶段做什么？
我可以以可可格式导出数据集吗？
Fiftyone是否有任何可积极学习的插件？

VoxelGpt将提供跨Voxel51文档，博客和YouTube频道中最有用的资源的链接。对于YouTube视频，链接将直接指向最相关的时间戳！

查询您的工作区

Voxelgpt可以回答有关您正在运行的五十一个环境的问题，包括：

您已下载的其他数据集
您已安装的插件和这些插件中的操作员
您的fiftyone配置
您的fiftyone应用程序配置

这是一些工作区查询的示例，您可以询问VoxelGpt：

我有任何可可数据集吗？
我有任何用于识别数据中问题的插件吗？
我的操作员超时设置了什么？

通用机器学习查询

voxelgpt_cv.mp4

最后，Voxelgpt可以回答有关计算机视觉，机器学习和数据科学的一般问题。它可以帮助您了解基本概念并学习如何克服数据质量问题。

以下是一些机器学习查询的示例，您可以询问VoxelGpt：

精度和召回之间有什么区别？
如何在图像中检测到面孔？
我有什么方法可以减少数据集中的冗余？

安装

如果还没有，请安装fiftyone：

pip install fiftyone

您还需要提供一个OpenAI API密钥（创建一个）：

 export OPENAI_API_KEY=XXXXXXXX

要与您的私人Azure部署一起使用，请参阅此处

仅使用应用

如果您只想在fiftyone应用中使用voxelgpt，则可以简单地将其安装为插件：

fiftyone plugins download https://github.com/voxel51/voxelgpt
fiftyone plugins requirements @voxel51/voxelgpt --install

本地使用/开发

如果要直接使用voxelgpt模块或在本地开发项目，则需要克隆存储库：

git clone https://github.com/voxel51/voxelgpt
cd voxelgpt

安装要求：

pip install -r requirements.txt

并通过将其与插件目录相结合：将插件可用于fiftyOne应用程序：

 # Symlinks your clone of voxelgpt into your FiftyOne plugins directory
ln -s " $( pwd ) " " $( fiftyone config plugins_dir ) /voxelgpt "

五十一个团队

是否想在您的FiftyOne团队部署中添加Voxelgpt？你可以！说明在这里。

使用Azure OpenAI部署

您可以通过设置以下环境变量来将VoxelGPT与私有Azure部署一起使用：

 export OPENAI_API_TYPE=azure
export AZURE_OPENAI_ENDPOINT= < azure_endpoint >
export AZURE_OPENAI_KEY= < azure_api_key >

export AZURE_OPENAI_GPT35_DEPLOYMENT_NAME= < gpt35-deployment-name >
export AZURE_OPENAI_GPT4O_DEPLOYMENT_NAME= < gpt4o-deployment-name >
export AZURE_OPENAI_TEXT_EMBEDDING_3_LARGE_DEPLOYMENT_NAME= < embedding-deployment-name >

如果未设置前三个环境变量中的任何一个，则VoxelGpt将默认使用OpenAI API。对于最后三个环境变量，如果找不到任何资源的设置，则VoxelGpt将默认将OpenAI API用于该特定模型。

在应用中使用voxelgpt

您可以通过加载任何数据集来使用fiftyone应用中的voxelgpt：

 import fiftyone as fo
import fiftyone . zoo as foz

dataset = foz . load_zoo_dataset ( "quickstart" )
session = fo . launch_app ( dataset )

然后是：

点击网格上方的OpenAI图标
按“样本”选项卡旁边的+图标并选择voxelgpt

voxelgpt_open.mp4

例如，尝试询问以下问题：

哪些流行的计算机视觉模型架构是什么？
如何在数据集中搜索特定对象？
告诉我预测的飞机

专家提示：使用now关键字将您以前的提示合并为下一个查询的上下文！

您还可以通过按键盘上的~键并从列表中选择Ask VoxelGPT来运行VoxelGpt作为操作员。这将打开一个小型模式，您可以在其中输入查询。

在Python中使用voxelgpt

如果您已经在本地安装，也可以通过Python直接与Voxelgpt进行交互。

互动会议

您可以使用ask_voxelgpt_interactive()启动交互式会话，可以在其中通过input()提示与voxelgpt交谈。导航到VoxelGpt所在的目录：

 import fiftyone as fo
import fiftyone . zoo as foz

from voxelgpt import ask_voxelgpt_interactive

dataset = foz . load_zoo_dataset ( "quickstart" )
session = fo . launch_app ( dataset )

ask_voxelgpt_interactive ( dataset , session = session )

专家提示：使用now关键字将您以前的提示合并为下一个查询的上下文。

像往常一样，您可以通过数据集，文档和一般计算机视觉查询的任何组合提示Voxelgpt。例如，对话可能看起来像：

 You:
    what is the difference between precision and recall?

VoxelGPT:
    Precision and recall are two important metrics used to evaluate the
    performance of a machine learning model in classification tasks. Precision
    measures the proportion of true positive predictions among all positive
    predictions made by the model. Recall, on the other hand, measures the
    proportion of true positive predictions among all actual positive instances
    in the dataset.

    In other words, precision is the ability of the model to correctly identify
    positive instances, while recall is the ability of the model to identify
    all positive instances. A high precision means that the model is making
    very few false positive predictions, while a high recall means that the
    model is able to identify most of the positive instances in the dataset.

    It is important to balance precision and recall when evaluating a model's
    performance. A model with high precision but low recall may be missing
    important positive instances, while a model with high recall but low
    precision may be making too many false positive predictions.

    In the context of the FiftyOne library, precision and recall can be
    computed using the `fiftyone.utils.metrics` module, which provides
    functions for computing various classification metrics. For example, the
    `fiftyone.utils.metrics.binary_classification` function can be used to
    compute precision and recall for binary classification tasks.

You:
    does FiftyOne integrate with Label Studio?

VoxelGPT:
    Yes, FiftyOne integrates with Label Studio. The integration allows you to
    easily upload your data directly from FiftyOne to Label Studio for labeling.
    You can get started with Label Studio through a simple pip install to get a
    local server up and running. FiftyOne provides simple setup instructions
    that you can use to specify the necessary account credentials and server
    endpoint to use. FiftyOne provides an API to create projects, upload data,
    define label schemas, and download annotations using Label Studio, all
    programmatically in Python. All of the following label types are supported
    for image datasets: Classification, Detections, Instance segmentations,
    Polygons and polylines, Keypoints, Scalar fields, and Semantic segmentation.

    Sources:
    - https://docs.voxel51.com/integrations/index.html
    - https://docs.voxel51.com/integrations/labelstudio.html

You:
    show me predicted airplanes

VoxelGPT:
    Identified potential algorithms: evaluation
    No evaluation runs found.
    Identified potential fields: predictions
    Identified potential label classes: {'predictions': ['airplane']}
    Identified potential view stages: ['match', 'filter_labels', 'match_labels', 'exclude_labels', 'filter_field']
    Okay, I'm going to load dataset.filter_labels("predictions",F("label") == "airplane")

You:
    now only show me the first 10 samples

VoxelGPT:
    Identified potential algorithms: evaluation
    No evaluation runs found.
    Identified potential fields: predictions
    Identified potential label classes: {'predictions': ['airplane']}
    Identified potential view stages: ['match', 'limit', 'limit_labels', 'skip', 'sort_by']
    Okay, I'm going to load dataset.match(F("predictions.detections.label").contains("airplane")).limit(10)

You: exit

在交互式模式下，voxelgpt会自动加载其在应用程序中创建的任何视图，您可以通过会话对象访问它们：

 print ( session . view . count ( "predictions.detections" ))

单个查询

您也可以使用ask_voxelgpt()提示带有各个查询的voxelgpt：

 from voxelgpt import ask_voxelgpt

ask_voxelgpt ( "Does FiftyOne integrate with CVAT?" )

 Yes, FiftyOne integrates with CVAT, which is an open-source image and video
annotation tool. You can upload your data directly from FiftyOne to CVAT to add or
edit labels. You can use CVAT either through the hosted server at app.cvat.ai or
through a self-hosted server. In either case, FiftyOne provides simple setup
instructions that you can use to specify the necessary account credentials and
server endpoint to use. The tight integration between FiftyOne and CVAT allows
you to curate and explore datasets in FiftyOne and then send off samples or
existing labels for annotation in CVAT with just one line of code. To use CVAT,
you must create an account on a CVAT server. By default, FiftyOne uses app.cvat.ai.
If you haven’t already, go to app.cvat.ai and create an account now. Another option
is to set up CVAT locally and then configure FiftyOne to use your self-hosted server.
A primary benefit of setting up CVAT locally is that you are limited to 10 tasks and
500MB of data with app.cvat.ai.

Sources:
- https://docs.voxel51.com/integrations/cvat.html#examples
- https://docs.voxel51.com/tutorials/cvat_annotation.html#Annotating-Datasets-with-CVAT
- https://docs.voxel51.com/tutorials/cvat_annotation.html#Setup
- https://docs.voxel51.com/integrations/index.html#fiftyone-integrations

当voxelgpt对您的查询创建视图时，它将返回：

 import fiftyone as fo
import fiftyone . zoo as foz

dataset = foz . load_zoo_dataset ( "quickstart" )

view = ask_voxelgpt ( "show me 10 random samples" , dataset )

 Identified potential view stages: ['match', 'limit', 'skip', 'take', 'sort_by']
Okay, I'm going to load dataset.take(10)

关键字

Voxelgpt经过培训，可以识别某些关键字，以帮助其理解您的意图：

关键词	意义
`show` / `display`	告诉VoxelGpt您要它查询数据集并显示结果
`docs` / `how` / `FiftyOne`	告诉Voxelgpt您希望它查询fiftyone文档。
`now`	使用您的聊天历史记录作为上下文来解释您的下一个查询。例如，如果您问“与人一起向我展示图像”，然后问“现在向我展示10个最独特的图像”，VoxelGpt将了解您想向人们展示10个最独特的图像
`help`	用使用说明打印帮助消息
`reset`	重置对话历史
`exit`	退出互动python会议