目录
输入和输出集成概述
Azure Event Hubs 输入插件允许 Telegraf 从 Azure Event Hubs 和 Azure IoT Hub 消费数据,从而实现来自这些云服务的事件流的高效数据处理和监控。
AWS Timestream Telegraf 插件使用户能够将指标直接发送到 Amazon 的 Timestream 服务,该服务专为时序数据管理而设计。此插件为身份验证、数据组织和保留设置提供了各种配置选项。
集成详情
Azure Event Hubs
此插件充当 Azure Event Hubs 和 Azure IoT Hub 的消费者,允许用户有效地从这些平台摄取数据流。Azure Event Hubs 是一个高度可扩展的数据流平台和事件摄取服务,能够每秒接收和处理数百万个事件,而 Azure IoT Hub 则支持物联网应用中安全的设备到云和云到设备通信。Event Hub 输入插件与这些服务无缝交互,提供可靠的消息消费和流处理能力。主要功能包括消费者组的动态管理、防止数据丢失的消息跟踪以及用于预取计数、用户代理和元数据处理的可自定义设置。此插件旨在支持各种用例,包括实时遥测数据收集、物联网数据处理以及与更广泛的 Azure 生态系统中的各种数据分析和监控工具集成。
AWS Timestream
此插件旨在有效地将指标写入 Amazon 的 Timestream 服务,Timestream 服务是针对物联网和运营应用优化的时序数据库。借助此插件,Telegraf 可以发送从各种来源收集的数据,并支持身份验证、数据组织和保留管理方面的灵活配置。它利用凭证链进行身份验证,允许各种方法,例如 Web 身份、承担的角色和共享配置文件。用户可以定义指标在 Timestream 中的组织方式——是使用单个表还是多个表,以及控制磁存储和内存存储的保留期等方面。一个关键特性是它能够处理多指标记录,从而实现高效的数据摄取并有助于减少多次写入的开销。在错误处理方面,该插件包括用于解决与数据写入期间的 AWS 错误相关的常见问题的机制,例如用于限制的重试逻辑以及根据需要创建表的功能。
配置
Azure Event Hubs
[[inputs.eventhub_consumer]]
## The default behavior is to create a new Event Hub client from environment variables.
## This requires one of the following sets of environment variables to be set:
##
## 1) Expected Environment Variables:
## - "EVENTHUB_CONNECTION_STRING"
##
## 2) Expected Environment Variables:
## - "EVENTHUB_NAMESPACE"
## - "EVENTHUB_NAME"
## - "EVENTHUB_KEY_NAME"
## - "EVENTHUB_KEY_VALUE"
## 3) Expected Environment Variables:
## - "EVENTHUB_NAMESPACE"
## - "EVENTHUB_NAME"
## - "AZURE_TENANT_ID"
## - "AZURE_CLIENT_ID"
## - "AZURE_CLIENT_SECRET"
## Uncommenting the option below will create an Event Hub client based solely on the connection string.
## This can either be the associated environment variable or hard coded directly.
## If this option is uncommented, environment variables will be ignored.
## Connection string should contain EventHubName (EntityPath)
# connection_string = ""
## Set persistence directory to a valid folder to use a file persister instead of an in-memory persister
# persistence_dir = ""
## Change the default consumer group
# consumer_group = ""
## By default the event hub receives all messages present on the broker, alternative modes can be set below.
## The timestamp should be in https://github.com/toml-lang/toml#offset-date-time format (RFC 3339).
## The 3 options below only apply if no valid persister is read from memory or file (e.g. first run).
# from_timestamp =
# latest = true
## Set a custom prefetch count for the receiver(s)
# prefetch_count = 1000
## Add an epoch to the receiver(s)
# epoch = 0
## Change to set a custom user agent, "telegraf" is used by default
# user_agent = "telegraf"
## To consume from a specific partition, set the partition_ids option.
## An empty array will result in receiving from all partitions.
# partition_ids = ["0","1"]
## Max undelivered messages
## This plugin uses tracking metrics, which ensure messages are read to
## outputs before acknowledging them to the original broker to ensure data
## is not lost. This option sets the maximum messages to read from the
## broker that have not been written by an output.
##
## This value needs to be picked with awareness of the agent's
## metric_batch_size value as well. Setting max undelivered messages too high
## can result in a constant stream of data batches to the output. While
## setting it too low may never flush the broker's messages.
# max_undelivered_messages = 1000
## Set either option below to true to use a system property as timestamp.
## You have the choice between EnqueuedTime and IoTHubEnqueuedTime.
## It is recommended to use this setting when the data itself has no timestamp.
# enqueued_time_as_ts = true
# iot_hub_enqueued_time_as_ts = true
## Tags or fields to create from keys present in the application property bag.
## These could for example be set by message enrichments in Azure IoT Hub.
# application_property_tags = []
# application_property_fields = []
## Tag or field name to use for metadata
## By default all metadata is disabled
# sequence_number_field = "SequenceNumber"
# enqueued_time_field = "EnqueuedTime"
# offset_field = "Offset"
# partition_id_tag = "PartitionID"
# partition_key_tag = "PartitionKey"
# iot_hub_device_connection_id_tag = "IoTHubDeviceConnectionID"
# iot_hub_auth_generation_id_tag = "IoTHubAuthGenerationID"
# iot_hub_connection_auth_method_tag = "IoTHubConnectionAuthMethod"
# iot_hub_connection_module_id_tag = "IoTHubConnectionModuleID"
# iot_hub_enqueued_time_field = "IoTHubEnqueuedTime"
## Data format to consume.
## Each data format has its own unique set of configuration options, read
## more about them here:
## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_INPUT.md
data_format = "influx"
AWS Timestream
[[outputs.timestream]]
## Amazon Region
region = "us-east-1"
## Amazon Credentials
## Credentials are loaded in the following order:
## 1) Web identity provider credentials via STS if role_arn and web_identity_token_file are specified
## 2) Assumed credentials via STS if role_arn is specified
## 3) explicit credentials from 'access_key' and 'secret_key'
## 4) shared profile from 'profile'
## 5) environment variables
## 6) shared credentials file
## 7) EC2 Instance Profile
#access_key = ""
#secret_key = ""
#token = ""
#role_arn = ""
#web_identity_token_file = ""
#role_session_name = ""
#profile = ""
#shared_credential_file = ""
## Endpoint to make request against, the correct endpoint is automatically
## determined and this option should only be set if you wish to override the
## default.
## ex: endpoint_url = "http://localhost:8000"
# endpoint_url = ""
## Timestream database where the metrics will be inserted.
## The database must exist prior to starting Telegraf.
database_name = "yourDatabaseNameHere"
## Specifies if the plugin should describe the Timestream database upon starting
## to validate if it has access necessary permissions, connection, etc., as a safety check.
## If the describe operation fails, the plugin will not start
## and therefore the Telegraf agent will not start.
describe_database_on_start = false
## Specifies how the data is organized in Timestream.
## Valid values are: single-table, multi-table.
## When mapping_mode is set to single-table, all of the data is stored in a single table.
## When mapping_mode is set to multi-table, the data is organized and stored in multiple tables.
## The default is multi-table.
mapping_mode = "multi-table"
## Specifies if the plugin should create the table, if the table does not exist.
create_table_if_not_exists = true
## Specifies the Timestream table magnetic store retention period in days.
## Check Timestream documentation for more details.
## NOTE: This property is valid when create_table_if_not_exists = true.
create_table_magnetic_store_retention_period_in_days = 365
## Specifies the Timestream table memory store retention period in hours.
## Check Timestream documentation for more details.
## NOTE: This property is valid when create_table_if_not_exists = true.
create_table_memory_store_retention_period_in_hours = 24
## Specifies how the data is written into Timestream.
## Valid values are: true, false
## When use_multi_measure_records is set to true, all of the tags and fields are stored
## as a single row in a Timestream table.
## When use_multi_measure_record is set to false, Timestream stores each field in a
## separate table row, thereby storing the tags multiple times (once for each field).
## The recommended setting is true.
## The default is false.
use_multi_measure_records = "false"
## Specifies the measure_name to use when sending multi-measure records.
## NOTE: This property is valid when use_multi_measure_records=true and mapping_mode=multi-table
measure_name_for_multi_measure_records = "telegraf_measure"
## Specifies the name of the table to write data into
## NOTE: This property is valid when mapping_mode=single-table.
# single_table_name = ""
## Specifies the name of dimension when all of the data is being stored in a single table
## and the measurement name is transformed into the dimension value
## (see Mapping data from Influx to Timestream for details)
## NOTE: This property is valid when mapping_mode=single-table.
# single_table_dimension_name_for_telegraf_measurement_name = "namespace"
## Only valid and optional if create_table_if_not_exists = true
## Specifies the Timestream table tags.
## Check Timestream documentation for more details
# create_table_tags = { "foo" = "bar", "environment" = "dev"}
## Specify the maximum number of parallel go routines to ingest/write data
## If not specified, defaulted to 1 go routines
max_write_go_routines = 25
## Please see README.md to know how line protocol data is mapped to Timestream
##
输入和输出集成示例
Azure Event Hubs
-
实时物联网设备监控:使用 Azure Event Hubs 插件监控来自传感器和执行器等物联网设备的遥测数据。通过将设备数据流式传输到监控仪表板,组织可以深入了解系统性能、跟踪使用模式并快速响应异常情况。此设置允许对设备进行主动管理,从而提高运营效率并减少停机时间。
-
事件驱动的数据处理工作流:利用此插件根据从 Azure Event Hubs 收到的事件触发数据处理工作流。例如,当新事件到达时,它可以启动数据转换、聚合或存储过程,从而使企业能够更有效地自动化其工作流。此集成增强了响应能力并简化了跨系统的运营。
-
与分析平台集成:实施该插件以将事件数据导入到 Azure Synapse 或 Power BI 等分析平台。通过将实时流数据集成到分析工具中,组织可以执行全面的数据分析、推动商业智能工作并创建信息丰富的交互式可视化效果,从而为决策提供依据。
-
跨平台数据同步:使用 Azure Event Hubs 插件跨不同的系统或平台同步数据流。通过从 Azure Event Hubs 消费数据并将其转发到数据库或云存储等其他系统,组织可以在其整个架构中维护一致且最新的信息,从而实现有凝聚力的数据策略。
AWS Timestream
-
物联网数据指标:使用 Timestream 插件将来自物联网设备的实时指标发送到 Timestream,从而可以快速分析和可视化传感器数据。通过将设备读数组织成时序格式,用户可以跟踪趋势、识别异常并根据设备性能简化运营决策。
-
应用性能监控:将 Timestream 与应用监控工具结合使用,以随时间推移发送有关服务性能的指标。此集成使工程师能够执行应用性能的历史分析,将其与业务指标相关联,并根据随时间推移查看的使用模式优化资源分配。
-
自动化数据存档:配置 Timestream 插件以将数据写入 Timestream,同时管理保留期。此设置可以自动化存档策略,确保根据预定义的标准保留旧数据。这对于合规性和历史分析特别有用,使企业能够以最少的人工干预来维护其数据生命周期。
-
多应用指标聚合:利用 Timestream 插件将来自多个应用的指标聚合到 Timestream 中。通过创建统一的性能指标数据库,组织可以深入了解各种服务,提高系统范围性能的可见性,并促进跨应用故障排除。
反馈
感谢您成为我们社区的一份子!如果您有任何一般性反馈或在这些页面上发现任何错误,我们欢迎并鼓励您提出意见。请在 InfluxDB 社区 Slack 中提交您的反馈。