所有集成 / Kafka 和 OpenSearch 集成

Kafka 和 OpenSearch 集成

强大的性能和简单的集成，由 InfluxData 构建的开源数据连接器 Telegraf 提供支持。

免费获取此集成免费获取此集成

50 亿+

Telegraf 下载量

时间序列数据库
来源：DB Engines

10 亿+

InfluxDB 下载量

2,800+

贡献者

实用链接

文档

Telegraf 快速入门培训

使用 Telegraf 进行基础设施监控

强大的性能，无限的扩展能力

收集、组织和处理海量高速数据。当您将任何数据视为时间序列数据时，它都更有价值。使用 InfluxDB，这是排名第一的时间序列平台，旨在与 Telegraf 一起扩展。

查看入门方法

输入和输出集成概述

此插件允许您从 Kafka 主题实时收集指标，从而增强 Telegraf 设置中的数据监控和收集能力。

OpenSearch 输出插件允许用户使用 HTTP 将指标直接发送到 OpenSearch 实例，从而促进 OpenSearch 生态系统内有效的数据管理和分析。

集成详情

Kafka

Kafka Telegraf 插件旨在从 Kafka 主题读取数据，并使用支持的输入数据格式创建指标。作为服务输入插件，它持续监听传入的指标和事件，这与以固定间隔运行的标准输入插件不同。此特定插件可以使用各种 Kafka 版本的功能，并能够使用 SASL 等配置的安全性凭据，以及使用消息偏移和消费者组的消息处理选项，从指定主题消费消息。此插件的灵活性使其能够处理各种消息格式和用例，使其成为依赖 Kafka 进行数据摄取的应用程序的宝贵资产。

OpenSearch

OpenSearch Telegraf 插件通过 HTTP 与 OpenSearch 数据库集成，从而可以简化指标的收集和存储。作为一个专为 OpenSearch 2.x 版本设计的强大工具，该插件在提供强大功能的同时，还通过原始 Elasticsearch 插件与 1.x 版本兼容。此插件有助于在 OpenSearch 中创建和管理索引，自动管理模板并确保数据结构化，以提高分析效率。该插件支持各种配置选项，例如索引名称、身份验证、运行状况检查和值处理，使其能够根据不同的操作要求进行定制。其功能使其对于希望利用 OpenSearch 的强大功能进行指标存储和查询的组织至关重要。

配置

Kafka


[[inputs.kafka_consumer]]
              ## Kafka brokers.
              brokers = ["localhost:9092"]

              ## Set the minimal supported Kafka version. Should be a string contains
              ## 4 digits in case if it is 0 version and 3 digits for versions starting
              ## from 1.0.0 separated by dot. This setting enables the use of new
              ## Kafka features and APIs.  Must be 0.10.2.0(used as default) or greater.
              ## Please, check the list of supported versions at
              ## https://pkg.go.dev/github.com/Shopify/sarama#SupportedVersions
              ##   ex: kafka_version = "2.6.0"
              ##   ex: kafka_version = "0.10.2.0"
              # kafka_version = "0.10.2.0"

              ## Topics to consume.
              topics = ["telegraf"]

              ## Topic regular expressions to consume.  Matches will be added to topics.
              ## Example: topic_regexps = [ "*test", "metric[0-9A-z]*" ]
              # topic_regexps = [ ]

              ## When set this tag will be added to all metrics with the topic as the value.
              # topic_tag = ""

              ## The list of Kafka message headers that should be pass as metric tags
              ## works only for Kafka version 0.11+, on lower versions the message headers
              ## are not available
              # msg_headers_as_tags = []

              ## The name of kafka message header which value should override the metric name.
              ## In case when the same header specified in current option and in msg_headers_as_tags
              ## option, it will be excluded from the msg_headers_as_tags list.
              # msg_header_as_metric_name = ""

              ## Set metric(s) timestamp using the given source.
              ## Available options are:
              ##   metric -- do not modify the metric timestamp
              ##   inner  -- use the inner message timestamp (Kafka v0.10+)
              ##   outer  -- use the outer (compressed) block timestamp (Kafka v0.10+)
              # timestamp_source = "metric"

              ## Optional Client id
              # client_id = "Telegraf"

              ## Optional TLS Config
              # enable_tls = false
              # tls_ca = "/etc/telegraf/ca.pem"
              # tls_cert = "/etc/telegraf/cert.pem"
              # tls_key = "/etc/telegraf/key.pem"
              ## Use TLS but skip chain & host verification
              # insecure_skip_verify = false

              ## Period between keep alive probes.
              ## Defaults to the OS configuration if not specified or zero.
              # keep_alive_period = "15s"

              ## SASL authentication credentials.  These settings should typically be used
              ## with TLS encryption enabled
              # sasl_username = "kafka"
              # sasl_password = "secret"

              ## Optional SASL:
              ## one of: OAUTHBEARER, PLAIN, SCRAM-SHA-256, SCRAM-SHA-512, GSSAPI
              ## (defaults to PLAIN)
              # sasl_mechanism = ""

              ## used if sasl_mechanism is GSSAPI
              # sasl_gssapi_service_name = ""
              # ## One of: KRB5_USER_AUTH and KRB5_KEYTAB_AUTH
              # sasl_gssapi_auth_type = "KRB5_USER_AUTH"
              # sasl_gssapi_kerberos_config_path = "/"
              # sasl_gssapi_realm = "realm"
              # sasl_gssapi_key_tab_path = ""
              # sasl_gssapi_disable_pafxfast = false

              ## used if sasl_mechanism is OAUTHBEARER
              # sasl_access_token = ""

              ## SASL protocol version.  When connecting to Azure EventHub set to 0.
              # sasl_version = 1

              # Disable Kafka metadata full fetch
              # metadata_full = false

              ## Name of the consumer group.
              # consumer_group = "telegraf_metrics_consumers"

              ## Compression codec represents the various compression codecs recognized by
              ## Kafka in messages.
              ##  0 : None
              ##  1 : Gzip
              ##  2 : Snappy
              ##  3 : LZ4
              ##  4 : ZSTD
              # compression_codec = 0
              ## Initial offset position; one of "oldest" or "newest".
              # offset = "oldest"

              ## Consumer group partition assignment strategy; one of "range", "roundrobin" or "sticky".
              # balance_strategy = "range"

              ## Maximum number of retries for metadata operations including
              ## connecting. Sets Sarama library's Metadata.Retry.Max config value. If 0 or
              ## unset, use the Sarama default of 3,
              # metadata_retry_max = 0

              ## Type of retry backoff. Valid options: "constant", "exponential"
              # metadata_retry_type = "constant"

              ## Amount of time to wait before retrying. When metadata_retry_type is
              ## "constant", each retry is delayed this amount. When "exponential", the
              ## first retry is delayed this amount, and subsequent delays are doubled. If 0
              ## or unset, use the Sarama default of 250 ms
              # metadata_retry_backoff = 0

              ## Maximum amount of time to wait before retrying when metadata_retry_type is
              ## "exponential". Ignored for other retry types. If 0, there is no backoff
              ## limit.
              # metadata_retry_max_duration = 0

              ## When set to true, this turns each bootstrap broker address into a set of
              ## IPs, then does a reverse lookup on each one to get its canonical hostname.
              ## This list of hostnames then replaces the original address list.
              ## resolve_canonical_bootstrap_servers_only = false

              ## Strategy for making connection to kafka brokers. Valid options: "startup",
              ## "defer". If set to "defer" the plugin is allowed to start before making a
              ## connection. This is useful if the broker may be down when telegraf is
              ## started, but if there are any typos in the broker setting, they will cause
              ## connection failures without warning at startup
              # connection_strategy = "startup"

              ## Maximum length of a message to consume, in bytes (default 0/unlimited);
              ## larger messages are dropped
              max_message_len = 1000000

              ## Max undelivered messages
              ## This plugin uses tracking metrics, which ensure messages are read to
              ## outputs before acknowledging them to the original broker to ensure data
              ## is not lost. This option sets the maximum messages to read from the
              ## broker that have not been written by an output.
              ##
              ## This value needs to be picked with awareness of the agent's
              ## metric_batch_size value as well. Setting max undelivered messages too high
              ## can result in a constant stream of data batches to the output. While
              ## setting it too low may never flush the broker's messages.
              # max_undelivered_messages = 1000

              ## Maximum amount of time the consumer should take to process messages. If
              ## the debug log prints messages from sarama about 'abandoning subscription
              ## to [topic] because consuming was taking too long', increase this value to
              ## longer than the time taken by the output plugin(s).
              ##
              ## Note that the effective timeout could be between 'max_processing_time' and
              ## '2 * max_processing_time'.
              # max_processing_time = "100ms"

              ## The default number of message bytes to fetch from the broker in each
              ## request (default 1MB). This should be larger than the majority of
              ## your messages, or else the consumer will spend a lot of time
              ## negotiating sizes and not actually consuming. Similar to the JVM's
              ## `fetch.message.max.bytes`.
              # consumer_fetch_default = "1MB"

              ## Data format to consume.
              ## Each data format has its own unique set of configuration options, read
              ## more about them here:
              ## https://github.com/influxdata/telegraf/blob/master/docs/DATA_FORMATS_INPUT.md
              data_format = "influx"

OpenSearch

[[outputs.opensearch]]
  ## URLs
  ## The full HTTP endpoint URL for your OpenSearch instance. Multiple URLs can
  ## be specified as part of the same cluster, but only one URLs is used to
  ## write during each interval.
  urls = ["http://node1.os.example.com:9200"]

  ## Index Name
  ## Target index name for metrics (OpenSearch will create if it not exists).
  ## This is a Golang template (see https://pkg.go.dev/text/template)
  ## You can also specify
  ## metric name (`{{.Name}}`), tag value (`{{.Tag "tag_name"}}`), field value (`{{.Field "field_name"}}`)
  ## If the tag does not exist, the default tag value will be empty string "".
  ## the timestamp (`{{.Time.Format "xxxxxxxxx"}}`).
  ## For example: "telegraf-{{.Time.Format \"2006-01-02\"}}-{{.Tag \"host\"}}" would set it to telegraf-2023-07-27-HostName
  index_name = ""

  ## Timeout
  ## OpenSearch client timeout
  # timeout = "5s"

  ## Sniffer
  ## Set to true to ask OpenSearch a list of all cluster nodes,
  ## thus it is not necessary to list all nodes in the urls config option
  # enable_sniffer = false

  ## GZIP Compression
  ## Set to true to enable gzip compression
  # enable_gzip = false

  ## Health Check Interval
  ## Set the interval to check if the OpenSearch nodes are available
  ## Setting to "0s" will disable the health check (not recommended in production)
  # health_check_interval = "10s"

  ## Set the timeout for periodic health checks.
  # health_check_timeout = "1s"
  ## HTTP basic authentication details.
  # username = ""
  # password = ""
  ## HTTP bearer token authentication details
  # auth_bearer_token = ""

  ## Optional TLS Config
  ## Set to true/false to enforce TLS being enabled/disabled. If not set,
  ## enable TLS only if any of the other options are specified.
  # tls_enable =
  ## Trusted root certificates for server
  # tls_ca = "/path/to/cafile"
  ## Used for TLS client certificate authentication
  # tls_cert = "/path/to/certfile"
  ## Used for TLS client certificate authentication
  # tls_key = "/path/to/keyfile"
  ## Send the specified TLS server name via SNI
  # tls_server_name = "kubernetes.example.com"
  ## Use TLS but skip chain & host verification
  # insecure_skip_verify = false

  ## Template Config
  ## Manage templates
  ## Set to true if you want telegraf to manage its index template.
  ## If enabled it will create a recommended index template for telegraf indexes
  # manage_template = true

  ## Template Name
  ## The template name used for telegraf indexes
  # template_name = "telegraf"

  ## Overwrite Templates
  ## Set to true if you want telegraf to overwrite an existing template
  # overwrite_template = false

  ## Document ID
  ## If set to true a unique ID hash will be sent as
  ## sha256(concat(timestamp,measurement,series-hash)) string. It will enable
  ## data resend and update metric points avoiding duplicated metrics with
  ## different id's
  # force_document_id = false

  ## Value Handling
  ## Specifies the handling of NaN and Inf values.
  ## This option can have the following values:
  ##    none    -- do not modify field-values (default); will produce an error
  ##               if NaNs or infs are encountered
  ##    drop    -- drop fields containing NaNs or infs
  ##    replace -- replace with the value in "float_replacement_value" (default: 0.0)
  ##               NaNs and inf will be replaced with the given number, -inf with the negative of that number
  # float_handling = "none"
  # float_replacement_value = 0.0

  ## Pipeline Config
  ## To use a ingest pipeline, set this to the name of the pipeline you want to use.
  # use_pipeline = "my_pipeline"

  ## Pipeline Name
  ## Additionally, you can specify a tag name using the notation (`{{.Tag "tag_name"}}`)
  ## which will be used as the pipeline name (e.g. "{{.Tag \"os_pipeline\"}}").
  ## If the tag does not exist, the default pipeline will be used as the pipeline.
  ## If no default pipeline is set, no pipeline is used for the metric.
  # default_pipeline = ""

输入和输出集成示例

Kafka

实时数据处理：使用 Kafka 插件将来自 Kafka 主题的实时数据馈送到监控系统。这对于需要即时反馈性能指标或用户活动的应用尤其有用，使企业能够更快地对其环境中的变化条件做出反应。
动态指标收集：利用此插件根据 Kafka 中发生的事件动态调整正在捕获的指标。例如，通过与其他服务集成，用户可以让插件即时重新配置自身，确保始终根据业务或应用程序的需求收集相关指标。
集中式日志记录和监控：实施集中式日志记录系统，使用 Kafka Consumer Plugin 将来自多个服务的日志聚合到统一的监控仪表板中。此设置可以帮助识别不同服务之间的问题，并提高整体系统可观测性和故障排除能力。
异常检测系统：将 Kafka 与机器学习算法结合使用，进行实时异常检测。通过不断分析流式数据，此设置可以自动识别异常模式，触发警报并更有效地缓解潜在问题。

OpenSearch

时间序列数据的动态索引：利用 OpenSearch Telegraf 插件为时间序列指标动态创建索引，确保数据以有组织的方式存储，从而有利于基于时间的查询。通过使用 Go 模板定义索引模式，用户可以利用该插件创建每日或每月索引，这可以大大简化随时间推移的数据管理和检索，从而提高分析性能。
多租户应用程序的集中式日志记录：在多租户应用程序中实施 OpenSearch 插件，其中每个租户的日志都发送到单独的索引。这使得可以对每个租户进行有针对性的分析和监控，同时保持数据隔离。通过利用索引名称模板功能，用户可以自动创建特定于租户的索引，这不仅简化了流程，还提高了租户数据的安全性和可访问性。
与机器学习集成以进行异常检测：将 OpenSearch 插件与机器学习工具结合使用，以自动检测指标数据中的异常。通过配置插件以将实时指标发送到 OpenSearch，用户可以将机器学习模型应用于传入的数据流，以识别异常值或异常模式，从而促进主动监控和快速补救措施。
使用 OpenSearch 增强监控仪表板：使用从 OpenSearch 收集的指标创建实时仪表板，以提供对系统性能的深入了解。通过将指标馈送到 OpenSearch，组织可以利用 OpenSearch Dashboards 可视化关键绩效指标，使运营团队能够快速评估运行状况和性能，并做出数据驱动的决策。

反馈

感谢您成为我们社区的一份子！如果您有任何一般性反馈或在这些页面上发现了任何错误，我们欢迎并鼓励您提供意见。请在InfluxDB 社区 Slack中提交您的反馈。

实用链接

文档

Telegraf 快速入门培训

使用 Telegraf 进行基础设施监控

强大的性能，无限的扩展能力

查看入门方法

Kafka 和 OpenSearch 集成

目录

实用链接

强大的性能，无限的扩展能力

输入和输出集成概述

集成详情

配置

输入和输出集成示例

反馈

实用链接

强大的性能，无限的扩展能力

相关集成

标题

标题

标题

相关集成

HTTP 和 InfluxDB 集成

Kafka 和 InfluxDB 集成

Kinesis 和 InfluxDB 集成

立即开始构建

产品与解决方案

开发者

公司

Kafka 和 OpenSearch 集成

目录

实用链接

强大的性能，无限的扩展能力

输入和输出集成概述

集成详情

配置

输入和输出集成示例

反馈

实用链接

强大的性能，无限的扩展能力

相关集成

标题

标题

标题

相关集成

HTTP 和 InfluxDB 集成

Kafka 和 InfluxDB 集成

Kinesis 和 InfluxDB 集成

立即开始构建

产品与解决方案

开发者

公司

注册 InfluxData 新闻邮件

关注我们