- Rating:
- Be the first to rate this book
- Pages:
- 402
- Available formats:
- ePub, Mobi
Ebook description: Kafka Connect
Used by more than 80% of Fortune 100 companies, Apache Kafka has become the de facto event streaming platform. Kafka Connect is a key component of Kafka that lets you flow data between your existing systems and Kafka to process data in real time.
With this practical guide, authors Mickael Maison and Kate Stanley show data engineers, site reliability engineers, and application developers how to build data pipelines between Kafka clusters and a variety of data sources and sinks. Kafka Connect allows you to quickly adopt Kafka by tapping into existing data and enabling many advanced use cases. No matter where you are in your event streaming journey, Kafka Connect is the ideal tool for building a modern data pipeline.
- Learn Kafka Connect's capabilities, main concepts, and terminology
- Design data and event streaming pipelines that use Kafka Connect
- Configure and operate Kafka Connect environments at scale
- Deploy secured and highly available Kafka Connect clusters
- Build sink and source connectors and single message transforms and converters
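The declarative pipeline definition the book covers boils down to a JSON document submitted to the Kafka Connect REST API. As a minimal sketch (the connector name, file path, and topic below are made-up placeholders; `FileStreamSourceConnector` is the example connector bundled with Apache Kafka, and `localhost:8083` is the worker's default REST port):

```python
import json

# Declarative connector definition: the worker figures out how to run it.
payload = {
    "name": "file-source-demo",  # hypothetical connector name
    "config": {
        # Example connector shipped with Apache Kafka
        "connector.class": "org.apache.kafka.connect.file.FileStreamSourceConnector",
        "tasks.max": "1",
        "file": "/tmp/input.txt",  # hypothetical file to stream from
        "topic": "demo-topic",     # hypothetical target Kafka topic
    },
}

body = json.dumps(payload, indent=2)
print(body)
# To create the connector, POST this body to a running worker, e.g.:
#   curl -X POST -H "Content-Type: application/json" \
#        --data @connector.json http://localhost:8083/connectors
```

The same JSON shape works for any connector plug-in installed on the worker; only the `connector.class` and its connector-specific keys change.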
You can read the "Kafka Connect" ebook on:
- Inkbook, Kindle, Pocketbook, Onyx Boox, and other e-readers
- Windows, MacOS, and other systems
- Windows, Android, iOS, and HarmonyOS systems
- any device or app that supports the PDF, EPub, and Mobi formats
Ebook details
- Ebook ISBN:
- 978-1-098-12649-0, 9781098126490
- Ebook publication date:
- 2023-09-18 The ebook publication date is often the day the title goes on sale and may not match the publication date of the print edition. You can find additional information in the free sample. If in doubt, contact us at sklep@ebookpoint.pl.
- Publication language:
- English
- ePub file size:
- 3.9MB
- Mobi file size:
- 9.6MB
Ebook table of contents
- Foreword
- Preface
- Who Should Read This Book
- Kafka Versions
- Navigating This Book
- Conventions Used in This Book
- O'Reilly Online Learning
- How to Contact Us
- Acknowledgements
- I. Introduction to Kafka Connect
- 1. Meet Kafka Connect
- Kafka Connect Features
- Pluggable Architecture
- Scalability and Reliability
- Declarative Pipeline Definition
- Part of Apache Kafka
- Use Cases
- Capturing Database Changes
- Mirroring Kafka Clusters
- Building Data Lakes
- Aggregating Logs
- Modernizing Legacy Systems
- Alternatives to Kafka Connect
- Summary
- 2. Apache Kafka Basics
- A Distributed Event Streaming Platform
- Open Source
- Distributed
- Event Streaming
- Platform
- Kafka Concepts
- Publish-Subscribe
- Brokers and Records
- Topics and Partitions
- Replication
- Retention and Compaction
- KRaft and ZooKeeper
- Interacting with Kafka
- Producers
- Consumers
- Kafka Streams
- Getting Started with Kafka
- Starting Kafka
- Kafka in KRaft mode (without ZooKeeper)
- Kafka with ZooKeeper
- Sending and Receiving Records
- Running a Kafka Streams Application
- Summary
- II. Developing Data Pipelines with Kafka Connect
- 3. Components in a Kafka Connect Data Pipeline
- Kafka Connect Runtime
- Running Kafka Connect
- Kafka Connect REST API
- Installing Plug-Ins
- Deployment Modes
- Source and Sink Connectors
- Connectors and Tasks
- Configuring Connectors
- Running Connectors
- Converters
- Data Format and Schemas
- Configuring Converters
- Using Converters
- Transformations and Predicates
- Transformation Use Cases
- Routing
- Sanitizing
- Formatting
- Enhancing
- Predicates
- Configuring Transformations and Predicates
- Using Transformations and Predicates
- Summary
- 4. Designing Effective Data Pipelines
- Choosing a Connector
- Pipeline Direction
- Licensing and Support
- Connector Features
- Defining Data Models
- Data Transformation
- Mapping Data Between Systems
- Formatting Data
- Data Formats
- Schemas
- Kafka Connect record schemas
- Kafka record schemas
- Exploring Kafka Connect Internals
- Internal Topics
- Group Membership
- Rebalance Protocols
- Handling Failures in Kafka Connect
- Worker Failure
- Connector/Task Failure
- Kafka/External Systems Failure
- Dead Letter Queues
- Understanding Processing Semantics
- Sink Connectors
- Source Connectors
- Summary
- 5. Connectors in Action
- Confluent S3 Sink Connector
- Configuring the Connector
- Connectivity and S3 details
- Object partitioning
- Object naming
- Object formats
- Object upload
- Exactly-Once Semantics
- Running the Connector
- Using the field partitioner
- Using the time-based partitioner
- Confluent JDBC Source Connector
- Configuring the Connector
- Connectivity
- Topic naming
- Table filtering
- Data collection mode
- Partitioning and parallelism
- Running the Connector
- Using the bulk mode
- Using an incrementing mode
- Debezium MySQL Source Connector
- Configuring the Connector
- Connectivity
- Database and table filtering
- Snapshotting
- Event Formats
- Running the Connector
- Summary
- 6. Mirroring Clusters with MirrorMaker
- Introduction to Mirroring
- Exploring Mirroring Use Cases
- Geo-replication
- Disaster recovery
- Migration
- Complex topologies
- Mirroring in Practice
- Introduction to MirrorMaker
- Common Concepts
- Local and remote topics
- Common configurations
- Replication policies
- Client overrides
- Deployment Modes
- MirrorMaker Connectors
- MirrorSourceConnector
- Configurations
- Topic configurations
- Offset-syncs configurations
- ACLs configurations
- Metrics configurations
- Permissions
- Source cluster ACLs
- Target cluster ACLs
- Metrics
- MirrorCheckpointConnector
- Configurations
- Permissions
- Source cluster ACLs
- Target cluster ACLs
- Metrics
- MirrorHeartbeatConnector
- Configurations
- Permissions
- Running MirrorMaker
- Disaster Recovery Example
- Geo-Replication Example
- Summary
- III. Running Kafka Connect in Production
- 7. Deploying and Operating Kafka Connect Clusters
- Preparing the Kafka Connect Environment
- Building a Kafka Connect Environment
- Installing Plug-Ins
- Networking and Permissions
- Worker Plug-Ins
- Configuration Providers
- REST Extensions
- Connector Client Configuration Override Policies
- Sizing and Planning Capacity
- Understanding Kafka Connect Resource Utilization
- How Many Workers and Tasks?
- Single cluster versus separate clusters
- Maintainability
- Isolation
- Security
- Use case optimization
- Operating Kafka Connect Clusters
- Adding Workers
- Removing Workers
- Upgrading and Applying Maintenance to Workers
- Restarting Failed Tasks and Connectors
- Resetting Offsets of Connectors
- Sink connector offsets
- Source connector offsets
- Administering Kafka Connect Using the REST API
- Creating and Deleting a Connector
- Connector and Task Configuration
- Controlling the Lifecycle of Connectors
- Listing Connector Offsets
- Debugging Issues
- Summary
- 8. Configuring Kafka Connect
- Configuring the Runtime
- Configurations for Production
- Clients and connector overrides
- REST configurations
- Miscellaneous configuration
- Fine-Tuning Configurations
- Connection configurations
- Inter-worker and rebalance configurations
- Topic tracking configurations
- Metrics configurations
- Offset flush configurations
- Configuring Connectors
- Topic Configurations
- Client Overrides
- Configurations for Exactly-Once
- Configurations for Error Handling
- Configuring Kafka Connect Clusters for Security
- Securing the Connection to Kafka
- TLS configurations
- SASL configurations
- SASL OAUTHBEARER configurations
- SASL GSSAPI configurations
- Configuring Permissions
- Securing the REST API
- Summary
- 9. Monitoring Kafka Connect
- Monitoring Logs
- Logging Configuration
- Understanding Startup Logs
- Analyzing Logs
- Log contexts
- Key events
- Errors
- Monitoring Metrics
- Metrics Reporters
- Analyzing Metrics
- Exploring Metrics
- Key Metrics
- Kafka Connect Runtime Metrics
- Metadata metrics
- Network metrics
- Group protocol metrics
- Connector-level metrics
- Task-level metrics
- Other System Metrics
- Internal Kafka client metrics
- Kafka and external system metrics
- Summary
- 10. Administering Kafka Connect on Kubernetes
- Introduction to Kubernetes
- Virtualization Technologies
- Kubernetes Fundamentals
- Running Kafka Connect on Kubernetes
- Container Image
- Deploying Workers
- Networking and Monitoring
- Configuration
- Using a Kubernetes Operator to Deploy Kafka Connect
- Introduction to Kubernetes Operators
- Kubernetes Operators for Kafka Connect
- Strimzi
- Getting a Kubernetes Environment
- Starting the Operator
- Kafka Connect CRDs
- Deploying a Kafka Connect Cluster and Connectors
- MirrorMaker CRD
- Summary
- IV. Building Custom Connectors and Plug-Ins
- 11. Building Source and Sink Connectors
- Common Concepts and APIs
- Building a Custom Connector
- Implementing a connector
- Packaging a connector
- The Connector API
- The version() method
- The config() method
- The initialize() method
- The start() method
- The taskClass() method
- The taskConfigs() method
- The stop() method
- The validate() method
- The context() methods
- Connector API lifecycle
- Configurations
- Configuration types
- Validators and recommenders
- Interacting with configurations at runtime
- The Task API
- The initialize() methods
- The start() method
- The stop() method
- Task API lifecycle
- Kafka Connect Records
- Schemas
- The ConnectorContext API
- The requestTaskReconfiguration() method
- The raiseError() method
- The configs() method
- Implementing Source Connectors
- The SourceTask API
- The poll() method
- The commit() and commitRecord() methods
- SourceTask API lifecycle
- Source Records
- The SourceConnectorContext and SourceTaskContext APIs
- The offsetStorageReader() method
- The transactionContext() method
- Exactly-Once Support
- The exactlyOnceSupport() method
- The canDefineTransactionBoundaries() method
- The commitTransaction() methods
- The abortTransaction() methods
- Implementing Sink Connectors
- The SinkTask API
- The put() method
- The preCommit() method
- The flush() method
- The open() and close() methods
- The SinkTask API lifecycle
- Sink Records
- The SinkConnectorContext and SinkTaskContext APIs
- The offset() methods
- The timeout() method
- The assignment() method
- The pause() and resume() methods
- The requestCommit() method
- The errantRecordReporter() method
- Summary
- 12. Extending Kafka Connect with Connector and Worker Plug-Ins
- Implementing Connector Plug-Ins
- The Transformation API
- The apply() method
- The config() method
- The configure() method
- The close() method
- The Predicate API
- The test() method
- The config() method
- The configure() method
- The close() method
- The Converter and HeaderConverter APIs
- The fromConnectData() methods
- The toConnectData() methods
- The fromConnectHeader() method
- The toConnectHeader() method
- The config() methods
- The configure() methods
- The close() method
- Implementing Worker Plug-Ins
- The ConfigProvider API
- The get() methods
- The configure() method
- The close() method
- The subscribe(), unsubscribe(), and unsubscribeAll() methods
- The ConnectorClientConfigOverridePolicy API
- The validate() method
- The configure() method
- The close() method
- The ConnectRestExtension APIs
- The register() method
- The configure() method
- The close() method
- The version() method
- Summary
- Index