Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition
![Język publikacji: angielski Język publikacji: angielski](https://static01.helion.com.pl/global/flagi/1.png)
![Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition Tom White - okładka ebooka](https://static01.helion.com.pl/global/okladki/326x466/e_2gt2.png)
![Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition Tom White - tył okładki ebooka](https://static01.helion.com.pl/global/okladki-tyl/326x466/e_2gt2.png)
- Ocena:
- Bądź pierwszym, który oceni tę książkę
- Stron:
- 756
- Dostępne formaty:
-
ePubMobi
Opis ebooka: Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition
Get ready to unlock the power of your data. With the fourth edition of this comprehensive guide, youâ??ll learn how to build and maintain reliable, scalable, distributed systems with Apache Hadoop. This book is ideal for programmers looking to analyze datasets of any size, and for administrators who want to set up and run Hadoop clusters.
Using Hadoop 2 exclusively, author Tom White presents new chapters on YARN and several Hadoop-related projects such as Parquet, Flume, Crunch, and Spark. Youâ??ll learn about recent changes to Hadoop, and explore new case studies on Hadoopâ??s role in healthcare systems and genomics data processing.
- Learn fundamental components such as MapReduce, HDFS, and YARN
- Explore MapReduce in depth, including steps for developing applications with it
- Set up and maintain a Hadoop cluster running HDFS and MapReduce on YARN
- Learn two data formats: Avro for data serialization and Parquet for nested data
- Use data ingestion tools such as Flume (for streaming data) and Sqoop (for bulk data transfer)
- Understand how high-level data processing tools like Pig, Hive, Crunch, and Spark work with Hadoop
- Learn the HBase distributed database and the ZooKeeper distributed configuration service
Wybrane bestsellery
-
Tę książkę napisał wytrawny znawca i współtwórca Hadoopa. Przedstawia w niej wszystkie istotne mechanizmy działania platformy i pokazuje, jak efektywnie jej używać. Dowiesz się stąd, do czego służą model MapReduce oraz systemy HDFS i YARN. Nauczysz się budować aplikacje oraz klastry.
Hadoop. Komplety przewodnik. Analiza i przechowywanie danych Hadoop. Komplety przewodnik. Analiza i przechowywanie danych
(44.50 zł najniższa cena z 30 dni)44.50 zł
89.00 zł(-50%) -
Hadoop: The Definitive Guide helps you harness the power of your data. Ideal for processing large datasets, the Apache Hadoop framework is an open source implementation of the MapReduce algorithm on which Google built its empire. This comprehensive resource demonstrates how to use Hadoop to build...
Hadoop: The Definitive Guide. The Definitive Guide Hadoop: The Definitive Guide. The Definitive Guide
(122.68 zł najniższa cena z 30 dni)122.63 zł
149.00 zł(-18%) -
This book helps you create efficient databases, covering data modeling, query optimization, and more. You'll be equipped with the skills needed to design robust databases using PostgreSQL and MySQL for modern data-driven applications.
Database Design and Modeling with PostgreSQL and MySQL. Build efficient and scalable databases for modern applications using open source databases Database Design and Modeling with PostgreSQL and MySQL. Build efficient and scalable databases for modern applications using open source databases
(78.48 zł najniższa cena z 30 dni) -
This book provides a highly focused view of real business outcomes powered by data governance, that resonate with non-data executives such as CFOs and CEOs. You’ll also find useful insights into how to implement data governance initiatives.
Data Governance Handbook. A practical approach to building trust in data Data Governance Handbook. A practical approach to building trust in data
-
This book shows you how to use Apache Spark, Delta Lake, and Databricks to build data pipelines, manage and transform data, optimize performance, and more. Additionally, you’ll implement DataOps and DevOps practices, and orchestrate data workflows.
Data Engineering with Databricks Cookbook. Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake Data Engineering with Databricks Cookbook. Build effective data and AI solutions using Apache Spark, Databricks, and Delta Lake
-
This book will guide you through the fundamental and advanced features of the Snowpark framework in Python. You’ll learn how to use Snowpark for implementing workloads in the fields of data engineering, data science, and data applications.
The Ultimate Guide to Snowpark. Design and deploy Snowpark with Python for efficient data workloads The Ultimate Guide to Snowpark. Design and deploy Snowpark with Python for efficient data workloads
-
This report highlights the vital role of data quality in your data strategy, offering actionable steps to make it the foundation of your data culture, unlocking greater value and informed decisions in the evolving landscape of AI.
Data Quality in the Age of AI. Building a foundation for AI strategy and data culture Data Quality in the Age of AI. Building a foundation for AI strategy and data culture
-
Ця книжка познайомить вас з особливостями Jav...
Head First. Програмування на JavaScript. Head First. Програмування на JavaScript Head First. Програмування на JavaScript. Head First. Програмування на JavaScript
(84.16 zł najniższa cena z 30 dni)84.16 zł
103.90 zł(-19%) -
«Патерни проєктування» 2014 ваша книжка, якщо C...(84.16 zł najniższa cena z 30 dni)
84.16 zł
103.90 zł(-19%)
Kup polskie wydanie:
Hadoop. Komplety przewodnik. Analiza i przechowywanie danych
- Autor:
- Tom White
44,50 zł
89,00 zł
(44.50 zł najniższa cena z 30 dni)
Ebooka "Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition" przeczytasz na:
-
czytnikach Inkbook, Kindle, Pocketbook, Onyx Boox i innych
-
systemach Windows, MacOS i innych
-
systemach Windows, Android, iOS, HarmonyOS
-
na dowolnych urządzeniach i aplikacjach obsługujących formaty: PDF, EPub, Mobi
Masz pytania? Zajrzyj do zakładki Pomoc »
Audiobooka "Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition" posłuchasz:
-
w aplikacji Ebookpoint na Android, iOS, HarmonyOs
-
na systemach Windows, MacOS i innych
-
na dowolnych urządzeniach i aplikacjach obsługujących format MP3 (pliki spakowane w ZIP)
Masz pytania? Zajrzyj do zakładki Pomoc »
Kurs Video "Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition" zobaczysz:
-
w aplikacjach Ebookpoint i Videopoint na Android, iOS, HarmonyOs
-
na systemach Windows, MacOS i innych z dostępem do najnowszej wersji Twojej przeglądarki internetowej
Szczegóły ebooka
- ISBN Ebooka:
- 978-14-919-0170-0, 9781491901700
- Data wydania ebooka:
-
2015-03-25
Data wydania ebooka często jest dniem wprowadzenia tytułu do sprzedaży i może nie być równoznaczna z datą wydania książki papierowej. Dodatkowe informacje możesz znaleźć w darmowym fragmencie. Jeśli masz wątpliwości skontaktuj się z nami sklep@ebookpoint.pl.
- Język publikacji:
- angielski
- Rozmiar pliku ePub:
- 8.1MB
- Rozmiar pliku Mobi:
- 16.7MB
Spis treści ebooka
- Hadoop: The Definitive Guide
- Dedication
- Foreword
- Preface
- Administrative Notes
- Whats New in the Fourth Edition?
- Whats New in the Third Edition?
- Whats New in the Second Edition?
- Conventions Used in This Book
- Using Code Examples
- Safari Books Online
- How to Contact Us
- Acknowledgments
- I. Hadoop Fundamentals
- 1. Meet Hadoop
- Data!
- Data Storage and Analysis
- Querying All Your Data
- Beyond Batch
- Comparison with Other Systems
- Relational Database Management Systems
- Grid Computing
- Volunteer Computing
- A Brief History of Apache Hadoop
- Whats in This Book?
- 1. Meet Hadoop
- 2. MapReduce
- A Weather Dataset
- Data Format
- A Weather Dataset
- Analyzing the Data with Unix Tools
- Analyzing the Data with Hadoop
- Map and Reduce
- Java MapReduce
- A test run
- Scaling Out
- Data Flow
- Combiner Functions
- Specifying a combiner function
- Running a Distributed MapReduce Job
- Hadoop Streaming
- Ruby
- Python
- 3. The Hadoop Distributed Filesystem
- The Design of HDFS
- HDFS Concepts
- Blocks
- Namenodes and Datanodes
- Block Caching
- HDFS Federation
- HDFS High Availability
- Failover and fencing
- The Command-Line Interface
- Basic Filesystem Operations
- Hadoop Filesystems
- Interfaces
- HTTP
- C
- NFS
- FUSE
- Interfaces
- The Java Interface
- Reading Data from a Hadoop URL
- Reading Data Using the FileSystem API
- FSDataInputStream
- Writing Data
- FSDataOutputStream
- Directories
- Querying the Filesystem
- File metadata: FileStatus
- Listing files
- File patterns
- PathFilter
- Deleting Data
- Data Flow
- Anatomy of a File Read
- Anatomy of a File Write
- Coherency Model
- Consequences for application design
- Parallel Copying with distcp
- Keeping an HDFS Cluster Balanced
- 4. YARN
- Anatomy of a YARN Application Run
- Resource Requests
- Application Lifespan
- Building YARN Applications
- Anatomy of a YARN Application Run
- YARN Compared to MapReduce 1
- Scheduling in YARN
- Scheduler Options
- Capacity Scheduler Configuration
- Queue placement
- Fair Scheduler Configuration
- Enabling the Fair Scheduler
- Queue configuration
- Queue placement
- Preemption
- Delay Scheduling
- Dominant Resource Fairness
- Further Reading
- 5. Hadoop I/O
- Data Integrity
- Data Integrity in HDFS
- LocalFileSystem
- ChecksumFileSystem
- Data Integrity
- Compression
- Codecs
- Compressing and decompressing streams with CompressionCodec
- Inferring CompressionCodecs using CompressionCodecFactory
- Native libraries
- CodecPool
- Codecs
- Compression and Input Splits
- Using Compression in MapReduce
- Compressing map output
- Serialization
- The Writable Interface
- WritableComparable and comparators
- The Writable Interface
- Writable Classes
- Writable wrappers for Java primitives
- Text
- Indexing
- Unicode
- Iteration
- Mutability
- Resorting to String
- BytesWritable
- NullWritable
- ObjectWritable and GenericWritable
- Writable collections
- Implementing a Custom Writable
- Implementing a RawComparator for speed
- Custom comparators
- Serialization Frameworks
- Serialization IDL
- File-Based Data Structures
- SequenceFile
- Writing a SequenceFile
- Reading a SequenceFile
- Displaying a SequenceFile with the command-line interface
- Sorting and merging SequenceFiles
- The SequenceFile format
- SequenceFile
- MapFile
- MapFile variants
- Other File Formats and Column-Oriented Formats
- II. MapReduce
- 6. Developing a MapReduce Application
- The Configuration API
- Combining Resources
- Variable Expansion
- The Configuration API
- Setting Up the Development Environment
- Managing Configuration
- GenericOptionsParser, Tool, and ToolRunner
- 6. Developing a MapReduce Application
- Writing a Unit Test with MRUnit
- Mapper
- Reducer
- Running Locally on Test Data
- Running a Job in a Local Job Runner
- Testing the Driver
- Running on a Cluster
- Packaging a Job
- The client classpath
- The task classpath
- Packaging dependencies
- Task classpath precedence
- Packaging a Job
- Launching a Job
- The MapReduce Web UI
- The resource manager page
- The MapReduce job page
- Retrieving the Results
- Debugging a Job
- The tasks and task attempts pages
- Handling malformed data
- Hadoop Logs
- Remote Debugging
- Tuning a Job
- Profiling Tasks
- The HPROF profiler
- Profiling Tasks
- MapReduce Workflows
- Decomposing a Problem into MapReduce Jobs
- JobControl
- Apache Oozie
- Defining an Oozie workflow
- Packaging and deploying an Oozie workflow application
- Running an Oozie workflow job
- 7. How MapReduce Works
- Anatomy of a MapReduce Job Run
- Job Submission
- Job Initialization
- Task Assignment
- Task Execution
- Streaming
- Progress and Status Updates
- Job Completion
- Anatomy of a MapReduce Job Run
- Failures
- Task Failure
- Application Master Failure
- Node Manager Failure
- Resource Manager Failure
- Shuffle and Sort
- The Map Side
- The Reduce Side
- Configuration Tuning
- Task Execution
- The Task Execution Environment
- Streaming environment variables
- The Task Execution Environment
- Speculative Execution
- Output Committers
- Task side-effect files
- 8. MapReduce Types and Formats
- MapReduce Types
- The Default MapReduce Job
- The default Streaming job
- Keys and values in Streaming
- The Default MapReduce Job
- MapReduce Types
- Input Formats
- Input Splits and Records
- FileInputFormat
- FileInputFormat input paths
- FileInputFormat input splits
- Small files and CombineFileInputFormat
- Preventing splitting
- File information in the mapper
- Processing a whole file as a record
- Input Splits and Records
- Text Input
- TextInputFormat
- Controlling the maximum line length
- TextInputFormat
- KeyValueTextInputFormat
- NLineInputFormat
- XML
- Binary Input
- SequenceFileInputFormat
- SequenceFileAsTextInputFormat
- SequenceFileAsBinaryInputFormat
- FixedLengthInputFormat
- Multiple Inputs
- Database Input (and Output)
- Output Formats
- Text Output
- Binary Output
- SequenceFileOutputFormat
- SequenceFileAsBinaryOutputFormat
- MapFileOutputFormat
- Multiple Outputs
- An example: Partitioning data
- MultipleOutputs
- Lazy Output
- Database Output
- 9. MapReduce Features
- Counters
- Built-in Counters
- Task counters
- Job counters
- Built-in Counters
- User-Defined Java Counters
- Dynamic counters
- Retrieving counters
- Counters
- User-Defined Streaming Counters
- Sorting
- Preparation
- Partial Sort
- Total Sort
- Secondary Sort
- Java code
- Streaming
- Joins
- Map-Side Joins
- Reduce-Side Joins
- Side Data Distribution
- Using the Job Configuration
- Distributed Cache
- Usage
- How it works
- The distributed cache API
- MapReduce Library Classes
- III. Hadoop Operations
- 10. Setting Up a Hadoop Cluster
- Cluster Specification
- Cluster Sizing
- Master node scenarios
- Cluster Sizing
- Network Topology
- Rack awareness
- Cluster Specification
- 10. Setting Up a Hadoop Cluster
- Cluster Setup and Installation
- Installing Java
- Creating Unix User Accounts
- Installing Hadoop
- Configuring SSH
- Configuring Hadoop
- Formatting the HDFS Filesystem
- Starting and Stopping the Daemons
- Creating User Directories
- Hadoop Configuration
- Configuration Management
- Environment Settings
- Java
- Memory heap size
- System logfiles
- SSH settings
- Important Hadoop Daemon Properties
- HDFS
- YARN
- Memory settings in YARN and MapReduce
- CPU settings in YARN and MapReduce
- Hadoop Daemon Addresses and Ports
- Other Hadoop Properties
- Cluster membership
- Buffer size
- HDFS block size
- Reserved storage space
- Trash
- Job scheduler
- Reduce slow start
- Short-circuit local reads
- Security
- Kerberos and Hadoop
- An example
- Kerberos and Hadoop
- Delegation Tokens
- Other Security Enhancements
- Benchmarking a Hadoop Cluster
- Hadoop Benchmarks
- Benchmarking MapReduce with TeraSort
- Other benchmarks
- Hadoop Benchmarks
- User Jobs
- 11. Administering Hadoop
- HDFS
- Persistent Data Structures
- Namenode directory structure
- The filesystem image and edit log
- Secondary namenode directory structure
- Datanode directory structure
- Persistent Data Structures
- Safe Mode
- Entering and leaving safe mode
- HDFS
- Audit Logging
- Tools
- dfsadmin
- Filesystem check (fsck)
- Finding the blocks for a file
- Datanode block scanner
- Balancer
- Monitoring
- Logging
- Setting log levels
- Getting stack traces
- Logging
- Metrics and JMX
- Maintenance
- Routine Administration Procedures
- Metadata backups
- Data backups
- Filesystem check (fsck)
- Filesystem balancer
- Routine Administration Procedures
- Commissioning and Decommissioning Nodes
- Commissioning new nodes
- Decommissioning old nodes
- Upgrades
- HDFS data and metadata upgrades
- Start the upgrade
- Wait until the upgrade is complete
- Check the upgrade
- Roll back the upgrade (optional)
- Finalize the upgrade (optional)
- HDFS data and metadata upgrades
- IV. Related Projects
- 12. Avro
- Avro Data Types and Schemas
- In-Memory Serialization and Deserialization
- The Specific API
- Avro Datafiles
- Interoperability
- Python API
- Avro Tools
- 12. Avro
- Schema Resolution
- Sort Order
- Avro MapReduce
- Sorting Using Avro MapReduce
- Avro in Other Languages
- 13. Parquet
- Data Model
- Nested Encoding
- Data Model
- Parquet File Format
- Parquet Configuration
- Writing and Reading Parquet Files
- Avro, Protocol Buffers, and Thrift
- Projection and read schemas
- Avro, Protocol Buffers, and Thrift
- Parquet MapReduce
- 14. Flume
- Installing Flume
- An Example
- Transactions and Reliability
- Batching
- The HDFS Sink
- Partitioning and Interceptors
- File Formats
- Fan Out
- Delivery Guarantees
- Replicating and Multiplexing Selectors
- Distribution: Agent Tiers
- Delivery Guarantees
- Sink Groups
- Integrating Flume with Applications
- Component Catalog
- Further Reading
- 15. Sqoop
- Getting Sqoop
- Sqoop Connectors
- A Sample Import
- Text and Binary File Formats
- Generated Code
- Additional Serialization Systems
- Imports: A Deeper Look
- Controlling the Import
- Imports and Consistency
- Incremental Imports
- Direct-Mode Imports
- Working with Imported Data
- Imported Data and Hive
- Importing Large Objects
- Performing an Export
- Exports: A Deeper Look
- Exports and Transactionality
- Exports and SequenceFiles
- Further Reading
- 16. Pig
- Installing and Running Pig
- Execution Types
- Local mode
- MapReduce mode
- Execution Types
- Running Pig Programs
- Grunt
- Pig Latin Editors
- Installing and Running Pig
- An Example
- Generating Examples
- Comparison with Databases
- Pig Latin
- Structure
- Statements
- Expressions
- Types
- Schemas
- Using Hive tables with HCatalog
- Validation and nulls
- Schema merging
- Functions
- Other libraries
- Macros
- User-Defined Functions
- A Filter UDF
- Leveraging types
- A Filter UDF
- An Eval UDF
- Dynamic invokers
- A Load UDF
- Using a schema
- Data Processing Operators
- Loading and Storing Data
- Filtering Data
- FOREACH...GENERATE
- STREAM
- Grouping and Joining Data
- JOIN
- COGROUP
- CROSS
- GROUP
- Sorting Data
- Combining and Splitting Data
- Pig in Practice
- Parallelism
- Anonymous Relations
- Parameter Substitution
- Dynamic parameters
- Parameter substitution processing
- Further Reading
- 17. Hive
- Installing Hive
- The Hive Shell
- Installing Hive
- An Example
- Running Hive
- Configuring Hive
- Execution engines
- Logging
- Configuring Hive
- Hive Services
- Hive clients
- The Metastore
- Comparison with Traditional Databases
- Schema on Read Versus Schema on Write
- Updates, Transactions, and Indexes
- SQL-on-Hadoop Alternatives
- HiveQL
- Data Types
- Primitive types
- Complex types
- Data Types
- Operators and Functions
- Conversions
- Tables
- Managed Tables and External Tables
- Partitions and Buckets
- Partitions
- Buckets
- Storage Formats
- The default storage format: Delimited text
- Binary storage formats: Sequence files, Avro datafiles, Parquet files, RCFiles, and ORCFiles
- Using a custom SerDe: RegexSerDe
- Storage handlers
- Importing Data
- Inserts
- Multitable insert
- CREATE TABLE...AS SELECT
- Altering Tables
- Dropping Tables
- Querying Data
- Sorting and Aggregating
- MapReduce Scripts
- Joins
- Inner joins
- Outer joins
- Semi joins
- Map joins
- Subqueries
- Views
- User-Defined Functions
- Writing a UDF
- Writing a UDAF
- A more complex UDAF
- Further Reading
- 18. Crunch
- An Example
- The Core Crunch API
- Primitive Operations
- union()
- parallelDo()
- groupByKey()
- combineValues()
- Primitive Operations
- Types
- Records and tuples
- Sources and Targets
- Reading from a source
- Writing to a target
- Existing outputs
- Combined sources and targets
- Functions
- Serialization of functions
- Object reuse
- Materialization
- PObject
- Pipeline Execution
- Running a Pipeline
- Asynchronous execution
- Debugging
- Running a Pipeline
- Stopping a Pipeline
- Inspecting a Crunch Plan
- Iterative Algorithms
- Checkpointing a Pipeline
- Crunch Libraries
- Further Reading
- 19. Spark
- Installing Spark
- An Example
- Spark Applications, Jobs, Stages, and Tasks
- A Scala Standalone Application
- A Java Example
- A Python Example
- Resilient Distributed Datasets
- Creation
- Transformations and Actions
- Aggregation transformations
- Persistence
- Persistence levels
- Serialization
- Data
- Functions
- Shared Variables
- Broadcast Variables
- Accumulators
- Anatomy of a Spark Job Run
- Job Submission
- DAG Construction
- Task Scheduling
- Task Execution
- Executors and Cluster Managers
- Spark on YARN
- YARN client mode
- YARN cluster mode
- Spark on YARN
- Further Reading
- 20. HBase
- HBasics
- Backdrop
- HBasics
- Concepts
- Whirlwind Tour of the Data Model
- Regions
- Locking
- Whirlwind Tour of the Data Model
- Implementation
- HBase in operation
- Installation
- Test Drive
- Clients
- Java
- MapReduce
- REST and Thrift
- Building an Online Query Application
- Schema Design
- Loading Data
- Load distribution
- Bulk load
- Online Queries
- Station queries
- Observation queries
- HBase Versus RDBMS
- Successful Service
- HBase
- Praxis
- HDFS
- UI
- Metrics
- Counters
- Further Reading
- 21. ZooKeeper
- Installing and Running ZooKeeper
- An Example
- Group Membership in ZooKeeper
- Creating the Group
- Joining a Group
- Listing Members in a Group
- ZooKeeper command-line tools
- Deleting a Group
- The ZooKeeper Service
- Data Model
- Ephemeral znodes
- Sequence numbers
- Watches
- Data Model
- Operations
- Multiupdate
- APIs
- Watch triggers
- ACLs
- Implementation
- Consistency
- Sessions
- Time
- States
- Building Applications with ZooKeeper
- A Configuration Service
- The Resilient ZooKeeper Application
- InterruptedException
- KeeperException
- State exceptions
- Recoverable exceptions
- Unrecoverable exceptions
- A reliable configuration service
- A Lock Service
- The herd effect
- Recoverable exceptions
- Unrecoverable exceptions
- Implementation
- More Distributed Data Structures and Protocols
- BookKeeper and Hedwig
- ZooKeeper in Production
- Resilience and Performance
- Configuration
- Further Reading
- V. Case Studies
- 22. Composable Data at Cerner
- From CPUs to Semantic Integration
- Enter Apache Crunch
- Building a Complete Picture
- Integrating Healthcare Data
- Composability over Frameworks
- Moving Forward
- 22. Composable Data at Cerner
- 23. Biological Data Science: Saving Lives with Software
- The Structure of DNA
- The Genetic Code: Turning DNA Letters into Proteins
- Thinking of DNA as Source Code
- The Human Genome Project and Reference Genomes
- Sequencing and Aligning DNA
- ADAM, A Scalable Genome Analysis Platform
- Literate programming with the Avro interface description language (IDL)
- Column-oriented access with Parquet
- A simple example: k-mer counting using Spark and ADAM
- From Personalized Ads to Personalized Medicine
- Join In
- 24. Cascading
- Fields, Tuples, and Pipes
- Operations
- Taps, Schemes, and Flows
- Cascading in Practice
- Flexibility
- Hadoop and Cascading at ShareThis
- Summary
- A. Installing Apache Hadoop
- Prerequisites
- Installation
- Configuration
- Standalone Mode
- Pseudodistributed Mode
- Configuring SSH
- Formatting the HDFS filesystem
- Starting and stopping the daemons
- Creating a user directory
- Fully Distributed Mode
- B. Clouderas Distribution Including Apache Hadoop
- C. Preparing the NCDC Weather Data
- D. The Old and New Java MapReduce APIs
- Index
- Colophon
- Copyright
O'Reilly Media - inne książki
-
Keeping up with the Python ecosystem can be daunting. Its developer tooling doesn't provide the out-of-the-box experience native to languages like Rust and Go. When it comes to long-term project maintenance or collaborating with others, every Python project faces the same problem: how to build re...(200.58 zł najniższa cena z 30 dni)
200.38 zł
239.00 zł(-16%) -
Bringing a deep-learning project into production at scale is quite challenging. To successfully scale your project, a foundational understanding of full stack deep learning, including the knowledge that lies at the intersection of hardware, software, data, and algorithms, is required.This book il...(241.06 zł najniższa cena z 30 dni)
241.01 zł
289.00 zł(-17%) -
Frontend developers have to consider many things: browser compatibility, usability, performance, scalability, SEO, and other best practices. But the most fundamental aspect of creating websites is one that often falls short: accessibility. Accessibility is the cornerstone of any website, and if a...(199.29 zł najniższa cena z 30 dni)
198.79 zł
239.00 zł(-17%) -
In this insightful and comprehensive guide, Addy Osmani shares more than a decade of experience working on the Chrome team at Google, uncovering secrets to engineering effectiveness, efficiency, and team success. Engineers and engineering leaders looking to scale their effectiveness and drive tra...(114.23 zł najniższa cena z 30 dni)
113.73 zł
149.00 zł(-24%) -
Data modeling is the single most overlooked feature in Power BI Desktop, yet it's what sets Power BI apart from other tools on the market. This practical book serves as your fast-forward button for data modeling with Power BI, Analysis Services tabular, and SQL databases. It serves as a starting ...(198.48 zł najniższa cena z 30 dni)
198.43 zł
239.00 zł(-17%) -
C# is undeniably one of the most versatile programming languages available to engineers today. With this comprehensive guide, you'll learn just how powerful the combination of C# and .NET can be. Author Ian Griffiths guides you through C# 12.0 and .NET 8 fundamentals and techniques for building c...(240.17 zł najniższa cena z 30 dni)
240.07 zł
289.00 zł(-17%) -
Learn how to get started with Futures Thinking. With this practical guide, Phil Balagtas, founder of the Design Futures Initiative and the global Speculative Futures network, shows you how designers and futurists have made futures work at companies such as Atari, IBM, Apple, Disney, Autodesk, Luf...(147.60 zł najniższa cena z 30 dni)
147.09 zł
179.00 zł(-18%) -
Augmented Analytics isn't just another book on data and analytics; it's a holistic resource for reimagining the way your entire organization interacts with information to become insight-driven.Moving beyond traditional, limited ways of making sense of data, Augmented Analytics provides a dynamic,...(173.34 zł najniższa cena z 30 dni)
173.13 zł
219.00 zł(-21%) -
Learn how to prepare for—and pass—the Kubernetes and Cloud Native Associate (KCNA) certification exam. This practical guide serves as both a study guide and point of entry for practitioners looking to explore and adopt cloud native technologies. Adrián González Sánchez ...
Kubernetes and Cloud Native Associate (KCNA) Study Guide Kubernetes and Cloud Native Associate (KCNA) Study Guide
(169.14 zł najniższa cena z 30 dni)177.65 zł
199.00 zł(-11%) -
Python is an excellent way to get started in programming, and this clear, concise guide walks you through Python a step at a time—beginning with basic programming concepts before moving on to functions, data structures, and object-oriented design. This revised third edition reflects the gro...(139.84 zł najniższa cena z 30 dni)
139.74 zł
179.00 zł(-22%)
Dzieki opcji "Druk na żądanie" do sprzedaży wracają tytuły Grupy Helion, które cieszyły sie dużym zainteresowaniem, a których nakład został wyprzedany.
Dla naszych Czytelników wydrukowaliśmy dodatkową pulę egzemplarzy w technice druku cyfrowego.
Co powinieneś wiedzieć o usłudze "Druk na żądanie":
- usługa obejmuje tylko widoczną poniżej listę tytułów, którą na bieżąco aktualizujemy;
- cena książki może być wyższa od początkowej ceny detalicznej, co jest spowodowane kosztami druku cyfrowego (wyższymi niż koszty tradycyjnego druku offsetowego). Obowiązująca cena jest zawsze podawana na stronie WWW książki;
- zawartość książki wraz z dodatkami (płyta CD, DVD) odpowiada jej pierwotnemu wydaniu i jest w pełni komplementarna;
- usługa nie obejmuje książek w kolorze.
Masz pytanie o konkretny tytuł? Napisz do nas: sklep[at]helion.pl.
Książka, którą chcesz zamówić pochodzi z końcówki nakładu. Oznacza to, że mogą się pojawić drobne defekty (otarcia, rysy, zagięcia).
Co powinieneś wiedzieć o usłudze "Końcówka nakładu":
- usługa obejmuje tylko książki oznaczone tagiem "Końcówka nakładu";
- wady o których mowa powyżej nie podlegają reklamacji;
Masz pytanie o konkretny tytuł? Napisz do nas: sklep[at]helion.pl.
Książka drukowana
![Loader](https://static01.helion.com.pl/ebookpoint/img/ajax-loader.gif)
![ajax-loader](https://static01.helion.com.pl/ebookpoint/img/ajax-loader.gif)
Oceny i opinie klientów: Hadoop: The Definitive Guide. Storage and Analysis at Internet Scale. 4th Edition Tom White (0)
Weryfikacja opinii następuję na podstawie historii zamówień na koncie Użytkownika umieszczającego opinię. Użytkownik mógł otrzymać punkty za opublikowanie opinii uprawniające do uzyskania rabatu w ramach Programu Punktowego.