Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools
- Autor:
- Vince Buffalo
- Ocena:
- Bądź pierwszym, który oceni tę książkę
- Stron:
- 538
- Dostępne formaty:
-
ePubMobi
Opis ebooka: Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools
Learn the data skills necessary for turning large sequencing datasets into reproducible and robust biological findings. With this practical guide, you’ll learn how to use freely available open source tools to extract meaning from large complex biological data sets.
At no other point in human history has our ability to understand life’s complexities been so dependent on our skills to work with and analyze data. This intermediate-level book teaches the general computational and data skills you need to analyze biological data. If you have experience with a scripting language like Python, you’re ready to get started.
- Go from handling small problems with messy scripts to tackling large problems with clever methods and tools
- Process bioinformatics data with powerful Unix pipelines and data tools
- Learn how to use exploratory data analysis techniques in the R language
- Use efficient methods to work with genomic range data and range operations
- Work with common genomics data file formats like FASTA, FASTQ, SAM, and BAM
- Manage your bioinformatics project with the Git version control system
- Tackle tedious data processing tasks with with Bash scripts and Makefiles
Wybrane bestsellery
-
Zapora sieciowa (ang. firewall) to ostatni bastion chroniący Twój komputer przed atakiem. Dzięki odpowiedniej konfiguracji jesteś w stanie decydować, jakie pakiety mogą trafić do wnętrza sieci, a jakie nie. Możesz przesądzić o dostępie do określonych usług, zezwolić lub zabronić aplikacjom korzys...
Zapory sieciowe w systemie Linux. Kompendium wiedzy o nftables. Wydanie IV Zapory sieciowe w systemie Linux. Kompendium wiedzy o nftables. Wydanie IV
(24.90 zł najniższa cena z 30 dni)34.50 zł
69.00 zł(-50%) -
Przestępcy, co raz częściej korzystają ze złośliwego oprogramowania (exploity, wirusy, rootkity itp.) w celu oszukania nieświadomego użytkownika, przeprowadzenia ataku typu odmowa usługi lub włamania się do jakiegoś systemu i wykradzenia informacji. Działanie oprogramowania tego typu można w pełn...
Inżynieria odwrotna w praktyce. Narzędzia i techniki Inżynieria odwrotna w praktyce. Narzędzia i techniki
Bruce Dang, Alexandre Gazet, Elias Bachaalany, Sébastien Josse
(24.90 zł najniższa cena z 30 dni)34.50 zł
69.00 zł(-50%) -
Hacking jest w tym tomie rozpatrywany z różnych perspektyw, jako: przestępstwo, incydent cyberbezpieczeństwa, eksplorowanie i modyfikowanie systemów technicznych, społecznych i biologicznych, a najogólniej - jako przełamywanie zabezpieczeń.(203.15 zł najniższa cena z 30 dni)
203.15 zł
239.00 zł(-15%) -
By 2020, the Internet of Things (IoT) will consist of millions of computational devices intimately connected to real-world aspects of human life. In this insightful book, Professor Sean Smith, who worked in information security long before the web appeared, explains that if we build the IoT the w...
The Internet of Risky Things. Trusting the Devices That Surround Us The Internet of Risky Things. Trusting the Devices That Surround Us
(92.65 zł najniższa cena z 30 dni)101.15 zł
119.00 zł(-15%) -
Niniejsza książka jest wyczerpującym podręcznikiem bezpieczeństwa systemów informatycznych, a ściślej rzecz ujmując — procedur reagowania na incydenty bezpieczeństwa. To lektura obowiązkowa zarówno dla osób z najwyższego kierownictwa, jak i dla koordynatorów oraz specjalistów zajmujących si...
Incydenty bezpieczeństwa. Metody reagowania w informatyce śledczej Incydenty bezpieczeństwa. Metody reagowania w informatyce śledczej
(34.90 zł najniższa cena z 30 dni)49.50 zł
99.00 zł(-50%) -
If you’re involved in cybersecurity as a software developer, forensic investigator, or network administrator, this practical guide shows you how to apply the scientific method when assessing techniques for protecting your information systems. You’ll learn how to conduct scientific exp...
Essential Cybersecurity Science. Build, Test, and Evaluate Secure Systems Essential Cybersecurity Science. Build, Test, and Evaluate Secure Systems
(152.15 zł najniższa cena z 30 dni)160.65 zł
189.00 zł(-15%) -
Znajdziesz w niej szczegółowe omówienie zagadnień związanych z przeprowadzaniem testów penetracyjnych sieci bezprzewodowych, bogato zilustrowanych przykładami praktycznymi, które pozwolą Ci dobrze przyswoić prezentowany materiał i przećwiczyć zdobyte umiejętności, począwszy od bardzo prostych, el...
Kali Linux. Audyt bezpieczeństwa sieci Wi-Fi dla każdego. Wydanie II Kali Linux. Audyt bezpieczeństwa sieci Wi-Fi dla każdego. Wydanie II
(24.90 zł najniższa cena z 30 dni)34.50 zł
69.00 zł(-50%) -
Sięgnij po nią i przekonaj się, jak szybko stworzyć w języku Python skrypt tropiący pakiety w systemach Windows i Linux, przeprowadzający atak ARP cache poisoning lub korzystający z biblioteki urllib2. Sporo uwagi zostało tu poświęcone tworzeniu koni trojańskich oraz budowaniu rozszerzeń dla narz...
Black Hat Python. Język Python dla hakerów i pentesterów Black Hat Python. Język Python dla hakerów i pentesterów
(14.90 zł najniższa cena z 30 dni)19.95 zł
39.90 zł(-50%) -
Dowiedz się, w jaki sposób możesz zdalnie zidentyfikować uruchomiony system operacyjny, przeprowadzać ataki DoS (ang. Denial of Service) oraz testować aplikacje sieciowe. To doskonała lektura, która w rękach wprawnego administratora pozwoli na zdecydowane zwiększenie bezpieczeństwa sieci.(24.90 zł najniższa cena z 30 dni)
34.50 zł
69.00 zł(-50%) -
This book is a marvellous thing: an important intervention in the policy debate about information security and a practical text for people trying to improve the situation.— Cory Doctorowauthor, co-editor of Boing BoingA future with billions of connected "things" includes monumental security...
Abusing the Internet of Things. Blackouts, Freakouts, and Stakeouts Abusing the Internet of Things. Blackouts, Freakouts, and Stakeouts
(152.15 zł najniższa cena z 30 dni)160.65 zł
189.00 zł(-15%)
Ebooka "Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools" przeczytasz na:
-
czytnikach Inkbook, Kindle, Pocketbook, Onyx Boox i innych
-
systemach Windows, MacOS i innych
-
systemach Windows, Android, iOS, HarmonyOS
-
na dowolnych urządzeniach i aplikacjach obsługujących formaty: PDF, EPub, Mobi
Masz pytania? Zajrzyj do zakładki Pomoc »
Audiobooka "Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools" posłuchasz:
-
w aplikacji Ebookpoint na Android, iOS, HarmonyOs
-
na systemach Windows, MacOS i innych
-
na dowolnych urządzeniach i aplikacjach obsługujących format MP3 (pliki spakowane w ZIP)
Masz pytania? Zajrzyj do zakładki Pomoc »
Kurs Video "Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools" zobaczysz:
-
w aplikacjach Ebookpoint i Videopoint na Android, iOS, HarmonyOs
-
na systemach Windows, MacOS i innych z dostępem do najnowszej wersji Twojej przeglądarki internetowej
Szczegóły ebooka
- ISBN Ebooka:
- 978-14-493-6750-3, 9781449367503
- Data wydania ebooka:
- 2015-07-01 Data wydania ebooka często jest dniem wprowadzenia tytułu do sprzedaży i może nie być równoznaczna z datą wydania książki papierowej. Dodatkowe informacje możesz znaleźć w darmowym fragmencie. Jeśli masz wątpliwości skontaktuj się z nami sklep@ebookpoint.pl.
- Język publikacji:
- angielski
- Rozmiar pliku ePub:
- 5.4MB
- Rozmiar pliku Mobi:
- 16.0MB
Spis treści ebooka
- Preface
- The Approach of This Book
- Why This Book Focuses on Sequencing Data
- Audience
- The Difficulty Level of Bioinformatics Data Skills
- Assumptions This Book Makes
- Supplementary Material on GitHub
- Computing Resources and Setup
- Organization of This Book
- Code Conventions
- Conventions Used in This Book
- Using Code Examples
- Safari Books Online
- How to Contact Us
- Acknowledgments
- I. Ideology: Data Skills for Robust and Reproducible Bioinformatics
- 1. How to Learn Bioinformatics
- Why Bioinformatics? Biologys Growing Data
- Learning Data Skills to Learn Bioinformatics
- New Challenges for Reproducible and Robust Research
- Reproducible Research
- Robust Research and the Golden Rule of Bioinformatics
- Adopting Robust and Reproducible Practices Will Make Your Life Easier, Too
- Recommendations for Robust Research
- Pay Attention to Experimental Design
- Write Code for Humans, Write Data for Computers
- Let Your Computer Do the Work For You
- Make Assertions and Be Loud, in Code and in Your Methods
- Test Code, or Better Yet, Let Code Test Code
- Use Existing Libraries Whenever Possible
- Treat Data as Read-Only
- Spend Time Developing Frequently Used Scripts into Tools
- Let Data Prove That Its High Quality
- Recommendations for Reproducible Research
- Release Your Code and Data
- Document Everything
- Make Figures and Statistics the Results of Scripts
- Use Code as Documentation
- Continually Improving Your Bioinformatics Data Skills
- II. Prerequisites: Essential Skills for Getting Started with a Bioinformatics Project
- 2. Setting Up and Managing a Bioinformatics Project
- Project Directories and Directory Structures
- Project Documentation
- Use Directories to Divide Up Your Project into Subprojects
- Organizing Data to Automate File Processing Tasks
- Markdown for Project Notebooks
- Markdown Formatting Basics
- Using Pandoc to Render Markdown to HTML
- 3. Remedial Unix Shell
- Why Do We Use Unix in Bioinformatics? Modularity and the Unix Philosophy
- Working with Streams and Redirection
- Redirecting Standard Out to a File
- Redirecting Standard Error
- Using Standard Input Redirection
- The Almighty Unix Pipe: Speed and Beauty in One
- Pipes in Action: Creating Simple Programs with Grep and Pipes
- Combining Pipes and Redirection
- Even More Redirection: A tee in Your Pipe
- Managing and Interacting with Processes
- Background Processes
- Killing Processes
- Exit Status: How to Programmatically Tell Whether Your Command Worked
- Command Substitution
- 4. Working with Remote Machines
- Connecting to Remote Machines with SSH
- Quick Authentication with SSH Keys
- Maintaining Long-Running Jobs with nohup and tmux
- nohup
- Working with Remote Machines Through Tmux
- Installing and Configuring Tmux
- Creating, Detaching, and Attaching Tmux Sessions
- Working with Tmux Windows
- 5. Git for Scientists
- Why Git Is Necessary in Bioinformatics Projects
- Git Allows You to Keep Snapshots of Your Project
- Git Helps You Keep Track of Important Changes to Code
- Git Helps Keep Software Organized and Available After People Leave
- Why Git Is Necessary in Bioinformatics Projects
- Installing Git
- Basic Git: Creating Repositories, Tracking Files, and Staging and Committing Changes
- Git Setup: Telling Git Who You Are
- git init and git clone: Creating Repositories
- Tracking Files in Git: git add and git status Part I
- Staging Files in Git: git add and git status Part II
- git commit: Taking a Snapshot of Your Project
- Seeing File Differences: git diff
- Seeing Your Commit History: git log
- Moving and Removing Files: git mv and git rm
- Telling Git What to Ignore: .gitignore
- Undoing a Stage: git reset
- Collaborating with Git: Git Remotes, git push, and git pull
- Creating a Shared Central Repository with GitHub
- Authenticating with Git Remotes
- Connecting with Git Remotes: git remote
- Pushing Commits to a Remote Repository with git push
- Pulling Commits from a Remote Repository with git pull
- Working with Your Collaborators: Pushing and Pulling
- Merge Conflicts
- More GitHub Workflows: Forking and Pull Requests
- Using Git to Make Life Easier: Working with Past Commits
- Getting Files from the Past: git checkout
- Stashing Your Changes: git stash
- More git diff: Comparing Commits and Files
- Undoing and Editing Commits: git commit --amend
- Working with Branches
- Creating and Working with Branches: git branch and git checkout
- Merging Branches: git merge
- Branches and Remotes
- Continuing Your Git Education
- 6. Bioinformatics Data
- Retrieving Bioinformatics Data
- Downloading Data with wget and curl
- wget
- Curl
- Downloading Data with wget and curl
- Rsync and Secure Copy (scp)
- Retrieving Bioinformatics Data
- Data Integrity
- SHA and MD5 Checksums
- Looking at Differences Between Data
- Compressing Data and Working with Compressed Data
- gzip
- Working with Gzipped Compressed Files
- Case Study: Reproducibly Downloading Data
- III. Practice: Bioinformatics Data Skills
- 7. Unix Data Tools
- Unix Data Tools and the Unix One-Liner Approach: Lessons from Programming Pearls
- When to Use the Unix Pipeline Approach and How to Use It Safely
- Inspecting and Manipulating Text Data with Unix Tools
- Inspecting Data with Head and Tail
- less
- Plain-Text Data Summary Information with wc, ls, and awk
- Working with Column Data with cut and Columns
- Formatting Tabular Data with column
- The All-Powerful Grep
- Decoding Plain-Text Data: hexdump
- Sorting Plain-Text Data with Sort
- Finding Unique Values in Uniq
- Join
- Text Processing with Awk
- Bioawk: An Awk for Biological Formats
- Stream Editing with Sed
- Advanced Shell Tricks
- Subshells
- Named Pipes and Process Substitution
- The Unix Philosophy Revisited
- 8. A Rapid Introduction to the R Language
- Getting Started with R and RStudio
- R Language Basics
- Simple Calculations in R, Calling Functions, and Getting Help in R
- Variables and Assignment
- Vectors, Vectorization, and Indexing
- Vector types
- Factors and classes in R
- Working with and Visualizing Data in R
- Loading Data into R
- Exploring and Transforming Dataframes
- Exploring Data Through Slicing and Dicing: Subsetting Dataframes
- Exploring Data Visually with ggplot2 I: Scatterplots and Densities
- Exploring Data Visually with ggplot2 II: Smoothing
- Binning Data with cut() and Bar Plots with ggplot2
- Merging and Combining Data: Matching Vectors and Merging Dataframes
- Using ggplot2 Facets
- More R Data Structures: Lists
- Writing and Applying Functions to Lists with lapply() and sapply()
- Using lapply()
- Writing functions
- Digression: Debugging R Code
- More list apply functions: sapply() and mapply()
- Working with the Split-Apply-Combine Pattern
- Exploring Dataframes with dplyr
- Working with Strings
- Developing Workflows with R Scripts
- Control Flow: if, for, and while
- Working with R Scripts
- Workflows for Loading and Combining Multiple Files
- Exporting Data
- Further R Directions and Resources
- 9. Working with Range Data
- A Crash Course in Genomic Ranges and Coordinate Systems
- An Interactive Introduction to Range Data with GenomicRanges
- Installing and Working with Bioconductor Packages
- Storing Generic Ranges with IRanges
- Basic Range Operations: Arithmetic, Transformations, and Set Operations
- Finding Overlapping Ranges
- Finding Nearest Ranges and Calculating Distance
- Run Length Encoding and Views
- Run-length encoding and coverage()
- Going from run-length encoded sequences to ranges with slice()
- Advanced IRanges: Views
- Storing Genomic Ranges with GenomicRanges
- Grouping Data with GRangesList
- Working with Annotation Data: GenomicFeatures and rtracklayer
- Retrieving Promoter Regions: Flank and Promoters
- Retrieving Promoter Sequence: Connection GenomicRanges with Sequence Data
- Getting Intergenic and Intronic Regions: Gaps, Reduce, and Setdiffs in Practice
- Finding and Working with Overlapping Ranges
- Calculating Coverage of GRanges Objects
- Working with Ranges Data on the Command Line with BEDTools
- Computing Overlaps with BEDTools Intersect
- BEDTools Slop and Flank
- Coverage with BEDTools
- Other BEDTools Subcommands and pybedtools
- 10. Working with Sequence Data
- The FASTA Format
- The FASTQ Format
- Nucleotide Codes
- Base Qualities
- Example: Inspecting and Trimming Low-Quality Bases
- A FASTA/FASTQ Parsing Example: Counting Nucleotides
- Indexed FASTA Files
- 11. Working with Alignment Data
- Getting to Know Alignment Formats: SAM and BAM
- The SAM Header
- The SAM Alignment Section
- Bitwise Flags
- CIGAR Strings
- Mapping Qualities
- Getting to Know Alignment Formats: SAM and BAM
- Command-Line Tools for Working with Alignments in the SAM Format
- Using samtools view to Convert between SAM and BAM
- Samtools Sort and Index
- Extracting and Filtering Alignments with samtools view
- Extracting alignments from a region with samtools view
- Filtering alignments with samtools view
- Visualizing Alignments with samtools tview and the Integrated Genomics Viewer
- Pileups with samtools pileup, Variant Calling, and Base Alignment Quality
- Creating Your Own SAM/BAM Processing Tools with Pysam
- Opening BAM Files, Fetching Alignments from a Region, and Iterating Across Reads
- Extracting SAM/BAM Header Information from an AlignmentFile Object
- Working with AlignedSegment Objects
- Writing a Program to Record Alignment Statistics
- Additional Pysam Features and Other SAM/BAM APIs
- 12. Bioinformatics Shell Scripting, Writing Pipelines, and Parallelizing Tasks
- Basic Bash Scripting
- Writing and Running Robust Bash Scripts
- A robust Bash header
- Running Bash scripts
- Writing and Running Robust Bash Scripts
- Variables and Command Arguments
- Command-line arguments
- Basic Bash Scripting
- Conditionals in a Bash Script: if Statements
- Processing Files with Bash Using for Loops and Globbing
- Automating File-Processing with find and xargs
- Using find and xargs
- Finding Files with find
- finds Expressions
- finds -exec: Running Commands on finds Results
- xargs: A Unix Powertool
- Using xargs with Replacement Strings to Apply Commands to Files
- xargs and Parallelization
- Make and Makefiles: Another Option for Pipelines
- 13. Out-of-Memory Approaches: Tabix and SQLite
- Fast Access to Indexed Tab-Delimited Files with BGZF and Tabix
- Compressing Files for Tabix with Bgzip
- Indexing Files with Tabix
- Using Tabix
- Fast Access to Indexed Tab-Delimited Files with BGZF and Tabix
- Introducing Relational Databases Through SQLite
- When to Use Relational Databases in Bioinformatics
- Installing SQLite
- Exploring SQLite Databases with the Command-Line Interface
- Querying Out Data: The Almighty SELECT Command
- Limiting results with LIMIT
- Selecting columns with SELECT
- Ordering rows with ORDER BY
- Filtering which rows with WHERE
- SQLite Functions
- SQLite Aggregate Functions
- Grouping rows with GROUP BY
- Subqueries
- Organizing Relational Databases and Joins
- Organizing relational databases
- Inner joins
- Left outer joins
- Writing to Databases
- Creating tables
- Inserting records into tables
- Indexing
- Dropping Tables and Deleting Databases
- Interacting with SQLite from Python
- Connecting to SQLite databases and creating tables from Python
- Loading data into a table from Python
- Dumping Databases
- 14. Conclusion
- Where to Go From Here?
- Glossary
- Bibliography
- Index
O'Reilly Media - inne książki
-
Software as a service (SaaS) is on the path to becoming the de facto model for building, delivering, and operating software solutions. Adopting a multi-tenant SaaS model requires builders to take on a broad range of new architecture, implementation, and operational challenges. How data is partiti...(237.15 zł najniższa cena z 30 dni)
245.65 zł
289.00 zł(-15%) -
Great engineers don't necessarily make great leaders—at least, not without a lot of work. Finding your path to becoming a strong leader is often fraught with challenges. It's not easy to figure out how to be strategic, successful, and considerate while also being firm. Whether you're on the...(118.15 zł najniższa cena z 30 dni)
126.65 zł
149.00 zł(-15%) -
Data science happens in code. The ability to write reproducible, robust, scaleable code is key to a data science project's success—and is absolutely essential for those working with production code. This practical book bridges the gap between data science and software engineering,and clearl...(211.65 zł najniższa cena z 30 dni)
220.15 zł
259.00 zł(-15%) -
With the massive adoption of microservices, operators and developers face far more complexity in their applications today. Service meshes can help you manage this problem by providing a unified control plane to secure, manage, and monitor your entire network. This practical guide shows you how th...(194.65 zł najniższa cena z 30 dni)
211.65 zł
249.00 zł(-15%) -
Get practical advice on how to leverage AI development tools for all stages of code creation, including requirements, planning, design, coding, debugging, testing, and documentation. With this book, beginners and experienced developers alike will learn how to use a wide range of tools, from gener...(177.65 zł najniższa cena z 30 dni)
164.25 zł
219.00 zł(-25%) -
Rust's popularity is growing, due in part to features like memory safety, type safety, and thread safety. But these same elements can also make learning Rust a challenge, even for experienced programmers. This practical guide helps you make the transition to writing idiomatic Rust—while als...(177.65 zł najniższa cena z 30 dni)
164.25 zł
219.00 zł(-25%) -
Advance your Power BI skills by adding AI to your repertoire at a practice level. With this practical book, business-oriented software engineers and developers will learn the terminologies, practices, and strategy necessary to successfully incorporate AI into your business intelligence estate. Je...(211.65 zł najniższa cena z 30 dni)
220.15 zł
259.00 zł(-15%) -
Microservices can be a very effective approach for delivering value to your organization and to your customers. If you get them right, microservices help you to move fast by making changes to small parts of your system hundreds of times a day. But if you get them wrong, microservices will just ma...(203.15 zł najniższa cena z 30 dni)
211.65 zł
249.00 zł(-15%) -
JavaScript gives web developers great power to create rich interactive browser experiences, and much of that power is provided by the browser itself. Modern web APIs enable web-based applications to come to life like never before, supporting actions that once required browser plug-ins. Some are s...(186.15 zł najniższa cena z 30 dni)
186.15 zł
219.00 zł(-15%) -
How will software development and operations have to change to meet the sustainability and green needs of the planet? And what does that imply for development organizations? In this eye-opening book, sustainable software advocates Anne Currie, Sarah Hsu, and Sara Bergman provide a unique overview...(160.65 zł najniższa cena z 30 dni)
169.14 zł
199.00 zł(-15%)
Dzieki opcji "Druk na żądanie" do sprzedaży wracają tytuły Grupy Helion, które cieszyły sie dużym zainteresowaniem, a których nakład został wyprzedany.
Dla naszych Czytelników wydrukowaliśmy dodatkową pulę egzemplarzy w technice druku cyfrowego.
Co powinieneś wiedzieć o usłudze "Druk na żądanie":
- usługa obejmuje tylko widoczną poniżej listę tytułów, którą na bieżąco aktualizujemy;
- cena książki może być wyższa od początkowej ceny detalicznej, co jest spowodowane kosztami druku cyfrowego (wyższymi niż koszty tradycyjnego druku offsetowego). Obowiązująca cena jest zawsze podawana na stronie WWW książki;
- zawartość książki wraz z dodatkami (płyta CD, DVD) odpowiada jej pierwotnemu wydaniu i jest w pełni komplementarna;
- usługa nie obejmuje książek w kolorze.
Masz pytanie o konkretny tytuł? Napisz do nas: sklep[at]helion.pl.
Książka, którą chcesz zamówić pochodzi z końcówki nakładu. Oznacza to, że mogą się pojawić drobne defekty (otarcia, rysy, zagięcia).
Co powinieneś wiedzieć o usłudze "Końcówka nakładu":
- usługa obejmuje tylko książki oznaczone tagiem "Końcówka nakładu";
- wady o których mowa powyżej nie podlegają reklamacji;
Masz pytanie o konkretny tytuł? Napisz do nas: sklep[at]helion.pl.
Książka drukowana
Oceny i opinie klientów: Bioinformatics Data Skills. Reproducible and Robust Research with Open Source Tools Vince Buffalo (0) Weryfikacja opinii następuję na podstawie historii zamówień na koncie Użytkownika umieszczającego opinię. Użytkownik mógł otrzymać punkty za opublikowanie opinii uprawniające do uzyskania rabatu w ramach Programu Punktowego.