{"product_id":"9781633436374","title":"Data Pipelines with Apache Airflow, Second Edition: Orchestration for data and AI","description":"\u003ctable\u003e\u003ctbody\u003e\n\u003ctr\u003e\n\u003ctd style=\"\"\u003e\u003cstrong\u003eAuthor\/Contributor(s):\u003c\/strong\u003e\u003c\/td\u003e\n\u003ctd style=\"\"\u003eRuiter, Julian de; Cabral, Ismael; Geusebroek, Kris; Ende, Daniel van der; Harenslak, Bas\u003cbr\u003e\n\u003c\/td\u003e\n\u003c\/tr\u003e\n\u003ctr\u003e\n\u003ctd style=\"\"\u003e\u003cstrong\u003ePublisher:\u003c\/strong\u003e\u003c\/td\u003e\n\u003ctd\u003eManning\u003cbr\u003e\n\u003c\/td\u003e\n\u003c\/tr\u003e\n\u003ctr\u003e\n\u003ctd style=\"\"\u003e\u003cstrong\u003eDate:\u003c\/strong\u003e\u003c\/td\u003e\n\u003ctd\u003e1\/27\/2026\u003cbr\u003e\n\u003c\/td\u003e\n\u003c\/tr\u003e\n\u003ctr\u003e\n\u003ctd style=\"\"\u003e\u003cstrong\u003eBinding:\u003c\/strong\u003e\u003c\/td\u003e\n\u003ctd style=\"\"\u003ePaperback\u003c\/td\u003e\n\u003c\/tr\u003e\n\u003ctr\u003e\n\u003ctd style=\"\"\u003e\u003cstrong\u003eCondition:\u003c\/strong\u003e\u003c\/td\u003e\n\u003ctd style=\"\"\u003eNEW\u003cbr\u003e\n\u003c\/td\u003e\n\u003c\/tr\u003e\n\u003c\/tbody\u003e\u003c\/table\u003e\u003cb\u003e\u003clabel\u003eGet a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.\u003c\/label\u003e\u003c\/b\u003e\u003cbr\u003e\u003cbr\u003e\u003ci open=\"\"\u003eData Pipelines with Apache Airflow\u003c\/i\u003e has empowered thousands of data engineers to build more successful data platforms. This new second edition has been fully revised for Airflow 3 with coverage of all the latest features of Apache Airflow, including the Taskflow API, deferrable operators, and Large Language Model integration. Filled with real-world scenarios and examples, you'll be carefully guided from Airflow novice to expert.\u003cbr\u003e \u003cbr\u003e Using real-world scenarios and examples, this book teaches you how to simplify and automate data pipelines, reduce operational overhead, and smoothly integrate all the technologies in your stack. Part reference and part tutorial, each technique is illustrated with engaging hands-on examples, from training machine learning models for generative AI to optimizing delivery routes.\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e In \u003ci open=\"\"\u003eData Pipelines with Apache Airflow, Second Edition \u003c\/i\u003eyou'll learn how to:\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e • Master the core concepts of Airflow architecture and workflow design\u003cbr open=\"\"\u003e • Schedule data pipelines using the Dataset API and time tables, including complex irregular schedules\u003cbr open=\"\"\u003e • Develop custom Airflow components for your specific needs\u003cbr open=\"\"\u003e • Implement comprehensive testing strategies for your pipelines\u003cbr open=\"\"\u003e • Apply industry best practices for building and maintaining Airflow workflows\u003cbr open=\"\"\u003e • Deploy and operate Airflow in production environments\u003cbr open=\"\"\u003e • Orchestrate workflows in container-native environments\u003cbr open=\"\"\u003e • Build and deploy Machine Learning and Generative AI models using Airflow\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eAbout the Technology\u003c\/b\u003e\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e Apache Airflow provides a unified platform for collecting, consolidating, cleaning, and analyzing data. With its easy-to-use UI, powerful scheduling and monitoring features, plug-and-play options, and flexible Python scripting, Airflow makes it easy to implement secure, consistent pipelines for any data or AI task.\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eAbout the book\u003c\/b\u003e\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003ci open=\"\"\u003eData Pipelines with Apache Airflow, Second Edition \u003c\/i\u003eteaches you how to build, monitor, and maintain effective data workflows. This new edition adds comprehensive coverage of Airflow 3 features, such as event-driven scheduling, dynamic task mapping, DAG versioning, and Airflow’s entirely new UI. The numerous examples address common use cases like data ingestion and transformation and connecting to multiple data sources, along with AI-aware techniques such as building RAG systems.\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eWhat's inside\u003c\/b\u003e\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e • Deploying data pipelines as Airflow DAGs\u003cbr open=\"\"\u003e • Time and event-based scheduling strategies\u003cbr open=\"\"\u003e • Integrating with databases, LLMs, and AI models\u003cbr open=\"\"\u003e • Deploying Airflow using Kubernetes\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eAbout the reader\u003c\/b\u003e\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e For data engineers, machine learning engineers, DevOps, and sysadmins with intermediate Python skills.\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eAbout the author\u003c\/b\u003e\u003cbr open=\"\"\u003e \u003cbr open=\"\"\u003e \u003cb\u003eJulian de Ruiter\u003c\/b\u003e, \u003cb\u003eIsmael Cabral\u003c\/b\u003e, \u003cb\u003eKris Geusebroek\u003c\/b\u003e, \u003cb\u003eDaniel van der Ende\u003c\/b\u003e, and \u003cb\u003eBas Harenslak\u003c\/b\u003e are seasoned data engineers and Airflow experts.\u003cbr\u003e \u003cbr\u003e \u003cb\u003eTable of Contents\u003c\/b\u003e\u003cbr\u003e \u003cbr\u003e Part 1\u003cbr\u003e 1 Meet Apache Airflow\u003cbr\u003e 2 Anatomy of an Airflow DAG\u003cbr\u003e 3 Time-based scheduling\u003cbr\u003e 4 Asset-aware scheduling\u003cbr\u003e 5 Templating tasks using the Airflow context\u003cbr\u003e 6 Defining dependencies between tasks\u003cbr\u003e Part 2\u003cbr\u003e 7 Triggering workflows with external input\u003cbr\u003e 8 Communicating with external systems\u003cbr\u003e 9 Extending Airflow with custom operators and sensors\u003cbr\u003e 10 Testing\u003cbr\u003e 11 Running tasks in containers\u003cbr\u003e Part 3\u003cbr\u003e 12 Best practices\u003cbr\u003e 13 Project: Finding the fastest way to get around NYC\u003cbr\u003e 14 Project: Keeping family traditions alive with Airflow and generative AI\u003cbr\u003e Part 4\u003cbr\u003e 15 Operating Airflow in production\u003cbr\u003e 16 Securing Airflow\u003cbr\u003e 17 Airflow deployment options\u003cbr\u003e A Running code samples\u003cbr\u003e B Prometheus metric mapping","brand":"Manning","offers":[{"title":"Default Title","offer_id":47131741946111,"sku":"9781633436374","price":59.99,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0452\/0886\/2873\/files\/9781633436374_s600x595.jpg?v=1781621892","url":"https:\/\/massivebookshop.com\/products\/9781633436374","provider":"MASSIVE BOOKSHOP","version":"1.0","type":"link"}