<?xml version="1.0" encoding="utf-8" standalone="yes"?><rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom"><channel><title>Apache Hive on Boyang Yue</title><link>http://blog.boyangyue.com/tags/apache-hive/</link><description>Recent content in Apache Hive on Boyang Yue</description><generator>Hugo</generator><language>en-us</language><copyright>A Good Year Ahead</copyright><lastBuildDate>Sun, 16 Jan 2022 21:46:37 +0800</lastBuildDate><atom:link href="http://blog.boyangyue.com/tags/apache-hive/index.xml" rel="self" type="application/rss+xml"/><item><title>The Comprehensive Guide to Hive UDF</title><link>http://blog.boyangyue.com/2022/01/the-comprehensive-guide-to-hive-udf/</link><pubDate>Sun, 16 Jan 2022 21:46:37 +0800</pubDate><guid>http://blog.boyangyue.com/2022/01/the-comprehensive-guide-to-hive-udf/</guid><description>&lt;p&gt;One of the most essential features of Spark is interaction with Hive, the data warehouse platform built on top of Hadoop. Naturally, Spark SQL supports the &lt;a href="https://spark.apache.org/docs/latest/sql-ref-functions-udf-hive.html" target="_blank" rel="nofollow noopener noreferrer"&gt;integration of Hive UDFs, UDAFs, and UDTFs&lt;/a&gt;.&lt;/p&gt;
&lt;p&gt;At a glance, delving into Hive UDFs might seem unnecessary in the Spark context, considering the extensive functionalities provided by &lt;a href="https://spark.apache.org/docs/latest/sql-ref-functions-udf-scalar.html" target="_blank" rel="nofollow noopener noreferrer"&gt;Spark UDF&lt;/a&gt;. Nevertheless, Hive UDF could prove indispensable in particular scenarios, such as &lt;strong&gt;building pure SQL environments&lt;/strong&gt; or &lt;strong&gt;optimizing performance&lt;/strong&gt;. Despite the abundance of Spark tutorials, there is a dearth of practical guides on how to work with Hive UDF, and that&amp;rsquo;s why this article is being written.&lt;/p&gt;</description></item></channel></rss>