Enable the Apache Spark REST API on Oracle Cloud-based Linux VMs

Johan Louwers
2 min read · Apr 18, 2024
Apache Spark — REST API

Apache Spark is a powerful, open-source, distributed computing system that has rapidly become a cornerstone in big data processing and analytics. Offering lightning-fast data processing speeds and versatile functionality, Spark has revolutionized how organizations handle large-scale data analytics and machine learning tasks. With its intuitive APIs, extensive library support, and seamless integration with various data sources, Spark empowers businesses to derive valuable insights from vast and complex datasets, driving innovation and decision-making across industries.

There are several methods to engage with Apache Spark, and one of them involves using the REST interface. This interface offers a simple way to interact with the Apache Spark master node programmatically. However, despite its benefits, some individuals may be unsure of where to locate the REST interface.

Many resources indicate that the REST interface is accessible via port 6066 on the Apache Spark master node IP address. While this is accurate, these sources often overlook the fact that the REST API is disabled by default.
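A quick way to confirm whether the REST interface is actually listening is to probe the port from a machine that can reach the master node. The snippet below is a minimal sketch, not part of the original setup: the function name `rest_port_open` and the example IP address are hypothetical, and it only tests whether a TCP connection to port 6066 succeeds. To my knowledge the Spark standalone documentation lists the controlling property as `spark.master.rest.enabled` (default `false`), so on a fresh installation this check is expected to fail.

```python
import socket


def rest_port_open(host: str, port: int = 6066, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    This only shows that something is listening on the port; it does not
    prove that the listener is the Spark REST submission server.
    """
    try:
        # create_connection resolves the host and performs the TCP handshake
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, timeouts, and unreachable hosts
        return False


if __name__ == "__main__":
    # Hypothetical master node address; replace with your own
    master_ip = "10.0.0.10"
    state = "reachable" if rest_port_open(master_ip) else "not reachable"
    print(f"REST port 6066 on {master_ip} is {state}")
```

Keep in mind that on Oracle Cloud a "not reachable" result can also come from a security list or a host firewall rule blocking the port, so a failed probe does not by itself mean the REST API is disabled.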

Enable the REST API

An earlier post already gave some insights into how to deploy Apache Spark on Oracle Linux within Oracle Cloud. We have built upon this and extended…

Johan Louwers is a technology enthusiast with a long background in supporting enterprises and startups alike as CTO, Chief Enterprise Architect and developer.