Sparkmagic is a set of tools for interactively working with remote Spark clusters in Jupyter notebooks. Sparkmagic interacts with remote Spark clusters through a REST server. Automatic visualization of SQL queries in the PySpark, Spark and SparkR kernels; use an easy visual interface to interactively construct visualizations, no code required. Ability to capture the output of SQL queries as Pandas dataframes to interact with other Python libraries (e.g. matplotlib). Send local files or dataframes to a remote cluster (e.g. sending pretrained local ML model straight to the Spark cluster) Authenticate to Livy via Basic Access authentication or via Kerberos.
Features
- For running interactive sessions on Yarn
- For running interactive sessions on Yarn or Kubernetes (only PySpark sessions are supported)
- For running interactive sessions on Yarn or Kubernetes
- The Sparkmagic project includes a set of magics for interactively running Spark code in multiple languages
- Run Spark code in multiple languages against any remote Spark cluster through Livy
- Automatic SparkContext (sc) and HiveContext (sqlContext) creation
- Easily execute SparkSQL queries with the %%sql magic
Categories
Operating System KernelsLicense
MIT LicenseFollow sparkmagic
Other Useful Business Software
Easily Host LLMs and Web Apps on Cloud Run
Run frontend and backend services, batch jobs, host LLMs, and queue processing workloads without the need to manage infrastructure. Cloud Run gives you on-demand GPU access for hosting LLMs and running real-time AI—with 5-second cold starts and automatic scale-to-zero so you only pay for actual usage. New customers get $300 in free credit to start.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of sparkmagic!