Installing Apache SPARK and an IDE for Scala on Windows OS

1. Install a JDK (Java Development Kit). Keep track of where you installed the JDK; you'll need that later. DO NOT INSTALL THE LATEST RELEASE; INSTALL JAVA 8. Spark 2.4.4 is not compatible with Java 9 or newer (Spark 3.0.0 also runs on Java 11, but Java 8 works with both versions). And BE SURE TO INSTALL JAVA TO A PATH WITH NO SPACES IN IT. Don't accept the default path that goes into "Program Files" on Windows, as that has a space.
2. Download a pre-built version of Apache Spark, either 3.0.0 or 2.4.4, depending on the version you want to use.
3. If necessary, download and install WinRAR so you can extract the downloaded archive.
4. Create a C:\spark directory, extract the Spark archive, and copy its contents into C:\spark. You should end up with directories like c:\spark\bin, c:\spark\conf, etc.
5. Download winutils.exe and move it into a C:\winutils\bin folder that you've created. (If you are on a 32-bit version of Windows, you'll need to search for a 32-bit build of winutils.exe for Hadoop.)
6. To make Spark on Windows behave as if it were running on Linux with Hadoop, create a c:\tmp\hive directory, cd into c:\winutils\bin, and run winutils.exe chmod 777 c:\tmp\hive
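
The folder layout these steps produce can be sanity-checked before launching Spark. The following is only a minimal sketch in Python, not part of the original instructions: the function name find_setup_problems and its parameters are hypothetical. It assumes the C:\spark and C:\winutils locations named in the steps, plus a JDK path that you supply yourself.

```python
import os


def find_setup_problems(java_home, spark_home, winutils_home):
    """Return a list of readable problems with the install layout.

    Hypothetical helper: it checks only the points mentioned in the
    steps (no spaces in the JDK path, Spark's bin and conf folders,
    and winutils.exe inside a bin folder).
    """
    problems = []

    # The JDK must live in a path without spaces (avoid "Program Files").
    if " " in java_home:
        problems.append("JDK path contains a space: " + java_home)

    # Copying the extracted archive should yield bin and conf subfolders.
    for sub in ("bin", "conf"):
        candidate = os.path.join(spark_home, sub)
        if not os.path.isdir(candidate):
            problems.append("missing Spark directory: " + candidate)

    # winutils.exe is expected at <winutils_home>\bin\winutils.exe.
    winutils = os.path.join(winutils_home, "bin", "winutils.exe")
    if not os.path.isfile(winutils):
        problems.append("missing file: " + winutils)

    return problems
```

On a machine set up as described, calling find_setup_problems with your JDK folder, r"C:\spark" and r"C:\winutils" should return an empty list; any returned strings point at the step worth revisiting.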