We’re living in a world of big data. The current generation of line-of-business computer systems generates terabytes of data every year, tracking sales and production through CRM and ERP. It’s a flood ...
Apache Spark is a project designed to accelerate Hadoop and other big data applications through the use of an in-memory, clustered data engine. The Apache Software Foundation describes the Spark project this ...
What I'd like to cover here goes beyond those AI headlines, however, and involves a special nugget just for folks doing data engineering, analytics and machine learning work with Apache Spark.