Here we provide an example of how to do linear regression using the Spark ML (machine learning) library and Scala. We will do a multiple regression example, meaning there is more than one input variable. The goal is to read sample data and then train the Spark linear regression model. From there we can make predicted […]
If you have ever worked with or for a printing firm, you will undoubtedly know about the rift that often exists between designers and the actual people who put ink on paper. The designers think the printers are being overly awkward and restrictive when they inform the designer that something is impossible to achieve. The […]
In the current connected environment, it is very easy to trap ourselves into believing that every device and every application has access to a reliable, low-latency network connection. And whilst this may generally be true, for some applications it is not enough. Consider for a moment how often your 3G/4G mobile internet connection bombs […]
Riak is, at heart, a NoSQL-style data storage engine. Where Riak differs from other NoSQL database platforms is that it has been developed and optimized to handle massive amounts of sequential, small-sized data packets. Furthermore, Riak is intended to collect data from IoT devices. When we combine these two differentiators, we find […]
The TICK Stack is a collection of associated technologies which combine to deliver a platform for capturing, storing, monitoring and visualizing time series data. The TICK Stack consists of the following technologies: Telegraf – collection of time-sequential data from a range of sources including IoT devices. InfluxDB – high performance and […]
As cluster computing frameworks go, Apache Spark is undoubtedly a major player in the Big Data market. The ability to interface with other major Big Data technologies such as Hadoop and Cassandra, whilst bringing in major cloud platforms like Amazon Web Services, makes it almost the go-to tech for companies looking to deploy a […]
A monolithic application describes a single-tiered software application in which the user interface and data access code are combined into a single program for a single platform. A monolithic application is self-contained and independent from other computing applications.
ELK Stack, or Elastic Stack as it has just been rebranded, has received a long-awaited update to version 5. But before we take a look at some of the major improvements in this new version, it might be best to reiterate exactly what Elastic Stack is. Elastic Stack is a platform […]
The AWS Lambda service is part of the overall application hosting platform that makes up the full Amazon Web Services product offering. Where AWS Lambda differs from other services within the AWS infrastructure is that it is an on-demand style service. Application functions that operate under the AWS Lambda service will only be […]
How do you use external Scala frameworks or Java jar files in the Scala Command Line Interpreter (REPL, aka read–eval–print loop)? You use SBT. SBT is the Scala Interactive Build tool. You can use it to build Scala or Java projects and run a complex series of tasks. It does the same thing as Maven. […]
Smartsourcing is a brief guide to the world of modern technology partnerships. It was developed through a collaborative effort of top Zymr executives as we uncovered a gap in the market between the perception of what outsourcing used to be, and how leading technology innovators are leveraging this globalized approach to value generation. Read this guide to learn...