## CA5 – A brief look at C++ in RStudio

CA5: Understanding the basics of C++ with R. Sometimes R code is not fast enough to process a large data set and extra speed is required. Rcpp allows R programmers to seamlessly integrate C++ code into their R workflow. This post briefly discusses getting the two languages working together, write some …

## CA5 Multi-agent Systems

CA5 Multi-agent Systems – Question 5 from the 2017 sample paper. Using Question 5 from the October 2017 sample paper, use the simulated data set from Question 1 of the same paper to: Part a) Adopt a centralised scheme for all agents and sketch the graphical scheme. Question 1 described …

## CA5 – GLM to model OT to TeamsPts and OppPts

CA5: Use GLM to model OT to TeamsPts and OppPts. Using Question 4 from the October 2017 sample paper, fit a GLM on ndbaodds201415.csv. Part a) Train the model using 80% of this data set and suggest an appropriate GLM to model OT to the TeamsPts and OppPts variables. Read in …

## CA5 – Factor Analysis

Introduction – CA5 Factor Analysis. The link between factor analysis and regression: when you are given a large data set, it is often beneficial to use factor analysis, with PCA as the technique for applying it. Both regression and factor analysis allow you to connect the dots between the variables …

## CA4 Perform Multiple Linear Regression on a chosen data set

Introduction – CA4 Multiple Linear Regression. Regression analysis is a statistical technique used to investigate and model relationships between variables. There are numerous examples of regression analysis in industry, health insurance being one of the most popular, but other fields include engineering, economics, and the life and social sciences. This blog describes …

## CA04_Multiple Linear Regression

This project uses multiple linear regression to analyze asphalt surface free energy in mJ/m^2 (srf.fr.eng) against several variables. The data was taken from http://users.stat.ufl.edu/~winner/data/asphalt_binder.csv and the variables are listed below: % Saturates (saturates), % Aromatics (aromatics), % Resins (resins), % Asphaltenes (asptenes), % Wax (wax), % Carbon (carbon), % …

## Multiple Linear Regression – CA4 – Student Test Grades

Student Performance Data Set and Applying Multiple Linear Regression, by James Gallagher. For my first blog post I will be conducting multiple linear regression in the statistical programming software R. The data set I have chosen is from the University of California, is an open data set, and is …

## Multiple Linear Regression with R – CA4

Mathematical relationships describe many aspects of everyday life, and to model these relationships we use linear regression analysis. One of the most common forms of linear regression analysis is multiple linear regression, which is used to explain or predict the relationship between one continuous dependent variable and two or more independent variables where the …

## Big data

What is Big Data? When we speak of Big Data, we mean data sets or combinations of data sets whose size, complexity (variability) and speed of growth (velocity) make their capture, management, processing or analysis difficult with conventional technologies and tools, such as relational databases and conventional statistics or …

## Big Data 5 V's

There are many definitions of Big Data, ranging from that of Tim Kraska, who considers Big Data to be the class of data for which the technology currently in use cannot deliver answers of acceptable cost, time and quality when that data is exploited. Going by …