DBA

Script for creating test data for odds ratio

Make sure you have a working version of SQL Server 2016. USE [master] GO /****** Object: Database [WorldHealth] ******/ CREATE DATABASE [WorldHealth] CONTAINMENT = NONE ON PRIMARY ( NAME = N’WorldHealth’, FILENAME = N’D:\Microsoft SQL Server\DATA\WorldHealth.mdf’ , SIZE = 8192KB , MAXSIZE = UNLIMITED, FILEGROWTH = 65536KB ) LOG ON ( NAME = N’WorldHealth_log’, FILENAME… Continue reading Script for creating test data for odds ratio

Data Mining · Statistics with R and SQL

Descriptive Statistics with SQL and R – II

In the previous post I looked into some very basic and common measures of descriptive statistics – mean, median and mode, and how to derive these using T-SQL, R as well as a combo of the two in SQL Server 2016. These measures also called measures of ‘Central Tendency‘. In this post am going to… Continue reading Descriptive Statistics with SQL and R – II

Statistics with R and SQL

Script to create demo database and load data for statistics and R

Make sure you have a working install of SQL Server 2016. The size of the database is only 8 MB. USE [master] GO /****** Object: Database [WorldHealth] Script Date: 7/15/2016 4:44:58 PM ******/ CREATE DATABASE [WorldHealth] CONTAINMENT = NONE ON PRIMARY ( NAME = N’WorldHealth’, FILENAME = N’D:\DATA\WorldHealth.mdf’ , SIZE = 8192KB , MAXSIZE =… Continue reading Script to create demo database and load data for statistics and R

Data Mining · Statistics with R and SQL

Descriptive Statistics with SQL and R – 1

I started my re-discovery of statistics with an introduction here. This second post is about descriptive statistics – very basic, simple statistics you begin with as a learner. Descriptive Statistics are also called Summary Statistics and serve to describe/summarize the data. They allow you to understand what the data is about and get a feel for… Continue reading Descriptive Statistics with SQL and R – 1