Ever since getting my hands on some Opta data, courtesy of Manchester City’s Analytics challenge all the way back in August 2012, I’ve been wanting to try something different with it. My initial idea was to build some kind of network graph to explore how the players were interconnected throughout a match, and potentially over an entire season, although it’s taken me over a year to get around to doing it. Having started playing with Gephi some time ago, I figured it would be perfect for the job. Continue reading “Visualising a football match as a Network Graph using Gephi” »
Lately I’ve been experimenting with the Raspberry Pi, the credit-card sized budget computer that took the world by storm back in 2012. I posted the other day about the hardware I’m using to create my own Raspberry Pi-based NAS (Network Attached Storage) slash backup server slash media centre. At the end of that article I mentioned buying or creating an enclosure to tidy up the Pi-based solution and keep all the components safe and together. It’s not entirely necessary, but if you have a Raspberry Pi, one or two external HDDs, a USB hub and HDMI/Ethernet cables, chances are it’ll be messy and you’ll want something to keep everything neat and tidy. There are lots of possibilities out there: some you can buy, others you can make. There are plenty of cases for the Pi itself, but I needed one to match my particular setup and contain the hard drives, USB hub, and all the related cabling as well. Continue reading “Building a Raspberry Pi NAS: Enclosure” »
The Raspberry Pi has been a huge hit since its launch in 2012, grabbing the attention of hobbyists and professionals alike. The option to buy a fully functional, credit-card sized computer for less than £30 has opened up a slew of possibilities for experimentation and creativity, regardless of budget. I’d been meaning to pick one up for a while, and finally got the push I needed when I started doing some freelance web work and needed a backup system for my client sites and databases. Reading Scott Hanselman’s post regarding the Computer Backup Rule of Three simply drove home the point. I also fancied getting all my media files off my hard drive and into a centralised location on my home network where they could be backed up, easily accessible, and viewed through my TV. Given the option of buying a pre-made NAS box from Amazon, or constructing my own and getting my teeth into that tasty Raspberry Pi, there was no choice to be made! Continue reading “Building a Raspberry Pi NAS: Hardware” »
I’ve been meaning to write something on Power BI for a long time now, and I’m a little late in getting round to it. Most of the dust has already settled since Microsoft sent out the first round of invites to the Power BI for Office 365 preview, and a lot of people have produced some amazing work with it. Chris Webb has written a pretty comprehensive review on his blog, as have countless others.
What is Power BI?
For anyone living under a rock (or new to the world of MS BI), Power BI is a new offering from Microsoft which makes their new Excel-based self-service BI tools shareable and collaborative in a way that was previously only available for organisations rocking a SharePoint Enterprise installation. By hooking their toolkit up to Office 365, they’re providing a cloud-based ecosystem in which to share, manage and explore data, using their suite of data tools: Power Query (formerly Data Explorer), Power Pivot (formerly PowerPivot), Power View, and Power Map (formerly GeoFlow). If you want to know a bit more, I’ve got a more detailed post on the included functionality in Power BI for Office 365. Continue reading “Power BI for Office 365 first thoughts” »
I’ve really been neglecting the blog of late and have been taking a bit of a break from a lot of extracurricular business intelligence and data reading. I figured it was about time to get back to posting though, and as luck would have it, my colleague Stephen came to me with an interesting SSIS performance issue that presented the perfect opportunity for a quick blog post. I’ve not written much about SSIS lately, having been drawn off by the shiny sparkle of developments in the self-service BI sector such as Power BI, and by playing with Big Data tools like Hadoop. But I still do a lot of work with SSIS and it’s still my go-to large-scale ETL tool. Continue reading “Optimising SSIS to read from a view using OLE DB Source” »