Tag Archive for 'samknows'

Reliable data


Recently we have seen BDUK announce the funding allocations to local authorities and the devolved assemblies, and the companies aiming hoping to get on the national framework have been short-listed. The sums awarded to councils were modelled by BDUK according to their understanding of need, and at the moment the framework companies are trying to develop a consistent understanding of what will be required of them and their shareholders should they be successful.

At stake is the investment of billions of pounds and public and private money, and the future competitiveness of the UK economy. Yet questions have been raise in several quarters for quite some time now about the accuracy of BDUKs data on which all this investment sits. So for the record I decided to correlate a source of data I have grown to trust – from who in turn get their DSL data from BT – against a set of BDUK data for the same area. The sample included a little over 19,000 postcodes.

BDUK Broadband Speed data

(click the graph to see a bigger version)

The plot shows BDUK speeds along the horizontal with BT speeds on the vertical, with each point representing a postcode average. If the two sets agreed the points should broadly align along the diagonal but its clear there is a limited correlation between the two sets.

This data is for , so the first location I checked was my own postcode. BDUK suggests that I should get 13971.456kbps while BT suggests I get 6Mbps with ADSL2+. With an ordinary ISP I do in fact get 6 Mbps (Be There uniquely allow me to tune the connection so I get a shade more).

In fact on 76% of occasions the BDUK data offers faster speeds than BT’s reported data, and on average 52% faster.

When focussing in on just the 2 Mbps Universal Service Commitment, relying on BDUK data would result in about 900 postcodes having a problem addressed which doesn’t exist, yet almost 40% of the areas which do suffer broadband at less than 2 Mbps would have been missed altogether.

In 63 cases the discrepancy was more than 22 Mbps – or rather BDUK expected people to receive what they now consider “superfast broadband” when in fact no broadband was available at all.

From what I understand none of the usual sources relied on by the industry provided BDUK with this data and that the speeds are reported to thousandths of a kilobit suggests Excel may have been involved somewhere along the line rather than empirical data.

This information was provided to BDUK but they were largely unconcerned about the discrepancy at the time.

I’ll allow you to come to your own conclusions about the impact this might have had on the decisions BDUK is making and the fairness of funding allocations. For organisations seeking to be part of the framework, this data appears to be having a continuing impact.

NOTE: This is one of a number of blog articles which had gone unpublished for some time, occasionally dusted off and updated but left on the spike. For much of BDUK’s existence I have been supportive, and after it became clear that they were ignoring offers of help and advice from many of the people I know I had remained reluctant to be openly dismissive. But as the programme evolved it has become harder and harder to be supportive, there became fewer and fewer good news stories to write about, and my own postings became less frequent and rarely positive good news stories.

I’m publishing this now to draw a line under the whole process – time to get on with projects that make a difference in reality.

Data is king – part 2


In the first part I looked the importance of using empirical data to support a local narrative when developing a broadband policy by looking at some of the pitfalls of opinion polls on the state of a communities infrastructure. In this second part I start to look at the kind of data and exercises that can help target interventions.

The first exercise has to be to understand the state of now – what does the broadband landscape look like today? For me there is only one resource for this kind of information – Samknows.

Their data is all from primary resources – BT, Virgin, and the unbundlers – and is carefully cross-referenced against their own checks. Their data forms the backbone of the price comparison sites and their services are the basis of Ofcom broadband quality studies.

Over the last year or so I have done a number of projects with , mapping and analyzing their data. Each time it has thrown up new local insights and shone new light on the national picture. And each time it has helped to create a middle ground on which a less emotive debate can develop; when faced with the data its no longer really possible to simply blame BT or the regional government, but equally it provides policy makers with an image of the broadband landscape around which its possible to plan.

Such a rich image is only possible with very high quality data. There have been many attempts to model broadband speeds based on paper models of ADSL performance curves from manufacturers and tools that calculate the radial, as the crow flies distance from the BT exchange.

Some of these have tried to build in factors for guessing the true cable length, quality and so on – but at the end of the day they are just increasingly smart guesses – and this becomes very clear when you try to map the result. Estimating broadband speeds based on the radial distance from the exchange will create nice, uniform shapes on a map from which one glance tell you its not a reality.

Mapping quality empirical data provides a more organic image of broadband performance which starts to mould itself to the geography and topography of the area. From this its possible to build a narrative which links the cold data with the tales of broadband woe from communities – and to slay a few myths along the way.

This can perhaps be best demonstrated by a piece of work I recently completed for County Council. This is my home county and I wanted to see if I could create a map able  to highlight some of the broadband stories I knew to be true, and from that to seek out new detail which may have lain slumbering for the reasons highlighted in part 1.

So as well as the nice clean maps which blanket fill a postcode polygon with traffic light colours to represent poor, mean and good broadband speeds , I ran the data through a series of GIS modules to create a contoured heat map of the county – while its harder to say precisely what the speed is at a given location, it does provide a much richer analogue from which the broadband landscape can be described.

Trialling broadband speed contours

Its clear that the underlying data is not based on a simple model and is far from producing the neat conical contours of a radial guesstimate. A quick glance shows how broadband speeds are affected by the contours of hills and valleys, and by man-made features like railways.

From the generalities I wanted to test some established broadband legends. There were long rumours that broadband in parts of central Oxford were slow, and the reasons given seemed perfectly plausible but unproven. The story was always that BT had centralised its various telephone exchanges in the city, which increased the cable lengths, and that in this part of the city the cables had to additionally skirt around the old Morris car plant making them too long to support a good broadband service, even though some of the homes and businesses affected my be just a few hundred metres from the newer telephone exchange.

A glance at the new map clearly shows a glacial valley of poorer broadband to the North East of Oxford’s broadband summit. So while the now BMW car plant is much more compact, the data appears to support tales of  the ghost of the city’s industrial past still be haunting of one of the world’s most important knowledge centres.

Once I’d calibrated my eye using a few known problem spots, it became easier to start hunting new broadband stories. Slightly to the north of the city is Oxford Airport. Following the northern perimeter fence the map predicts the existence of an ox-bow lake of poor broadband coverage – perfectly obvious when you think about it. Would you really feel happy allowing Openreach engineers exercising the code powers to dig a trench across the runway?

The granularity of Samknows data again helps to highlights what is a small, localised broadband issue which would be completely missed by lesser datasets.

So now when I work with the good people at Samknows, I still produce the coloured political maps of broadband speed using traffic light colours to highlight potential problem areas but now I include the contoured explorers map which helps to link the data with the local legends.

In the final part of this series, I’ll start to look at some of the other data models which help to add further colour to impact of poor broadband.

Mapping broadband

The debate about broadband evolution and availability seems to become emotive often for want of  hard facts in a digestible from. As I couldn’t find anyone who could provide me with the maps I needed I started my own mini practice alongside my usual strategy work, broadband and related information to support business and technical architecture.

Over the last while I’ve worked with Samknows, the oracle of primary broadband data; blending their data with information from other sources like the Office of National Statistics provides a fantastic insight into broadband availability and the people whose lives it touches.

Representing national data means mapping systems and databases so I’ve built up a substantial repository of data, modelled around a range of mapping tools; working with Samknows as often as I can has resulted in an unparalleled picture of broadband Britain which has unpicked any number of myths and assumptions – certainly plenty of my own.

Below are some examples from this output – contact me if you’d be interested in me helping you understand the broadband landscape in your area.

Trialling broadband speed contours This first one came from recently tinkering around with some of the more hidden tools in my tool-kit to see if there were other ways of portraying the broadband landscape. Blue areas are cooler, slower broadband speeds, rising to warmer, sunlit areas with good performance. Not sure if it quite works or not but its different at least.
Mean distance between neighbours This image uses a model I put together using ONS land-use and population data t create a proxy for the cost and effectiveness of next generation broadband investments. The model estimates the mean distance between neighbours – the further two home are apart, the more expensive they will be to install fibre-optic cables, and the less effective fibre to the cabinet is likely to be.
Enfield ADSL speeds This map of the London Borough of Enfield uses a more traditional approach to mapping broadband speeds reported at ONS Lower Super Output Areas (LSOAs); at this level some of the broadband detail is lost but it does allow the data to be linked to other public datasets like deprivation indices or transport data.
Final Third Parliament map This was just a bit of election fun really – what would parliament look like if only MP’s who represent constituencies in the “Final Third” could form the Government.
DCLG NGA model for Cumbria at 70% The Department for Communities and Local Government (DCLG) commissioned a model which attempted to predict next generation access broadband for a number of penetration levels. This is the result for Cumbria should NGA reach 70% of the UK population.

Wow!!


I’ve been dabbling in systems for a while now, looking at how broadband and social data can be combined to better understand the nature of the digital divide, and to just simply understand what the broadband landscape really looks like.

For broadband information there is only one source of reliable primary data – .

For reliable social data the ONS is pretty good although finding exactly what you want can be a bit of chore  but  with the new data.gov.uk website this is only going to get better.

But at some point you need to combine all this information on a map. The UK and Ireland are fairly unique here in having a service that produces fantastically accurate and useful maps but the down side of this is that for most applications today this level of detail is rarely necessary but and the process is mind bogglingly expensive. The impact has been that I could find perfectly adequate maps of almost anywhere in the world to model my data but for a long time really struggled to find an affordable compromise in the UK.

I then started to use Openstreetmap – an opensource/creative commons mapping project which has through leaps and bounds got better and better. With the entire world held in a database on my machine I’m able to produce perfectly reasonable maps for most of the work I do – except in the most important areas I need to understand – rural areas. Openstreetmap relies on the goodwill of its supporters to trace using GPS the areas it maps – fewer people live in rural areas so naturally less of it is mapped well.

Its felt like the OS have fought to retain their right to charge very large sums to anyone wanting access to their data, regardless of the use and need, so it was with a sense of cynicism I decided to take a peek at the ’s Opendata website launched last week – the place the OS have released some of their map information to the public.

There is only one word which really captured what I found:

WOW!!

There are geo-coded maps of various scales in glorious detail and superb quality ready to be loaded straight into my tools.

There is a variety of GIS files which provide any number of other locational resources including parliamentary constituency boundaries, councils, the lot.

And there is a file which says where each and every postcode is

So like a child in a sweetshop I delved in, downloading all the files which I’d wanted for so long but had to cobble together from secondary sources – now I had them in the original form from the most respected map makers in the world.

But then a problem – one of the files, the one I really wanted containing postcode data, didn’t download. So I dropped the OS a line expecting an automated reply in a day or two, leading to some perfunctory reply in a few days to say it was really my fault and to try again.

How wrong – within a few minutes I got an apologetic mail from Jamie, one of their developers on their help desk, asking a few sensible questions and we exchanged a few more emails before his shift finished, when I got a new thread from Dr Paul who picked the problem up until it was fixed. It seems a bit of test data in their Goliath system had refused to be flushed from a cache somewhere which given the scale of their launch is a pretty minor problem.

The launch of the data is absolutely fantastic – their support during the launch is something else!

Hats off to the Ordinance Survey – I’m off to do some mapping!



Get Adobe Flash playerPlugin by wpburn.com wordpress themes

Page optimized by WP Minify WordPress Plugin