{"id":43086,"date":"2023-10-20T15:25:58","date_gmt":"2023-10-20T15:25:58","guid":{"rendered":"http:\/\/startupsmart.test\/2023\/10\/20\/seven-applications-for-government-data-that-are-actually-useful-startupsmart\/"},"modified":"2023-10-20T15:25:58","modified_gmt":"2023-10-20T15:25:58","slug":"seven-applications-for-government-data-that-are-actually-useful-startupsmart","status":"publish","type":"post","link":"https:\/\/www.startupsmart.com.au\/uncategorized\/seven-applications-for-government-data-that-are-actually-useful-startupsmart\/","title":{"rendered":"Seven applications for government data that are actually useful – StartupSmart"},"content":{"rendered":"
\"\"<\/div>\n

Plenty of government agencies are keen to use data to improve their own efficiency or better the services they provide the public, but simply lack the experience to know which questions to ask.<\/p>\n

That\u2019s the call from one of the world\u2019s leading experts on data in government, who says agencies need to get data smart staff and start asking questions.<\/p>\n

<\/span>\u201cEven if they have the people, they don\u2019t quite know what can be done,\u201d says Rayid Ghani, director of the University of Chicago\u2019s Center for Data Science and Public Policy.<\/p>\n

It doesn\u2019t help that there tend not to be enough staff to lend time to pursuing an undefined goal, either.<\/p>\n

Part of the problem is that while there are always examples of using data to improve business processes coming out of large companies like Google, the work of government is often too different or specialised to fit with private sector use cases.<\/p>\n

\u201cIf you\u2019re a retailer and you\u2019re doing pricing, you go buy a pricing tool and install it,\u201d the former Obama 2012 campaign chief scientist mused at a Monash University event earlier this week.<\/p>\n

\u201cBut if you\u2019re a health agency or human services agency and you want to predict who\u2019s at risk of something, you start with something vanilla.<\/p>\n

“People will sell you lots of tools, but none of them do that.<\/p>\n

“They do something else, and then you have to pay them to do customisation and integration and all of that, whereas the private sector already has these tools for very customised problems.\u201d<\/p>\n

But it can be extremely worthwhile if you figure out how to use data analysis well. It can provide useful answers for some highly vexed questions, but also some surprising ones.<\/p>\n

One example Ghani offered was a risk prediction system for police brutality in the United States.<\/p>\n

In the system Ghani reviewed, the criteria for which officers were most likely to offend were arbitrary, having been drawn up by people sitting around a table suggesting what they thought would be the key indicators.<\/p>\n

This rendered it basically useless as a predictive tool. Not only did it flag 50% of all officers as being at-risk, making it impossible to target interventions at the right people, but some of those who would later offend weren\u2019t even flagged \u2014 only pre-identifying 70% of problem individuals.<\/p>\n

Looking at historical data allowed for a far more accurate picture of who was likely to offend.<\/p>\n

It also threw up some unexpected results, suggesting the number of attendances at suicide or family violence-related incidents in the previous two weeks was one predictor for brutality \u2014 underlining that the solution needed to not just be about getting rid of risky people, but managing the mental health of all staff better.<\/p>\n

The seven most common applications for data<\/h3>\n

Ghani, who often works with government agencies to come up with data solutions, explained that most problems fit into one of seven types.<\/p>\n

The first four are the top operational problems he sees, while the others focus on longer term issues.<\/p>\n

1. Prevention and early warning systems<\/h4>\n

First, prevention and early warning systems. Crunching a large amount of data \u2014 some of which does not even need to be collected, but is sitting in a siloed database in a different branch or agency \u2014 can help predict where problems are likely to arise with much greater accuracy than existing systems.<\/p>\n

Ghani gave the example of lead-based paint in houses in the United States.<\/p>\n

As the US Environmental Protection Agency notes, \u201cif your home was built before 1978, there is a good chance it has lead-based paint.\u201d<\/p>\n

Young children are particularly at risk of harm from peeling lead-based paint, thanks to their habit of picking up things off the floor \u2014 which may be covered in lead-contaminated dust \u2014 and putting them in their mouths.<\/p>\n

But previously the government had no predictive system in place, relying on the clearly problematic risk indicator of children developing lead poisoning, which causes irreversible damage, to know where the problem lay.<\/p>\n

A predictive approach was needed, but it would not be practical to test every old house in the country for lead contamination.<\/p>\n

Instead, it turned out there was already a large amount of useful compliance data that had not been tapped for prevention, so by putting that pre-existing data to work, the US government was able to pinpoint which houses were most at risk of having lead paint, and is now able to work with householders most at risk of exposure to deal with the problem.<\/p>\n

Of course, as Ghani pointed out, the data itself does not solve<\/em> the problem, but is one effective tool that contributes to the solution \u2014 the issue now is convincing parents something as seemingly mundane as paint could be a serious risk to their child\u2019s health.<\/p>\n

2. Prioritisation in compliance and inspections<\/h4>\n

The second common use for data is prioritisation in compliance and inspections \u2014 the problem of \u201cI\u2019ve got this many things I need to inspect, I can only inspect this many, how do I prioritise that?\u201d Ghani explained.<\/p>\n

In work he did with the EPA Ghani discovered that many of the criteria for deciding where inspections should happen were arbitrary, meaning inspectors were not targeting those likely to be non-compliant.<\/p>\n

Again, data analysis can help reveal who is most likely to be breaking the rules, allowing the agency\u2019s finite resources to be spent where they will get results \u2014 but also meaning companies doing the right thing aren\u2019t being bothered by government.<\/p>\n

3. Scheduling service delivery<\/h4>\n

Third is scheduling of service delivery \u2014 \u201cambulances, medics, any type of thing that\u2019s figuring out who do I send, where do I send them, how do I move them around,\u201d he says. When emergency services respond, for example, they need to know whether one police car needs to be sent, or two. Or three fire trucks.<\/p>\n

If you don\u2019t dispatch enough, precious time is wasted correcting the error; dispatch too many and someone else who needs them might miss out.<\/p>\n

Being able to use data to inform those decisions can help the system run more efficiently \u2014 not only are services better but you might not need to buy as many new police cars.<\/p>\n

He gave the example of how a non-government organisation in Kenya that provides public toilets had reached its service capacity.<\/p>\n

It was sending teams to empty every toilet every day \u2014 there was no public sewage system \u2014 but could not afford to hire anyone else so could not build more.<\/p>\n

Data on how each toilet was used allowed the NGO to only attend each one as often as was necessary, boosting their capacity two and a half times \u2014 a huge return from a relatively small system tweak. City of Melbourne recently started doing something similar with so-called smart bins.<\/p>\n

4. Routing information<\/h4>\n

Fourth is routing information within the organisation \u2014 \u201cyou\u2019ve got requests coming in and you need to figure out which department should they go to,\u201d Ghani explained. \u201cIt\u2019s a pretty mundane, boring task and right now most often humans do that.\u201d<\/p>\n

Data can help software figure out where to send files; automation means lower costs.<\/p>\n

5. Intervention<\/h4>\n

Fifth is using data to figure out which intervention is most worth doing to get the desired impact. Ghani has recently been working with the Mexican government, for example, around maternal mortality.<\/p>\n

\u201cThey tried to reduce it but they just weren\u2019t sure why it wasn\u2019t going down. So they wanted more data to figure out of the 3000 policies they could possibly modify, which ones should we prioritise?<\/p>\n

“Which five or six should we narrow it down to that we then explore and decide which one is the policy to do?\u201d<\/p>\n

6. Evaluations<\/h4>\n

Sixth is conducting evaluations that can then be used to optimise policy, \u201cwhere you\u2019re really looking at historical data to see who did the policy work on, which people did it help, which people did it hurt.\u201d Often evaluation is used to justify funding, but Ghani says historical data can be used more often to figure out how to better target citizens in future.<\/p>\n

7. Structuring data<\/h4>\n

The final use, Ghani notes, is working out how to turn unstructured data into structured data. This means processing information held in audio, video or text to enter into a database which can then be used for other purposes.<\/p>\n

Commit to genuine collaboration<\/h3>\n

Education and training need to improve if governments are to properly harness the possibilities of data, Ghani says.<\/p>\n

\u201cUniversities generally do a pretty bad job at training people how to do useful things with data science in general, but especially with problems governments are facing.<\/p>\n

“Most students, you ask them what are the top five problems that a health agency faces? You\u2019re going to get a blank look,\u201d he suggests.<\/p>\n

Even hackathons tend to be of limited value, tending to result in a map, often telling agencies information they already know.<\/p>\n

Too many public sector bosses don\u2019t know which questions to ask, how to build a data team or even consume the results they\u2019re given, he adds.<\/p>\n

There is a lack of use cases governments can adapt.<\/p>\n

And the incentives make it worse \u2014 doing data analysis can have a significant positive impact, but the perceived risks and lack of criticality can mean momentum never builds.<\/p>\n

Collaboration between the public sector and universities is a great way to build experience and produce useful results.<\/p>\n

\u201cBut our constraint is these projects have to be real projects, they can\u2019t be made up,\u201d Ghani notes.<\/p>\n

It needs to be worth the effort, which will be undermined by secrecy, control or unwillingness to use the results.<\/p>\n

\u201cIt needs to have real data that you have. It can\u2019t be you download some data from somewhere on the web and play around with it. And it needs to be a real agency, an organisation that\u2019s willing to implement and validate and actually do something with this.\u201d<\/p>\n

The other thing standing in the way of governments using data science more is that the tools developed often remain on a (digital) shelf gathering dust.<\/p>\n

\u201cThey often stay siloed somewhere in whoever did the work and nobody else finds out about them. So the next person has to start from scratch,\u201d he says.<\/p>\n

\u201cSo given that government is about helping people, what about open sourcing these things, or \u2026 you could create reusable software or data. Those are the type of things we\u2019ve been focused on.\u201d<\/p>\n

This article was originally published on The Mandarin. <\/em><\/p>\n

Follow StartupSmart on<\/em> Facebook,<\/em> Twitter, LinkedIn and iTunes. <\/em><\/p>\n","protected":false},"excerpt":{"rendered":"

Plenty of government agencies are keen to use data to improve their own efficiency or better the services they provide the public,<\/p>\n","protected":false},"author":1,"featured_media":59977,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[19,18,1],"tags":[],"_links":{"self":[{"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/posts\/43086"}],"collection":[{"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/comments?post=43086"}],"version-history":[{"count":0,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/posts\/43086\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/media\/59977"}],"wp:attachment":[{"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/media?parent=43086"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/categories?post=43086"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.startupsmart.com.au\/wp-json\/wp\/v2\/tags?post=43086"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}