The Fiji Times

Gambling on nation’s future?

By NALEEN NAGESHWAR

How effective are surveys and the informatio­n collected from surveys for decision making?

It’s not uncommon to analyse survey data to turn the raw data collected into insights and answers to both simple and complex operational questions.

The aim is to inform executive decisions that improve things for business, for government and, most importantly, for our people.

Close enough is not good enough

How analyses of these surveys are used is an interesting question. Is the analysis merely enough to justify the decision you were going to make anyway? Or is it close enough, and therefore good enough, to base decisions on?

For context consider Fiji’s Household Income and Expenditur­e Survey (HIES), a nationally representa­tive survey conducted by the Fiji Bureau of Statistics (FBoS) every five years.

HIES – Household Income and Expenditure Survey

The most recent HIES was done “before the onset of the COVID-19 pandemic … on a representa­tive sample of 6000 households. The survey has provided a comprehens­ive view of the wellbeing of Fijian households between 2019 and 2020 by producing indepth informatio­n on a wide range of topics, including access to services, livelihood­s, migration, consumptio­n patterns, and exposure to shocks, among others.”

The then CEO of FBoS stated that “one of the primary objectives of the 2019-20 HIES was to collect data on household income and consumption that can be used to estimate poverty and inequality in the country. This survey provided the basis for a new benchmark and methodology for measuring poverty based on international best practices and has thus marked the beginning of a new series of poverty estimates in Fiji” (2021).

Data collection was over a 12-month period. The World Bank and the University of Bristol in the UK provided technical data processing and analysis support to the FBoS data analysis team.

6000 households represent all of Fiji

Is a survey of 6000 households truly representative of income, expenditure, poverty and other issues and factors in Fiji? If any significant decisions were made on the basis of the HIES report, with what level of confidence were they made? You couldn’t say 100 per cent confidence, could you? 90 per cent? 80 per cent? 70 per cent? Less? And that is the challenge, if not the problem, with survey data.
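One rough way to put a number on that confidence is the margin of error for a proportion estimated from a sample. The Python sketch below assumes simple random sampling, a 95 per cent confidence level (z = 1.96) and a worst-case proportion of 0.5. The function name, the sub-group size of 300 and the design effect of 2.0 are illustrative assumptions, not figures from the HIES report, whose actual sample design is likely to be more complex.

```python
import math


def margin_of_error(p, n, z=1.96, design_effect=1.0):
    """Approximate half-width of a confidence interval for a proportion.

    p: estimated proportion (0.5 is the worst case)
    n: number of sampled households
    z: z-score for the confidence level (1.96 is roughly 95 per cent)
    design_effect: inflation factor for clustered or stratified designs
                   (1.0 means simple random sampling; an assumption here)
    """
    return z * math.sqrt(design_effect * p * (1 - p) / n)


n_households = 6000  # sample size quoted for the 2019-20 HIES

national = margin_of_error(0.5, n_households)
# Hypothetical design effect of 2.0, purely for illustration.
clustered = margin_of_error(0.5, n_households, design_effect=2.0)
# Hypothetical sub-group (for example one district) with 300 sampled households.
subgroup = margin_of_error(0.5, 300)

print(f"national, simple random sample: +/- {national:.4f}")
print(f"national, design effect 2.0:    +/- {clustered:.4f}")
print(f"sub-group of 300 households:    +/- {subgroup:.4f}")
```

Under these assumptions a national proportion carries a margin of roughly plus or minus 1.3 percentage points, but the margin widens quickly for sub-groups, where far fewer households are sampled.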

There are a number of survey data analysis methods you could use. They range from simple cross-tabulation, where survey data is arranged into a table of rows and columns that makes it easier to understand, to statistical methods that tell you things that would otherwise be near impossible to figure out, such as whether the results you’re seeing are statistically significant (that is, unlikely to be an accident of which households happened to be sampled).
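Cross-tabulation itself is simple enough to sketch in a few lines of Python. The responses below are invented for illustration and are not HIES data; the function name `crosstab` and the category labels are likewise assumptions.

```python
from collections import Counter

# Invented survey responses: (area, employment status) per respondent.
responses = [
    ("urban", "employed"), ("urban", "employed"), ("urban", "employed"),
    ("urban", "unemployed"),
    ("rural", "employed"), ("rural", "employed"),
    ("rural", "unemployed"), ("rural", "unemployed"),
]


def crosstab(records):
    """Count how many records fall into each (row, column) combination."""
    counts = Counter(records)
    rows = sorted({row for row, _ in records})
    cols = sorted({col for _, col in records})
    table = [[counts[(row, col)] for col in cols] for row in rows]
    return rows, cols, table


rows, cols, table = crosstab(responses)

# Print the table with row and column labels.
print("".ljust(8) + "".join(col.ljust(12) for col in cols))
for row, cells in zip(rows, table):
    print(row.ljust(8) + "".join(str(cell).ljust(12) for cell in cells))
```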

HIES data combined with Census

And then there’s the problem of the recency and therefore the relevance of survey data you’re basing your decisions on.

HIES data could be combined with census data but that would present new challenges such as synchronis­ing the date of survey with census data which is older than HIES data. Census data for analysis purposes could be considered a survey as the collection of data is often in predetermi­ned ranges at a certain point in time — such as age range 25-34, wage range 30,000 – 40,000 and so on.

Someone who was 34 back in 2021 would now, three years later, be 37 and in a different bracket, say 35-44, and their earnings may have changed significantly, so decisions made on that data could miss the point and be wasteful. And census data for the same group would reflect 2017, so a person recorded as 34 then would be 41 today.
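The drift between brackets is easy to illustrate. In the sketch below the bracket boundaries, function names and example ages are assumptions chosen for illustration; they are not the ranges used by FBoS.

```python
# Hypothetical age brackets, as (lowest age, highest age) pairs.
BRACKETS = [(15, 24), (25, 34), (35, 44), (45, 54), (55, 64)]


def bracket_for(age):
    """Return the label of the bracket containing this age."""
    for low, high in BRACKETS:
        if low <= age <= high:
            return f"{low}-{high}"
    return "other"


def bracket_today(age_at_collection, collection_year, current_year):
    """Bracket a person falls into now, given their age when the data was collected."""
    current_age = age_at_collection + (current_year - collection_year)
    return bracket_for(current_age)


# A person recorded as 34 in 2021 was in the 25-34 bracket at the time...
print(bracket_for(34))
# ...but three years later the same person belongs to the next bracket.
print(bracket_today(34, 2021, 2024))
# A person recorded as 30 in 2017 has also moved on seven years later.
print(bracket_today(30, 2017, 2024))
```

Any analysis that still counts these people in their original bracket is describing a population that no longer exists in that form.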

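That it took 12 months to complete the HIES in itself suggests the data is skewed, since households interviewed at the start and at the end of collection were reporting on different points in time.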

The point is not that surveys are a waste of time, but that data recency is of high importance. To be accurate in decision making and spending, the data must be as recent, as realtime, as possible.

Otherwise, take the scenario of planning for childcare and primary school children using census data that is seven years old.

Are we just guessing?

We’d be guessing at best when estimating the needs, the funding required and so on for childcare, kindy, and years one and two, whether it’s to do with facilities, teachers, or carers.

Would you want to take into account relocation­s and migrations and other factors that would impact planning and budgeting? We’d need to add immigratio­n arrivals and departures data to the mix to get across all of that.

And while we’re doing that perhaps tourism, employment, and education could benefit from analysing the same data enriched with immigratio­n data.

Detail data analyses are of superior value

Detail data analyses are of far more value than survey data alone. The data sources are there, they can be made accessible in realtime, but it seems there’s reluctance to contemplat­e an integrated data repository. Why?

Is it because we exist in silos? Is it because funding for our projects comes from a diverse range of donors and our budgeting is not integrated, particularly from a data capability standpoint? So we build our own data capabilities in our own silos, often using the same source data, at a far greater cost than if a central capability were developed.

But we have our own copy, in our own little patch, no matter that we’re only getting half the benefit. Wastage. Great! I hasten to add, though, that this is not the fault of any one business division or government ministry, department or agency. It’s to do with the lack of an organisation-wide data strategy and with silo-based funding and budget allocations.

In the above scenario your sources of data would be, at minimum: in the area of income, expenses and poverty, FRCS, FNPF, Social Welfare and the VAT Monitoring System (VMS); in the census area, birth registrations, deaths, marriages, business registrations, education, employment, and perhaps correctional services.

National data roadmap

What is needed is a shared data repository that all stakeholders could access at shared cost, instead of each spending separately, at compounding cost, on environments that basically do the same thing. The reality is that when each silo is considered separately, the cost of achieving an acceptable level of sophistication and integrity of information becomes prohibitive, so we do our own thing and end up with less than mediocre capability. The shared data repository would hold detail data, to be combined with survey data that adds nuance to detail data analyses. A prioritised roadmap starting with the most commonly required data would deliver to several stakeholders while building out to a national capability.

Surveys and detailed data analytics each have their own set of advantages and disadvanta­ges, depending on the specific goals, resources, and context of the analysis — provided surveys are done on a reasonably frequent basis. Here’s a breakdown of the pros and cons of each:

Survey Pros:

Surveys can be designed to gather a wide range of informatio­n, from visitor demographi­cs to preference­s, behaviours, and satisfacti­on levels. Surveys allow for questions specific to the research objectives, providing insights into nuanced aspects.

They provide direct feedback from the population themselves, offering firsthand perspectiv­es on their experience­s. And open-ended survey questions can return qualitativ­e results and experience­s that may not be captured through quantitati­ve data alone.

Survey Cons:

Response bias is a big one: demographic groups can be over- or under-represented, skewing the results. This is a risk when running AI algorithms as well. Usually the sample size is limited, making it challenging to generalise findings accurately to the entire population.
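A common partial remedy for over- or under-representation is to reweight respondents so that each group counts in proportion to its share of the population, a technique known as post-stratification. The sketch below uses invented group counts, population shares and group averages purely for illustration; none of the figures come from the HIES or the census.

```python
def poststratification_weights(sample_counts, population_shares):
    """Weight for each respondent in a group: population share / sample share."""
    total = sum(sample_counts.values())
    return {
        group: population_shares[group] / (count / total)
        for group, count in sample_counts.items()
    }


def weighted_mean(group_means, sample_counts, weights):
    """Average of a measure across groups after applying respondent weights."""
    numerator = sum(
        group_means[g] * sample_counts[g] * weights[g] for g in sample_counts
    )
    denominator = sum(sample_counts[g] * weights[g] for g in sample_counts)
    return numerator / denominator


# Invented example: urban households are over-represented in the sample.
sample_counts = {"urban": 700, "rural": 300}
population_shares = {"urban": 0.55, "rural": 0.45}  # hypothetical shares
group_means = {"urban": 0.20, "rural": 0.40}  # e.g. share below a poverty line

weights = poststratification_weights(sample_counts, population_shares)
unweighted = sum(
    group_means[g] * sample_counts[g] for g in sample_counts
) / sum(sample_counts.values())
weighted = weighted_mean(group_means, sample_counts, weights)

print(f"unweighted estimate: {unweighted:.2f}")
print(f"weighted estimate:   {weighted:.2f}")
```

Reweighting only corrects for the groups you know about and have population figures for, so it reduces this kind of bias rather than removing it.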

Responses to survey questions can be subjective and influenced by factors such as the mood, memory, and social desirability bias of individuals. Surveys are also time-consuming and expensive, especially when trying to reach a representative sample across different demographics or geographic locations.

Detailed Data Analytics: Pros

In favour of detail data analytics is that these are fact-based analyses, providing objective insights based on actual behaviour patterns and transactions rather than on self-reported information whose honesty is assumed. Large, comprehensive datasets can be accessed, with analytics and insights provided at scale, covering close to the entire population rather than a sample of it. With larger and more detailed datasets, analytics techniques allow for predictive modelling and the forecasting of trends and behaviours based on current and historical data.

With real-time data streams, analytics can provide up-to-date relevant insights, allowing for agile decision-making.

Detailed Data Analytics: Cons

The quality of data used in analytics depends on factors such as collection methods, accuracy, and completeness. These can sometimes be challenging to ensure, but are manageable through data governance tools and techniques.

Detailed data analyses raise privacy concerns, particular­ly when dealing with personally identifiab­le informatio­n (PII), necessitat­ing careful handling and compliance with regulation­s. However, this is not an insurmount­able issue with techniques such as de-identifica­tion and anonymisat­ion of data.
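As a minimal sketch of de-identification, the Python below replaces a direct identifier with a keyed hash and drops other identifying fields before a record is shared. The field names (`tax_id`, `name`, `address`), the key and the function names are hypothetical. Strictly speaking this is pseudonymisation rather than full anonymisation: records can still be linked across datasets by whoever holds the key, and the remaining fields may still allow re-identification in small populations.

```python
import hashlib
import hmac

# Hypothetical secret held only by the data custodian; never stored with the data.
SECRET_KEY = b"replace-with-a-secret-held-by-the-data-custodian"


def pseudonymise(identifier):
    """Replace an identifier with a keyed SHA-256 hash (HMAC), as a hex string."""
    return hmac.new(SECRET_KEY, identifier.encode("utf-8"), hashlib.sha256).hexdigest()


def de_identify(record, id_field="tax_id", drop_fields=("name", "address")):
    """Return a copy of the record without direct identifiers.

    The identifier is replaced by a stable pseudonymous key so the same person
    can still be matched across datasets processed with the same secret key.
    """
    cleaned = {
        key: value
        for key, value in record.items()
        if key not in drop_fields and key != id_field
    }
    cleaned["person_key"] = pseudonymise(str(record[id_field]))
    return cleaned


# Invented record for illustration only.
record = {"tax_id": "12345", "name": "A. Citizen", "address": "Suva", "income": 28000}
print(de_identify(record))
```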

Data analytics can be complex, requiring specialise­d skills and data visualisat­ion for effective interpreta­tion and action.

A combination of surveys and detailed data analytics can provide a more comprehensive understanding of a person’s or segment’s situation, status, behaviours, preferences, and trends.

Detail data analytics could provide the same benefit on its own with a high degree of confidence.

Surveys can provide that traditiona­l warm and fuzzy feeling at least until the roadmap is fully rolled out.

■ NALEEN NAGESHWAR is a data and digital strategy consultant. A Fijian citizen based in Sydney, he runs his own consulting practice, Data4Digital, and is managing partner Australia, NZ, and Pacific for AlphaZetta Data Science and Analytics Consulting. For questions and feedback: naleen@data4digital.com. The views are his and not of this newspaper.

Much like silo datasets, there is definite miscommunication between HIES and CENSUS information. There is an enormous amount of untapped data in existence today in core ministries that can be combined for effective decision making. Image: SUPPLIED
