Skip to content

πŸ—ΊοΈ SOC Mapper

The SOC mapper relies on the SOC coding index released by the ONS. This dataset contains over 30,000 job titles with the SOC code.

The SOCMapper class in soc_map.py maps job title(s) to SOC(s).

πŸ”¨ Core functionality

from nlp_link.soc_mapper.soc_map import SOCMapper

soc_mapper = SOCMapper()
soc_mapper.load()
job_titles=["data scientist", "Assistant nurse", "Senior financial consultant - London"]

soc_mapper.get_soc(job_titles, return_soc_name=True)

Which will output

[((('2433/04', 'Statistical data scientists'), ('2433', 'Actuaries, economists and statisticians'), '2425'), 'Data scientist'), ((('6131/99', 'Nursing auxiliaries and assistants n.e.c.'), ('6131', 'Nursing auxiliaries and assistants'), '6141'), 'Assistant nurse'), ((('2422/02', 'Financial advisers and planners'), ('2422', 'Finance and investment analysts and advisers'), '3534'), 'Financial consultant')]

πŸ“– Read more

Read more about the methods and evaluation of the SOCMapper here.