Data Engineer (NLP Specialist), Global Data

Careers at Bloomberg

Back to Search

Princeton, NJ

Posted May 8, 2019 - Requisition No. 74664

What we do:

At Bloomberg, our primary product is our data. Our Global Data department combines technology with deep product expertise to bring unequaled value to the world’s information. We bring structure, clarity and meaning to the data we collect by building robust data processes with innovative technology that allow us to offer unique data products that drive our client’s biggest financial decisions.

Our NLP Automation Engineers are at the forefront of this process; creating cutting-edge systems to manage the relentless flow of information; we support hundreds of data analysts, data scientists and product experts around the world to bring transparency and meaning to data that makes Bloomberg the world’s premier provider of information.

This is not your typical Data Engineering position.

Our department is a dynamic, creative environment, with a focus on up-ending the status quo and leveraging the technology and resources we have available in new, innovative ways. You’ll feel more like you’re in a well-funded startup than in a global enterprise with over 170 locations, but our clients rely on the information we extract and enrich for making their most critical financial decisions and they trust our systems to get it right. Every time.

Who you are:

You thrive in environments where the data is unstructured and have worked with data sources where it’s a challenge to separate the signal from the noise. You have experience applying standard techniques for information extraction & retrieval, topic classification, and entity disambiguation, ideally in several languages and across disparate natural language sources. You can apply the basic NLP techniques to messy content in your sleep and are able to evaluate the tradeoffs between methodologies in your toolkit when facing real-life problems. You’ll be expected to help execute on a strategy to identify opportunities for NLP in existing workflows; to automate, improve and expand our product offering. You’re especially experienced in designing annotation workflows to gather the highest quality information from crowd and expert annotators, squeeze this data to its fullest extent. As a leader, you’ll bring together data annotations, algorithm selection, state-of-the-art technology for training and deployment and robust evaluation metrics to lead the way in building our next generation of NLP-powered data solutions.

What you need to have (give or take a few):

  • A BA/BS degree or higher in Computer Science, Mathematics, Computational Linguistics or relevant data technology field, and 2-5 years of professional work experience in software development, Data Engineering, data science or a related field.
  • Experience developing NLP-powered solutions to real world problems, ideally using Python; NLTK, Spacy, Python ML ecosystem; Tensorflow, PySpark, PyTorch. Additional knowledge of Go, C++ and JavaScript would be great.
  • Standard dev skills: Linux, virtualized/cloud environment, git, scripting, automated testing.
  • An awesome attitude and a collaborative working style.
  • Strong verbal and written communication skills, especially when explaining technical solutions to stakeholders and management. Experience with data visualization and presentation tools; you can tell a compelling story about the data you work with.

Does this sound like you? Does this almost sound like you?

Hit apply and we’ll be in touch!

People you’ll work with:

https://careers.bloomberg.com/job/detail/74662

https://careers.bloomberg.com/job/detail/74663

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Similar jobs