Data Scientist

Vicorl2u 400x400 Splunk | San Francisco

Splunk’s Guild of Data Science has been tasked with helping Splunk build smarter software and make data-driven decisions. While our mission is broad, individual guild members are involved in a subset of the efforts described below. We offer the flexibility to align your involvement with one or more efforts that interest you.

Key Responsibilities:

  • You understand various types of data
  • You will closely interact with the research team as well as product management to derive requirements for how the data should be processed and what we are looking for
  • You will research, evaluate and implement statistical methods to track metrics
  • Quick prototyping and evaluation of multiple solutions using frameworks like python/Spark
  • Implementation of the solutions in a product environment
  • You will derive data insights and bring them back to the product management team
  • You will strive to find the best ways to visualize the data and the results
  • You will interact with various teams and areas throughout the product organization to collect feedback, improve the backlog, and define best practices for machine learning related matters.
  • You will work as a traveler to help teams design features and products that are data-driven and/or rely on machine learning and statistics, assure the quality of the outcome, and actively participate in knowledge transfer.
  • You will derive data insights and bring them back to the product management team.
  • You will stay current on industrial and research trends, positioning, and customer needs.
  • You will advocate for data science and machine learning through engaging with the external data science community and/or contributing to technical blogs.
  • You will contribute to Splunk’s privacy and compliance efforts.
  • You will build and maintain a data repository and documentation standards.

Required Experience/Skills &Education:

  • You have expertise in statistical analysis tools such as R, Stata, Weka, Matlab - required
  • You have experience in Spark/MLlib specifically an added plus
  • You have the ability to code in Java or Scala, or python
  • You have experience working with large datasets, preferably using tools like SQL, Hadoop, MapReduce, Pig, Hive. – preferred
  • You have experience with graph mining and graph algorithms an added plus, even more so knowledge of GraphX
  • Math, Statistics or CS background with emphasis in Machine Learning - required

What We Offer You:
  • A constant stream of new things for you to learn. We're always expanding into new areas, bringing in open source projects and contributing back, and exploring new technologies.
  • A set of exceptionally talented and dedicated peers, all the way from engineering and QA to product management and customer support.
  • A stable, collaborative and supportive work environment.
  • We don't expect people to work 12 hour days. We want you to have a successful time outside of work too. Want to work from home sometimes? No problem. We trust our colleagues to be responsible with their time and dedication, and believe that balance helps cultivate an extraordinary environment.
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status

Thank you for your interest in Splunk!

Recommended Jobs: