Data Engineer - Python Development

  • Company: Capital One
  • Posted: May 07, 2016
  • Reference ID: R2717
Plano 6 (31066), United States of America, Plano, Texas

Capital One is a technology company, a research laboratory, and a nationally recognized brand with over 65 million customers. We offer a broad spectrum of financial products and services to consumers, small businesses and commercial clients - and data is at the center of everything we do. In December 2015, Capital One was named a Blue Ribbon Company by Fortune Magazine as one of only 25 companies in the world to make their top company lists four times in 2015 (Fortune’s 100 Best Companies to Work For, Global 500, Fortune 500, World’s Most Admired Companies). Come learn more about the great opportunities we have to offer!

 We are looking for driven individuals to join our team of passionate data engineers in creating Capital One’s next generation of data products and capabilities.
- You will build data pipeline frameworks to automate high-volume and real-time data delivery for our Hadoop and data platforms 
- You will build data APIs and data delivery services that support critical operational and analytical applications for our internal business operations, customers and partners
- You will work with Data Scientists and Data Analysts to transform complex analytical models into scalable, production-ready solutions
- You will continuously integrate and ship code into our on-premises and cloud production environments
- You will develop applications from the ground up using a modern technology stack that includes Python, Spark, Postgres, Angular and NoSQL
- You will work directly with Product Owners and customers to deliver data products in a collaborative and agile environment


- Lead and develop sustainable, data-driven solutions with current and next-generation data technologies to meet the needs of our organization and business customers
- Master new technologies rapidly as needed to progress varied initiatives
- Break down complex data issues and resolve them
- Build robust systems with an eye on the long-term maintenance and support of the application
- Leverage reusable code modules to solve problems across the team and organization
- Drive cross-team design and development through technical leadership and mentoring
- Influence across teams in a matrix organization
- Provide technical guidance to team members

Basic Qualifications: 

- Bachelor’s Degree or military experience
- At least 3 years of experience developing Python-based software solutions
- At least 3 years of coding in data management, data warehousing or unstructured data environments
- At least 3 years of experience with big data technologies (Hadoop, Cassandra, Accumulo, HBase, Spark, YARN, ZooKeeper)

Preferred Qualifications: 

- Master's Degree
- At least 3 years of experience with Agile engineering practices
- At least 3 years of experience developing software solutions to solve complex business problems
- At least 3 years of in-depth experience with the Hadoop stack
- Familiarity with data science tools and concepts
- At least 3 years of experience with UNIX/Linux, including basic commands and shell scripting
- At least 3 years of experience developing Java-based software solutions
- At least years experience with NoSQL implementation (Mongo, Cassandra, etc. a plus)
- At least years experience with Relational Database Systems and SQL
- At least years experience designing, developing, and implementing ETL
