Job Description
● HBase, Hadoop, Hive
● Python and API development in AWS cloud; developing data pipelines
● Experience with serverless architecture and orchestration
● SQL and NoSQL databases, including document stores such as MongoDB
● Docker, Kubernetes, Spark etc.
● Messaging systems like AWS SQS
● Python programming and analytics libraries like Pandas, NumPy, scikit-learn
● (Preferred) data privacy, HIPAA, PHI / PII deidentification, and role-based access controls