Let’s get started
By clicking ‘Next’, I agree to the Terms of Service
and Privacy Policy
Jobs / Job page
Data Engineer, Data Insights image - Rise Careers
Job details

Data Engineer, Data Insights

About PayPay

PayPay is a FinTech company that has grown to over 65M (as of August 2024) users since its launch in 2018. Our team is hugely diverse with members from over 50 different countries.

OUR VISION IS UNLIMITED_

We dare to believe that we do not need a clear vision to create a future beyond our imagination. PayPay will always stay true to our roots and realize a vision (future) that no one else can imagine by constantly taking risks and challenging ourselves. With this mindset, you will be presented with new and exciting opportunities on a daily basis and have the opportunity to grow and reach new dimensions that you could never have imagined. We are looking for people who can embrace this challenge, refresh the product at breakneck speed and promote PayPay with professionalism and passion.
※ Please note that you cannot apply or be selected in parallel with PayPay Corporation, PayPay Card Corporation and PayPay Securities Corporation.

 
 

Job Description

PayPay’s growth is driving a rapid expansion of PayPay product teams and the need for a robust Data Engineering Platform to support our growing business needs is more critical than ever. We are looking for a Data Engineer for the Data Insights department.The Data Insights department's mission is to drive product improvements by engineering systems founded on a scientific understanding of user and merchant behavior.
We are looking for talented Data Engineers to join our department and help us scale our platform across the organizations.

Main Responsibilities

  • Design, develop, and maintain scalable data ingestion pipelines using AWS Glue, Step Functions, Lambda, and Terraform
  • Optimize and manage large scale data pipelines to ensure high performance, reliability, and efficiency
  • Implement data processing workflows using Hudi, Delta Lake, Spark, and Scala
  • Maintain and enhance Lakeformation and Glue Data Catalog for effective data management and discovery
  • Design, build, and maintain infrastructure to continuously support the improvement and deployment of machine learning models
  • Collaborate with cross-functional teams to ensure seamless data flow and integration across the organization
  • Implement best practices for observability, data governance, security, and compliance

Qualifications

  • 5+ years experience as a Data Engineer or in a similar role
  • Hands-on experience with Apache Hudi, Delta Lake, Spark, and Scala
  • Experience designing, building, and operating a DataLake or Data Warehouse
  • Knowledge of Data Orchestration tools such as Airflow, Dagster, Prefect
  • Strong expertise in AWS services, including Glue, Step Functions, Lambda, and EMR
  • Familiarity with change data capture tools like Canal, Debezium, and Maxwell
  • Experience with data warehousing tools like AWS Athena, BigQuery, Databricks
  • Proficiency in Python and SQL (any variant), preferably experience in Scala and/or Java
  • Experience with data cataloging and metadata management using AWS Glue Data Catalog, Lakeformation, or Unity Catalog
  • Proficiency in Terraform for infrastructure as code (IaC)
  • Overall understanding of machine learning technologies and deep learning concepts
  • Strong problem-solving skills and ability to troubleshoot complex data issues
  • Excellent communication and collaboration skills
  • Ability to work in a fast-paced, dynamic environment and manage multiple tasks simultaneously

PayPay 5 senses


Working Conditions 

Employment Status

  • Full Time

Office Location

Work Hours

  • Super Flex Time (No Core Time)
  • In principle, 10:00am-6:45pm (actual working hours: 7h45m + 1h break)

Holidays

  • Every Sat/Sun/National holidays (In Japan)/New Year's break/Company-designated Special days

Paid leave

  • Annual leave (up to 14 days in the first year, granted proportionally according to the month of employment. Can be used from the date of hire)
  • Personal leave (5 days each year, granted proportionally according to the month of employment)
    *PayPay's own special paid leave system, which can be used to attend to illnesses, injuries, hospital visits, etc., of the employee, family members, pets, etc.

Salary

  • Annual salary paid in 12 installments (monthly)
  • Based on skills, experience, and abilities
  • Reviewed once a year
  • Special Incentive once a year *Based on company performance and individual contribution and evaluation
  • Late overtime allowance, Work from anywhere allowance (JPY100,000)

    ※Payroll payment can be changed to digital salary payment “PayPay Paycheck” for an amount set by you

Benefits

  • Social Insurance (health insurance, employee pension, employment insurance and compensation insurance)
  • 401K
  • Translation/Interpretation support
  • VISA sponsor + Relocation support
PayPay Glassdoor Company Review
3.7 Glassdoor star iconGlassdoor star iconGlassdoor star icon Glassdoor star icon Glassdoor star icon
PayPay DE&I Review
No rating Glassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star iconGlassdoor star icon
CEO of PayPay
PayPay CEO photo
Ichiro Nakayama
Approve of CEO

Average salary estimate

$100000 / YEARLY (est.)
min
max
$80000K
$120000K

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

What You Should Know About Data Engineer, Data Insights, PayPay

Are you ready to step into the exciting world of data? At PayPay, we're on the lookout for a talented Data Engineer to join our dynamic Data Insights team! As a part of one of the fastest-growing FinTech companies, you will play a crucial role in shaping our data infrastructure, helping us to improve our products and better understand user behavior. Imagine designing and maintaining scalable data ingestion pipelines using cutting-edge technologies such as AWS Glue, Lambda, and Terraform. You'll be optimizing large-scale data workflows with Apache Hudi, Delta Lake, and Spark—all while collaborating with diverse teams across the organization. We believe in a culture of innovation, so your ideas will greatly influence our projects and initiatives. If you have at least 5 years of experience as a Data Engineer and a strong grasp of the AWS ecosystem, this could be your dream job! But that's not all! Working here means enjoying the freedom of remote work, super flex time, and unique advantages like special paid leave to take care of personal matters. We’re all about supporting our team members in every way possible. If you're ready to embrace challenges, drive meaningful change, and explore uncharted territories in data engineering with PayPay, apply today and let's shape the future together!

Frequently Asked Questions (FAQs) for Data Engineer, Data Insights Role at PayPay
What are the main responsibilities of a Data Engineer at PayPay?

As a Data Engineer at PayPay, you'll be responsible for designing, developing, and maintaining scalable data ingestion pipelines using tools such as AWS Glue and Step Functions. You will also optimize large-scale data pipelines to ensure their performance and efficiency, implement data processing workflows using technologies like Hudi and Spark, and collaborate with cross-functional teams to ensure comprehensive data integration across the organization.

Join Rise to see the full answer
What qualifications are necessary for the Data Engineer position at PayPay?

Candidates for the Data Engineer role at PayPay should have a minimum of 5 years of experience in data engineering or a related field. Proficiency in technologies such as Apache Hudi, Delta Lake, and Spark is crucial. Additionally, a deep understanding of AWS services, including Lambda and Glue, along with experience in Data Orchestration tools is expected. Familiarity with Python, SQL, and infrastructure management using Terraform will also set you apart.

Join Rise to see the full answer
What tools and technologies do Data Engineers use at PayPay?

At PayPay, Data Engineers utilize a range of advanced tools and technologies. You'll work extensively with AWS services like Glue, Lambda, and EMR for data processing. Additionally, skills in using Apache Hudi, Delta Lake, Spark, as well as knowledge of tools such as Airflow and data warehousing technologies like AWS Athena and BigQuery will be integral to your role.

Join Rise to see the full answer
Is the Data Engineer position at PayPay remote?

Yes, the Data Engineer position at PayPay is completely remote. We embrace a Work From Anywhere policy, allowing team members to work flexibly from any location in Japan. This setup is designed to promote work-life balance and accommodate individual needs.

Join Rise to see the full answer
What is the working culture like for Data Engineers at PayPay?

The working culture for Data Engineers at PayPay is dynamic and innovation-driven. We emphasize collaboration and creativity, encouraging our team members to challenge themselves and push boundaries. With our Super Flex Time policy, there is flexibility in work hours, allowing you to create a schedule that works best for you while pursuing exciting data projects.

Join Rise to see the full answer
Common Interview Questions for Data Engineer, Data Insights
Can you describe your experience with AWS services relevant to data engineering?

When answering this question, detail the specific AWS services you've used, like Glue and Lambda. Share how you've implemented these services in your projects, and emphasize your understanding of scalability and performance optimization in data pipelines.

Join Rise to see the full answer
How do you approach optimizing data ingestion pipelines?

Discuss your methodology for identifying bottlenecks in data pipelines. Include specific tools and techniques you've used to enhance performance and reliability, such as employing best practices in data processing and regularly monitoring pipeline efficiency.

Join Rise to see the full answer
What is your experience with Apache Hudi and Delta Lake?

Provide an overview of your hands-on experience with both technologies. Highlight projects where you used Hudi or Delta Lake to manage data efficiently, focusing on features like support for streaming data and data versioning.

Join Rise to see the full answer
How do you ensure data quality and integrity in your processes?

Share your strategies for data validation and error handling. Discuss how you implement testing and monitoring throughout the data pipeline, ensuring that the data processed adheres to quality standards.

Join Rise to see the full answer
Describe a challenging data project you have completed.

Use the STAR method (Situation, Task, Action, Result) to describe a specific project. Focus on the challenges you faced, your approach to solving them, and the eventual success of the project, including what was learned.

Join Rise to see the full answer
What role do you think Data Engineers play in data-driven decision-making?

Explain how Data Engineers are critical in providing clean, reliable data for analysis and reporting. Discuss your view on collaborating with data scientists and analysts to ensure data pipelines support business needs effectively.

Join Rise to see the full answer
Can you explain your familiarity with data cataloging tools?

Detail your experience with data cataloging tools like AWS Glue Data Catalog or Lakeformation. Emphasize the importance of metadata management and how you've worked to improve data discoverability in previous roles.

Join Rise to see the full answer
How do you prioritize and manage multiple data projects simultaneously?

Talk about your time management skills and any tools or methodologies you apply for project tracking. Highlight your ability to assess project urgency and importance, allowing you to allocate your time efficiently.

Join Rise to see the full answer
What best practices do you follow for data governance?

Discuss your understanding of data governance principles, including compliance and security. Mention specific practices like auditing, monitoring access, and educating team members on data stewardship.

Join Rise to see the full answer
How do you approach collaborating with cross-functional teams?

Explain your approach to cross-team collaboration, noting the importance of communication and mutual respect. Share how you've built relationships with other teams to facilitate information sharing and streamline processes.

Join Rise to see the full answer
Similar Jobs
Photo of the Rise User
Posted 11 days ago
Photo of the Rise User
Posted 5 days ago
Photo of the Rise User
Mobile Programming LLC Remote California St, San Francisco, CA, USA
Posted 4 days ago
Posted 11 days ago
Photo of the Rise User
Performance Bonus
Photo of the Rise User
Posted 10 days ago
Photo of the Rise User
Posted yesterday
Photo of the Rise User
City of New York Remote Long Island City, NY
Posted 11 days ago

PayPay is a fintech company consisting of diverse members from more than 35 different countries, PayPay has acquired more than 35M users (as of Jan, 2021) in around 2 years since its service launch in 2018. Although the number of employees has al...

62 jobs
MATCH
VIEW MATCH
FUNDING
DEPARTMENTS
SENIORITY LEVEL REQUIREMENT
TEAM SIZE
SALARY RANGE
$80,000/yr - $120,000/yr
EMPLOYMENT TYPE
Full-time, remote
DATE POSTED
January 27, 2025

Subscribe to Rise newsletter

Risa star 🔮 Hi, I'm Risa! Your AI
Career Copilot
Want to see a list of jobs tailored to
you, just ask me below!