Tomota: I joined Daikin in July 2023 as a mid-career hire. In my previous job at a chemical manufacturer, I worked on the development of automotive lithium-ion batteries. At that time, I was seconded to Germany where I worked on a large-scale project related to data science. This experience stirred an interest inside me for not only data-driven development, but also for committing myself to this field, which led me to Daikin. At TIC, I now lead the data infrastructure team and am responsible for building databases and developing the surrounding data infrastructure.
NEWS
MI Led by First-Term Graduate of Daikin Information and Communication Technology University with a Mid-Career Data Scientist Constructing the Data Infrastructure
Yoshizaki: During my student years, I majored in inorganic chemistry and researched catalysts for automotive exhaust gas treatment. I've been with Daikin for seven years now. Right after joining the company, I became part of the first class at Daikin Information and Communication Technology University, where I studied AI for two years before being assigned to TIC. I also participated in a three-year collaborative program with the University of Tokyo, where I researched the application of materials informatics (MI) in the development of new materials. Currently, I am currently using those results of that research with a main focus on product development using MI.
Selection of 30 Molecular Structures from 500,000 Candidates and Automated Experiments
As briefly mentioned in the first part of this interview, organic polymer materials (polymers) have complex structures, and new developments in this area have been slow due to a lack of integration between input and output data for design. Even with the same composition, changes in polymerization and sample preparation conditions can alter the structure, causing fluctuations in material properties. Additionally, compared to other materials, the simulation accuracy for polymers is lower, presenting another challenge.
To address this, we utilized MI for efficiently exploring material candidates by developing technology to generate high-quality, large-scale data and to predict material properties with high accuracy. To achieve this, we focused on developing technologies that combines automated molecular simulations with automated robotic experiments. This effort targeted polymer and composite materials, specifically focusing on substrate materials for high-frequency applications.
First, we conducted molecular simulations on a computer, input molecular structures, and automatically calculated the targeted properties. We also built a machine learning model that predicts material properties based on molecular structures. From there, the model explores and proposes promising molecular structure candidates that could achieve the targeted properties, such as low dielectric constant, low dielectric loss tangent, and low thermal expansion coefficient, to facilitate the development of new materials.
Specifically, we used machine learning to narrow down 500,000 polymer structures to 1,000 candidates, and through multi-stage screening using proxy indicators, we eventually identified 30 promising molecular structures. Traditional methods would have required an enormous amount of time to simulate and select these materials, but this approach efficiently narrowed down the candidates in about three months.
We also developed a prototype for automated experiments using robotics in collaboration with a lab at the University of Tokyo. This system automates the entire process, from sample preparation to property measurement (such as dielectric constant), which was previously done manually. By integrating a conveyor belt for sample transfer and coordinating robotic arms with various devices, we can measure material properties with human-level accuracy and high reproducibility, while also accumulating the corresponding experimental data.
The new materials identified in this project are promising candidates for use in the upcoming 5G and 6G communication technologies.
In-House Data Infrastructure Uses Private Cloud to Automate Collection and Processing of Experimental Data
Tomota: So, the question is why a data infrastructure is necessary. No matter how advanced MI technology becomes, the importance of chemical experiments in the field remains unchanged. In recent years, there has been an increase in robot-assisted experiments, and managing data individually in formats like Excel or CSV format makes data-driven development difficult. To handle the growing amount of data, a powerful and flexible data infrastructure is essential.
Unfortunately, creating this data infrastructure in-house requires entirely different technologies from chemistry. This data infrastructure is specifically designed for material development and was built on a private cloud (AWS) that includes elements like databases, data pipelines, and data marts. Instead of processing experimental data locally on Excel, we integrated it with visualization and analysis tools, making it easy to output results and share them with the research team for accumulation and utilization.
When it came to system development, we chose to create it internally since outsourcing would likely result in rework, and the system would become a black box. Fortunately, Daikin's Technology and Innovation Center (TIC) has a dedicated team specializing in data utilization and information technology. With a cloud environment in place for experimenting with new technologies and frequent sharing of best practices, TIC's structure and on-site technology experts proved to be a tremendous help. Within six months, chemical and IT engineers collaborated from a unified perspective to develop a data platform tailored for the niche field of chemical R&D.
— Could you tell us about the data pipeline—how it works and how it uses the cloud to efficiently and systematically collect and analyze experimental data?
A key developmental focus of these platforms was simplifying maintenance and operations. If the system is too labor-intensive, it becomes unsustainable. Similarly, high maintenance costs could jeopardize the project due to business constraints. Thus, reducing operational workload and costs was one of the essential requirements. By leveraging AWS, managing architecture through Infrastructure as Code (IaC), and automating workflows, both operational complexity and costs were reduced to less than one-tenth compared to commercial software.
Currently, this data platform is applied to a limited set of development themes. However, in the future, we plan to apply it to other themes and gradually expand it as needed. By providing a platform optimized for the needs of chemical R&D, we aim to support efficient and effective data-driven development.
Researchers aspiring to excel in MI can try polishing their skills at Daikin
Tomota: At TIC, we are surrounded not only by young talent but also by charismatic and fascinating senior engineers, such as Chief Executive Engineer Hido (※2), a leading figure in AI, and Chief Executive Engineer Shigemoto, who is currently working with us. Their influence and inspiration have a positive impact on younger engineers. Daikin is truly a company full of originality and excitement. If you're interested in new R&D challenges and eager to innovate, we would love for you to join us.
※The information and profiles are based on the time of the interview.
※2. [Part 2] Behind the scenes of the groundbreaking project that won the "5th Japan Open Innovation Award" ~ Multi-skilling on-site workers using THINKLET® ~
Technology and Innovation Center
Joined in April 2018. Originally from Kumamoto Prefecture. Responsible for digital materials design.
"To advance digital materials design and create innovative materials that contribute to solving societal challenges."
Technology and Innovation Center
Joined in July 2023. Originally from Nara Prefecture. Responsible for chemical R&D data platform technology.
"To reform the working styles in chemical R&D and achieve cutting-edge data-driven development in the industry."



