ChatGPT on Software Engineering

Recently, I’ve been working on a new course offering in Enthought Academy titled Software Engineering for Scientists and Engineers course. I’ve focused on distilling the software engineering best practices that we use at Enthought with our clients, with the twist of “what parts are most useful for a scientist who writes software for R&D?” After all, we don’t want to simply replicate an academic 12-week course in software engineering; instead, we want to teach the subset of ideas that will make the most impact on scientific programming projects in the least amount of time. Unfortunately, I lost track of time and ran into my deadline for this blog. Panic time? Not this year.

This year, I have a trusty assistant who can draft an article for me!

Enter ChatGPT

Since software engineering was on my mind, I asked my new assistant to do the following:

Write a 1000 word article on software engineering with the following sections:  introduction, usability, maintainability, reliability, and scalability; conclude with a discussion on how all of these aspects of software engineering layer together.

My assistant gamely went off and, about 10 seconds later, live-typed to me:

Introduction:

Software engineering is a field that deals with the design, development, testing, deployment, and maintenance of software. It is a crucial aspect of modern technology, as software is a fundamental part of almost every aspect of our daily lives. From the operating system on our phones to the software that controls the traffic lights, software engineering is a critical aspect of building reliable and efficient software.

Usability:

Usability is an essential aspect of software engineering, as it ensures that the software is easy to use and navigate for its users. A usable software program has a user interface that is intuitive and straightforward, making it easy for users to interact with the program. In software engineering, usability is often achieved through the use of user-centered design methodologies that prioritize the needs and expectations of the user.

Not bad.  The article was pretty complete. It was coherent and informative; it summarized a lot of information. However, it wasn’t exactly what I had in mind.

After all, at Enthought we work primarily in the Python ecosystem, and I wanted something focused there—something about how to make good use of the various standard Python tools for software engineering: documentation, testing, performance, etc. The basic outline matched what I had asked for, but was too generic. 

Revision #1

So, I went back to my assistant and asked for some modifications: 

Rewrite this article with a focus on the Python programming language.

My assistant took off and immediately provided me with a complete rewrite:

Introduction:

Python is a widely-used programming language in software engineering that emphasizes code readability, simplicity, and ease of use. It is an interpreted, high-level programming language that supports multiple programming paradigms, including object-oriented, functional, and procedural programming. Python is used in a wide range of applications, from web development to data analysis to artificial intelligence and machine learning.

Usability:

Python’s readability and ease of use make it a popular choice among developers who value productivity and code simplicity. Its clean syntax, dynamic typing, and automatic memory management make it easy for beginners to learn and for experienced developers to write code quickly and efficiently. Python’s standard library includes a wide range of modules and tools, which reduces the need for developers to write code from scratch and streamlines the development process.

This revision was again coherent and informative, but strayed away from the original software engineering topic. Still, it had a lot of good information about Python and did emphasize some of the main good points of Python as a language vis a vis general software engineering problems. I decided to try for another revision. After all, at Enthought our focus is really on scientific R&D software, not general software development. The article needed to have some focus on the special problems that turn up in the science R&D context, such as: small teams of collaborators working in different organizations and places; the open source sharing of tools; messy and incomplete data; and rapidly changing understanding of the problems being worked on.

Revision #2

This time I asked:

Rewrite this article with a focus on software written for solving scientific research problems.

Again, my assistant provided a sub-minute, quick turnaround:

Introduction:

Software engineering plays a crucial role in scientific research as it enables researchers to develop software solutions for solving complex research problems. Scientific software development requires specialized skills and knowledge, as it often involves processing large datasets, running simulations, and implementing complex algorithms. Python is a popular programming language in scientific research, as it provides researchers with a range of tools and libraries for data analysis, visualization, and machine learning.

Usability:

In scientific research, software usability is critical as researchers need to focus on their research rather than struggling with complex software interfaces. Python’s simplicity and ease of use make it an ideal language for scientific research. Its clean syntax and dynamic typing make it easy for researchers to write code, while its extensive libraries and tools provide a range of solutions for data analysis, visualization, and modeling. Additionally, Python’s Jupyter Notebook provides researchers with a user-friendly environment for interactive data analysis and documentation.

Yet again, my assistant provided a good response, although still not exactly what I was after (this content have been a better draft for another blog I wrote: Why Python?).  It could be that my requests (prompts) were not precise enough. Still, there was a lot of good, summarized information in ChatGPT’s responses.

Next Steps

The next steps could be to continue experimenting with getting the right query to ChatGPT. Or, I could take the responses I have received and use them as solid drafts to craft an article that did match what I envisioned—something about the craft of software engineering that would be useful for scientists who collaborate with each other, who create useful tools for other researchers, and who see software as much about communication as solving complex problems.

However, I will leave it as is.  I invite you to get a free account with ChatGPT and explore this latest marvel of the digital world. Ask it about software engineering. Starting with a general query and then refining, and adding layers of complexity seems to be a good approach.  The responses are interesting.  If nothing else, they provide you with a solid summary of ideas related to your query; they can also help you refine your thinking by getting you to focus your queries more precisely on the questions you really wanted to ask, but didn’t.

In the meantime, I will go back to working on a new Enthought Academy course, Software Engineering for Scientists and Engineers. If you are interested in creating solid software, you’ll find our human-crafted course—which allows us to work directly with you and answer the questions you have about software engineering—more useful.

Enthought Academy Software Engineering for Scientists and Engineers.  By Humans. For Humans.

Author: Eric Olsen, Director, Training Solutions, holds a Ph.D. in history from the University of Pennsylvania, a M.S. in software engineering from Pennsylvania State University, and a B.A. in computer science from Utah State University. Eric spent three decades working in software development in a variety of fields, including atmospheric physics research, remote sensing and GIS, retail, and banking. In each of these fields, Eric focused on building software systems to automate and standardize the many repetitive, time-consuming, and unstable processes that he encountered.

 

Share this article:

Related Content

Jupyter AI Magics Are Not ✨Magic✨

It doesn’t take ✨magic✨ to integrate ChatGPT into your Jupyter workflow. Integrating ChatGPT into your Jupyter workflow doesn’t have to be magic. New tools are…

Read More

Top 5 Takeaways from the American Chemical Society (ACS) 2023 Fall Meeting: R&D Data, Generative AI and More

By Mike Heiber, Ph.D., Materials Informatics Manager Enthought, Materials Science Solutions The American Chemical Society (ACS) is a premier scientific organization with members all over…

Read More

Real Scientists Make Their Own Tools

There’s a long history of scientists who built new tools to enable their discoveries. Tycho Brahe built a quadrant that allowed him to observe the…

Read More

How IT Contributes to Successful Science

With the increasing importance of AI and machine learning in science and engineering, it is critical that the leadership of R&D and IT groups at...

Read More

From Data to Discovery: Exploring the Potential of Generative Models in Materials Informatics Solutions

Generative models can be used in many more areas than just language generation, with one particularly promising area: molecule generation for chemical product development.

Read More

7 Pro-Tips for Scientists: Using LLMs to Write Code

Scientists gain superpowers when they learn to program. Programming makes answering whole classes of questions easy and new classes of questions become possible to answer….

Read More

The Importance of Large Language Models in Science Even If You Don’t Work With Language

OpenAI's ChatGPT, Google's Bard, and other similar Large Language Models (LLMs) have made dramatic strides in their ability to interact with people using natural language....

Read More

4 Reasons to Learn Xarray and Awkward Array—for NumPy and Pandas Users

You know it. We know it. NumPy is cool. Pandas is cool. We can bend them to our will, but sometimes they’re not the right tools…

Read More

Leveraging AI in Cell Culture Analysis

Mammalian cell culture is a fundamental tool for many discoveries, innovations, and products in the life sciences.

Read More

Making the Most of Small Data in Scientific R&D

For many traditional innovation-driven organizations, scientific data is generated to answer specific immediate research questions and then archived to protect IP, with little attention paid...

Read More