Using AI to Generate Images for Public Works – Post 2

water color sketch of a water valve
AI-generated image of a water valve

In the first post on this topic I introduced the Midjourney AI tool which can be used to generate images. In this post, I’ll talk about a few more image generating tools I have tried. The first is Canva. This tool is fairly easy to use; however, you will need to make a free account to access it.

If you decide to try Canva and visit their site, you will find a lot going on there because Canva does a lot more than just generate images from text. To find the AI tool on their site, you can click “Apps” and when the next screen comes up look for “Text to Image.” Once you click that a box pops up asking if you want to use this in a new design or existing design. You can click “use in a new design.” If you do this, you’ll see they offer a drop down with a long list of items from which you can choose. When I am creating an image I usually want a 512 x 512 pixel image, so I choose “Custom Size.” If you make this same choice, you can indicate your image size and then click the button “Create New Design.”

screenshot of the Canva text to image app
Screen shot of the Canva Text to Image App

Next, you should see a screen like the one shown in the image above. There is an area on the right with the blank canvas and a small input area where you can type the description of what I want. Initially I tried “a person putting their garbage can at the edge of the road.” For the style I left it at “None.” After a short time, the image below is what it generated which wasn’t quite what I had hoped for. A person is only in one of the images, and the can is actually in the road.

Four images of a blue garbage can sitting in the road. One image has a woman in a red shirt and black pants crouching down near the can.
AIi-generated image of a garbage can along a road from Canva

So I tried again this time choosing the “Dreamlike” style. The images generated were a little closer to what I was looking for, but the person was a small child in two of them, and the one with an adult did not look realistic. Here is how it came out:

four images showing garbage cans in the road. A small child in standing near a can in two of them while a woman is near one in another
AI-generated image of a garbage can from Canva

After not really getting what I was looking for I tried a different image. This time I typed “a stream winding through a field of grass and trees.” I left the style at “none.” The image generated which I included below turned out close to what I had envisioned. Although if I wanted to use this to illustrate a stream in my area, I would need to try again and add text to indicate no mountains.

four images showing a stream winding through a field of grass with some trees. Three images have mountains in the background and the remaining one has a waterfall.
AI-generated image of a stream from Canva

Another image generating AI often talked about is DALL-E 2. This AI is brought to you by OpenAI – the same organization which offers ChatGPT. Just like with Canva, you will need an account to try out DALL-E 2. Once you log in, this is how the site looks:

Screen shot of the DALLE2 site
DALL-E 2 Website

In the prompt line I typed in the same text I tried with Canva: “a person putting their garbage can at the edge of the road.” DALL-E 2 generated four images, the same number produced by Midjourney and Canva. What I really like about the images generated by DALL-E 2 is they download with a DPI resolution of 300. The other tools download at a smaller size, and I always have to bring them into my image editing software to upsize them. Also, almost all of the images generated by DALL-E 2 were close to what I was looking for. I added below the one that I thought was best.

image of a brown garbage can at the side of the road with someone's arm resting on top of it. There is a car in the background.
AI-generated image of a garbage can by DALL-E 2

And just for fun, here’s another image I generated on DALL-E 2 a while ago using the following text: “a red fire hydrant van gogh style,”

an image of a red fire hydrant
AI-generated image of a fire hydrant by DALL-E 2

Of the two tools discussed here, I had better results with DALL-E 2. Plus there is the added bonus of not having to process the images in an image-editing software to get them to the resolution I need. Both were free to use, although Canva has some terms of use you may want to check out if you are using the images you create on their site for work.

Share

Using AI to Generate Images for Public Works

globe with an image of a tree growing in the ground and birds flying and mountains in the back all in a primarily green color.

As noted in the last blog post, Can AI Help Fix a Water Main Break?, Artificial Intelligence or AI may eventually be a really useful tool for those of us working in the public works field. In this post, I’m exploring its use as a tool to generate images. This is helpful because many of us are often tasked with either developing reports, newsletters, notices, or presentations. And with copyright laws, it can sometimes be a struggle to find a non-copyrighted image we want to illustrate our writing. Some of us may be talented in creating our own graphics, but that can take a lot of time even if we have those skills.

AI can help solve many of these challenges. It can generate an image very quickly. And if the first one doesn’t quite work, we can quickly generate another one. Or we can generate a few and ask colleagues for their opinion on which is best to use.

Regarding copyright, I have seen a lot of comments online suggesting this is a problem. But a recent publication in the Federal Register by the Library of Congress U.S. Copyright Office seems to indicate at least in the United States images generated by an AI from text cannot be copyrighted. Here’s some of the findings noted in that publication:

  • In the Office’s view, it is well established that copyright can protect only material that is the product of human creativity.
  • …when an AI technology receives solely a prompt from a human and produces complex written, visual, or musical works in response, the ‘‘traditional elements of authorship’’ are determined and executed by the technology—not the human user.
  • these prompts function more like instructions to a commissioned artist—they identify what the prompter wishes to have depicted, but the machine determines how those instructions are implemented in its output.

The publication goes on to indicate if a human takes the AI-generated image and either modifies it or uses it in a creative composition that is determined to now be developed by a human, then that final work may be considered to be copyrighted. There are instructions in the publication of how to submit for registration of copyright of an AI-generated work that a human has modified.

However, I am not an attorney and would suggest running this entire question of copyright by your own legal department for verification.

If you get the green light from legal to use AI to generate images to help you at work, your next question may be where is this magical AI tool and how do I use it? Over the past few months, I have tried some AI tools that I’ll share with you starting with my favorite one below:

Midjourney: so far this has become my favorite image-generating AI. Although at first it was a little uncomfortable for me to get used to. I still find their website a little scary when it initially comes up because they have a lot of changing text that makes me immediately think I accidently clicked a site that will hack me. Also, their website is not where their AI tool is found. Instead you join their Discord group and generate images there.

So if you are not used to Discord that may also be initially uncomfortable until you get used to the interface. Fortunately I have been using Discord for a while now so that part was not so bad for me. You can access and join Discord through the Discord website, and you can download and install the Discord app on your computer from their website. From the Discord site you can search for the Midjourney Discord server. Here is the direct link: Midjourney Discord Link which should bring up the site shown in the image below. From there you can join the Midjourney Discord Server by clicking the button which says “Join Server.”

Screenshot of the Midjourney community site on Discord

Once you get to the Midjourney Discord Server, as I said it can be overwhelming if you are not used to the interface. The quickest way to try it out is to click in the column on the left on a newbies listing in one of the Newcomer Rooms. The image below shows the result of me clicking on newbies-102. It shows a running result of the images generated by Midjourney based on text people have input.

Screenshot of the Midjourney Discord Server showing images generated by the AI tool

You can see the text inputs vary from a very simple description to more complex. Midjourney wll generate four different images for each request. You can further refine a specific image by clicking the boxes underneath. The U boxes are for upscale an image while the V boxes ask the tool to vary it. I created the image at the start of this blog post in Midjourney with the following text prompt: /imagine a scene showing nature in an earth day theme using green as a predominant color. There are a lot of sites online which offer guidance on how to use the tool, and Midjourney has a Quick Start Guide where you can get more detailed information than I have included here.

I will follow up with the other AI image-generating tools I have tried in the next blog posts.

Share