About the prompt rules #8

zhouwy19 · 2022-01-10T13:40:06Z

Great work!

I would like to know what kind of sentences are reasonable and valid for CLIP. Are there any specific prompt rules for style sentences in your paper?

I tried the prompt 'an image of a car of wood' on an input car mesh.

init:

final:

What I want is:

However, the geometry of the car became disorganized and even self-intersecting, the original shape was no longer recognizable, and the surface did not take on a wood texture. Where might the problem be?

I hope you can answer my two questions: the prompt rules and the poor result. Thank you very much.

Best.

roibaron · 2022-01-10T21:55:34Z

Hey @zhouwy19, great question!
Generally speaking, we don't know how the CLIP landscape looks, but repetitive patterns, such as wood, should be the easiest to achieve.

The main issue that I see with your setting is the alignment. You should aim to capture a more meaningful view as your frontview. See subsection 3.3 in the paper (anchor view).

There also could be an issue with the resolution of the mesh you are using, which seems to be slightly insufficient. How many vertices are there? Can you share the .obj file?

I believe that alone should solve your problem, but keep in mind that self-intersections are an inherent problem in inverse rendering (and in any pipeline that optimizes 3D geometry based on 2D views). The classical solution is to introduce a regularization term in the form of Laplacian energies.

Good luck!
Roi
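
As an illustration of the Laplacian regularization mentioned above, here is a minimal sketch of a uniform Laplacian energy: the sum of squared distances between each vertex and the mean of its neighbors. The function name `uniform_laplacian_energy` is hypothetical and is not taken from the repository. The sketch uses NumPy for clarity; inside an actual optimization loop the same expression would have to be written with differentiable tensor operations and added to the loss with a weight you would need to tune.

```python
import numpy as np

def uniform_laplacian_energy(vertices, faces):
    """Sum over vertices of ||v - mean(neighbors of v)||^2 (uniform weights)."""
    vertices = np.asarray(vertices, dtype=float)
    faces = np.asarray(faces, dtype=int)

    # Collect the undirected edges of all triangles and drop duplicates.
    edges = np.concatenate([faces[:, [0, 1]], faces[:, [1, 2]], faces[:, [2, 0]]])
    edges = np.unique(np.sort(edges, axis=1), axis=0)

    # Accumulate, for every vertex, the sum of its neighbors and its degree.
    neighbor_sum = np.zeros_like(vertices)
    degree = np.zeros(len(vertices))
    for a, b in ((0, 1), (1, 0)):
        np.add.at(neighbor_sum, edges[:, a], vertices[edges[:, b]])
        np.add.at(degree, edges[:, a], 1.0)

    # Isolated vertices get degree 1 to avoid division by zero.
    degree = np.maximum(degree, 1.0)
    delta = vertices - neighbor_sum / degree[:, None]
    return float((delta ** 2).sum())
```

A low energy means every vertex sits close to the average of its neighbors, so penalizing this term discourages the spiky, folded surfaces that tend to produce self-intersections.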

zhouwy19 · 2022-01-11T05:13:56Z

Thank you very much for your quick reply!

I could not find any code for finding the anchor view. I see that frontview_center is set to [0,0] in the code. Are all the .obj files you provide aligned so that the view with the highest CLIP score is at [0,0]?

zhouwy19 · 2022-01-11T05:20:49Z

Oh, I see different settings in the shell files, such as '--frontview_center 1.96349 0.6283'. So it seems to be set differently for each mesh?

roibaron · 2022-01-11T05:28:55Z

You are right, we didn't share a script for finding this view.

The easiest way to set it up is to rotate the mesh with an external editor.

Alternatively, you can iterate over views to find one with a high CLIP score, given a prompt like 'an image of a car'.

Roi
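
The view search suggested above could be sketched roughly as follows. This is not the authors' script (none was shared). It assumes that a view is described by an azimuth and an elevation in radians, which is only a guess at what the two `--frontview_center` values mean, and it takes a hypothetical `score_fn(azim, elev)` callable that you would have to write yourself: it should render the mesh from that view and return the CLIP similarity between the rendering and the prompt.

```python
import math

def find_anchor_view(score_fn, n_azim=16, n_elev=5, max_elev=math.pi / 3):
    """Grid-search (azimuth, elevation) and return (best_score, azim, elev).

    score_fn is a user-supplied (hypothetical) callable that renders the mesh
    from the given view and returns a CLIP similarity; higher is better.
    """
    best = None
    for i in range(n_azim):
        azim = 2.0 * math.pi * i / n_azim
        for j in range(n_elev):
            # Spread elevations evenly over [-max_elev, +max_elev].
            if n_elev > 1:
                elev = -max_elev + 2.0 * max_elev * j / (n_elev - 1)
            else:
                elev = 0.0
            score = score_fn(azim, elev)
            if best is None or score > best[0]:
                best = (score, azim, elev)
    return best
```

The returned angles could then be tried as the two `--frontview_center` values, provided the assumption about their meaning holds for your setup. A coarse grid is usually enough as a starting point, since the goal is a recognizable view rather than an exact optimum.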

zhouwy19 · 2022-01-11T05:34:33Z

Thank you.

I would like to ask, for such a mesh, how should I set the frontview_center to get a picture with a horizontal perspective like the one below?

And compared with the following, which one is better as an anchor view? The following one shows some information on the car.

roibaron · 2022-01-11T05:39:27Z

My intuition says that a car facing the ground is harder to capture. I would suggest a front-facing mesh.

zhouwy19 · 2022-01-11T05:41:27Z

How can I rotate a car facing the ground to a front facing mesh?

ojmichel · 2022-01-11T20:07:16Z

In MeshLab you can use the rotation filter. You will want to see the front of the car when looking down the -x axis. Also, this mesh has large triangles which will make the results worse. To fix this, you can use the "Remeshing: Isotropic Explicit Remeshing" filter in MeshLab.
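
If you would rather script the rotation than use MeshLab, a minimal sketch with NumPy is shown below. The helper names `rotation_matrix` and `rotate_obj_text` are hypothetical, and the axis and angle you need depend entirely on how your mesh is currently oriented; the thread only states that the front of the car should be visible when looking down the -x axis. The sketch rotates vertex positions (`v`) and normals (`vn`) and leaves all other lines of the .obj text unchanged. It does not remesh large triangles.

```python
import numpy as np

def rotation_matrix(axis, degrees):
    """Right-handed rotation about the 'x', 'y', or 'z' axis."""
    t = np.radians(degrees)
    c, s = np.cos(t), np.sin(t)
    if axis == "x":
        return np.array([[1, 0, 0], [0, c, -s], [0, s, c]])
    if axis == "y":
        return np.array([[c, 0, s], [0, 1, 0], [-s, 0, c]])
    if axis == "z":
        return np.array([[c, -s, 0], [s, c, 0], [0, 0, 1]])
    raise ValueError("axis must be 'x', 'y', or 'z'")

def rotate_obj_text(obj_text, axis, degrees):
    """Return the .obj text with all 'v' and 'vn' entries rotated."""
    R = rotation_matrix(axis, degrees)
    out = []
    for line in obj_text.splitlines():
        parts = line.split()
        if parts and parts[0] in ("v", "vn"):
            xyz = R @ np.array([float(p) for p in parts[1:4]])
            # Keep any trailing fields (e.g. per-vertex colors) as they are.
            fields = [parts[0]] + [f"{c:.6f}" for c in xyz] + parts[4:]
            out.append(" ".join(fields))
        else:
            out.append(line)
    return "\n".join(out) + "\n"
```

For example, reading a file with `open(path).read()`, passing the text through `rotate_obj_text(text, "z", 90)`, and writing the result to a new file gives a rotated copy that you can inspect in any viewer before running the optimization.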

DaichenWang · 2022-06-08T19:51:24Z

Hello, do you know how to import your own .obj file? I am also trying out this project. I am using Kaggle, but it seems I cannot import my own .obj.

wimmerth · 2023-05-15T09:19:53Z

> You are right, we didn't share a script for finding this view.
>
> The easiest way to set it up is to rotate the mesh with an external editor.

Hey @roibaron,
I would be interested in how you used an external editor to find the view with the highest CLIP alignment. Is there an editor with some kind of CLIP plugin that you used? In the end you still need to run inference on the rendered view with your model in Python, don't you? Did you automate the process of finding the anchor view? If so, can you share how?
