Skip to content

Autogen Agents on Structed Data analysis!!! #1910

Answered by ekzhu
mraguth asked this question in Q&A
Discussion options

You must be logged in to vote

The main challenge of structured data is scale and schema understanding.
To make things easier you can start with small datasets that can fit into the context of GPT model, and common schema on well-known domains like movies, songs, people, etc.
Just feed the data into the model and see what they can do, combine with code writing and execution.

More interesting, now your data is huge, can't fit inside the context. You would need to think about what information do you give to the agent? Schema, sample data, etc. Different agents for different stage of your pipeline, e.g., data cleaning agent, data validation agent, data science agent, visualization agent, etc.

Even more interesting and ver…

Replies: 3 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@mraguth
Comment options

Comment options

You must be logged in to vote
0 replies
Answer selected by mraguth
Comment options

You must be logged in to vote
1 reply
@mraguth
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
4 participants