-
Notifications
You must be signed in to change notification settings - Fork 11
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Discovery: Determine how v2 Content Libraries can make use of Learning Core #30
Comments
Luckily there is not yet much content in v2 libraries anywhere, except for LX which we no longer need to support. So in practice there may not be much data to migrate - just whatever people create in the future after the initial edx.org release of v2. I believe we decided to limit such content to a few basic block types, so there should be no hierarchical content at all (no units) which greatly simplifies things. |
@bradenmacdonald @ormsbee I think MIT Open Learning might be using blockstore. We built a big canvas<->Open edX LTI which leveraged Libraries v2. As far as I know, though, they may be the only ones who user it and hadn't already planned to move off of it. |
Thanks @Kelketek, good point. |
It looks like @connorhaugh is planning to migrate edx.org from v1 to v2 soon, so what I said may no longer apply. https://discuss.openedx.org/t/migrating-v1-to-v2-content-libraries/9637 |
@bradenmacdonald we are currently forming plans to do so. Hoping to learn more context on learning-core storage backend, I was under the impression it was using blockstore "infrastructure." In addition, I want to know more about the implied "dual" role of having two kinds of content libraries going at once? |
@connorhaugh The big thing we learned using Blockstore for LabXchange was that storing the actual OLX data on S3 object storage rather than in MySQL/MongoDB introduced too much latency and made the performance of the system too slow. So with Learning Core we are moving to storing most OLX data in MySQL blob fields. If you want to migrate all libraries to v2 now, it's totally fine with me because blockstore performance is sufficient for libraries, but it's definitely not suitable for courses, which is part of the reason we're focusing development efforts on Learning Core. Using v2 library content in today's courses is also fine because doing so copies the content back into modulestore so performance is reasonable.
v1 and v2 libraries as they exist today can happily coexist, much like split mongo and "old" mongo did for a long time. Of course I want to avoid prolonging this situation so hopefully we can migrate everything to a single version at some future point. We do plan to switch the storage backend for v2 content libraries from blockstore to learning core. I don't know how difficult that migration will be but hopefully not too difficult as those will be more similar than modulestore->blockstore, and as it's only changing the library storage backend, it may not require any updates to the courses where the content is being used, unlike the v1->v2 migration. We may do a cutover of the backend or support both backends in parallel for a time; that's all tbd. |
ContextThe current version of Content Libraries (v2) uses the blockstore bundles to store the content of blocks. The blockstore api is used to perform the different operations and is used in api.py, library_bundle.py, and library_index.py. Learning-core has the Component model; which is used to store the metadata, the Content model; which is used to store content of the component, and the ComponentVersion model; which stores a version of the component with its different contents. Whether it's feasible to migrate data over from it's current blockstore format to the data models here.Taking into account the previous context, the learning-core data models support versioning, so it is feasible to migrate the data from the There is some data that should be discussed how to save it with learning-core:
What is necessary to add to the current data models to make it suitable for tagging related work?Based on the current state of this repository and the discoveries made about tagging, I don't see a need for the current data models to be modified. But I can put possible scenarios of how the tagging could be implemented here:
How can this work be done in a way that does not disrupt the current release plans around Content Libraries. We are not trying to fit this in before the initial edx.org release of this feature, but trying to slot it in-between that release and the tagging related release (dev work has not started on that yet).Regarding the content libraries, we can generate a second Python api like the one for blockstore, but using learning-core, in such a way that we can replicate each of the functions. And modify the views so that one or the other api is used depending on a value in an environment variable. So we can easily switch from blockstore to learning-core. In addition, we can gradually implement functions for the migration from blockstore to learning-core within content libraries. Regarding the tagging, it will depend a lot on how the architecture of this service is going to be. Since it may be possible to save the tags inside the blockstore as one more file or inside learning-core as one more |
@bradenmacdonald @ormsbee @giovannicimolin @jmakowski1123 This is ready for review |
@ChrisChV Nice discovery!
I think this sort of metadata can go into
Yup, agreed. Model fields to store tags will be built into the new tagging app.
@ormsbee @bradenmacdonald Do we need a "link" model to reference external components? My understanding was that this could be done directly, though I still don't understand how composite components (units) will work on learning-core.
@bradenmacdonald So both OLX and block files will be stored using the same fields? Won't that make querying OLX slow? Not sure there will ever be an use case for that, but this data table will be huge and might impact MySQL's performance, no? |
Yes, you are right, I have updated the discovery adding this option 👍
Fields that should go inside the core data models, the essentials are covered. Any other field I think it should go as a plugin that extends the data models. |
Some thoughts: For
I don't think we need description necessarily. But if we do, then we can add it in to the learning core models. Same with slug, do we actually need it? (Maybe). Let's just make sure we need (and will use) things before we copy them over.
I thought Learning Core itself provides a python API, so maybe we don't need to wrap it in another API layer?
Yes, that is a workable approach. If possible though I would prefer to migrate everything at once, or at least try to do the migration very quickly so that it doesn't drag on for a long time. I don't want to have to support both blockstore and learning core at the same time any longer than we have to. As for how to integrate tagging, I'll think about that more and comment on it later. |
@ChrisChV, @bradenmacdonald: Now that #41 has landed, I think it makes sense for tagging to make foreign keys to the PublishableEntity and PublishableEntityVersion models, which would make them generally applicable to any published content that we'll make going forward (e.g. Components, Units, etc.) |
Closing this discovery, as we're already well into implementation: |
In particular focus on:
The text was updated successfully, but these errors were encountered: