It looks like PatentsView is missing the definitions for some of the CPC codes. For example, the code B32B has no definition in the CPC group bulk data file, although a definition exists for this group in the official XML scheme. I count 30 groups with missing definitions out of the 656 groups.
I don't see anything obvious in the XML or in the parsing code (relevant snippet shown below) that would indicate where the problem is coming from. Sorry.
If I run locally with that line changed to text_class = [t.text for t in text_need], I get a title for B32B, and none of the other groups are missing their titles. I don't know the intent of that line, but its effect was to erase titles that contained I.E. or E.G.
Generated from CPCSchemeXML201802.zip: cpc_group.tsv.zip
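To illustrate the effect described above, here is a minimal sketch. It is not the actual parser code: the XML fragment is a hypothetical, simplified stand-in for the CPC scheme file (element names and abbreviated titles are assumptions), and `drops_abbreviations` is a hypothetical filter that only mimics the reported symptom. The names `text_need` and `text_class` are taken from the changed line.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified fragment. The real CPC scheme XML may be
# structured differently, and the titles here are abbreviated.
SAMPLE = """
<class-scheme>
  <classification-item>
    <classification-symbol>B32B</classification-symbol>
    <class-title><title-part><text>LAYERED PRODUCTS, i.e. PRODUCTS BUILT-UP OF STRATA</text></title-part></class-title>
  </classification-item>
  <classification-item>
    <classification-symbol>A01B</classification-symbol>
    <class-title><title-part><text>SOIL WORKING IN AGRICULTURE OR FORESTRY</text></title-part></class-title>
  </classification-item>
</class-scheme>
"""


def drops_abbreviations(text):
    # Hypothetical stand-in for the original filter, NOT the real code:
    # it rejects any title text containing "i.e." or "e.g.".
    upper = text.upper()
    return "I.E." not in upper and "E.G." not in upper


def keep_all(text):
    # Equivalent to the proposed change: keep every text node.
    return True


def titles(xml_text, keep):
    """Map each group symbol to its title, keeping only text accepted by `keep`."""
    root = ET.fromstring(xml_text)
    result = {}
    for item in root.iter("classification-item"):
        symbol = item.findtext("classification-symbol")
        text_need = item.findall("./class-title/title-part/text")
        text_class = [t.text for t in text_need if keep(t.text)]
        result[symbol] = "; ".join(text_class)
    return result


filtered = titles(SAMPLE, drops_abbreviations)
unfiltered = titles(SAMPLE, keep_all)

# Groups whose title came out empty under the filtering variant.
missing = [symbol for symbol, title in filtered.items() if not title]
print(missing)
```

Under these assumptions, only the group whose title contains "i.e." ends up with an empty title in the filtered run, while the unfiltered list comprehension keeps every title.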
https://github.com/CSSIP-AIR/PatentsView-DB/blob/30ae3cbc3e7a02c46ef64fd8fd2c2ac9bfceb250/Scripts/Raw_Data_Parsers/uspto_parsers/cpc_class_tables.py#L32-L43