You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The problem is, for a valid HTTP or HTTPS URL, it must contain the scheme http:// or https://. And / is not a reserved character on most Linux file systems. That is, / cannot appear in the file name, or in the url passed to check_url_content.
The implication is,
Suppose I do have a directory with infiles whose file names are some kind of 'URL's. They won't be valid HTTP(s) URL. And thus they won't be treated correctly.
I think maybe this python module is not completely/correctly implemented. Correct me if I'm wrong.
P.S. From the commit message aef3399, it reads that
The --infile argument now allows a url, file, or directory of files to be passed as the input.
So probably, it was never intended to deal with a directory of URLs.
The text was updated successfully, but these errors were encountered:
@lebensterben This is a feature we started to implement for autospec but ultimately, we never used it. So, if you see any problems with the docs or code relevant to the feature, that is why.
It's documented that
autospec
could accept a directory of URLs as argument to--infile
, which is handled byinfile_reader
function:autospec/autospec/infile_handler.py
Lines 130 to 140 in b110b31
But I cannot understand how this could work.
infile_reader
callsfile_handler
, which in turn callscheck_url_content
:autospec/autospec/infile_handler.py
Lines 146 to 147 in b110b31
autospec/autospec/infile_handler.py
Lines 112 to 114 in b110b31
check_url_content
sends a HEAD request to theurl
, which is the file name of a infile in a given directory.autospec/autospec/infile_handler.py
Line 64 in b110b31
The problem is, for a valid HTTP or HTTPS URL, it must contain the scheme
http://
orhttps://
. And/
is not a reserved character on most Linux file systems. That is,/
cannot appear in the file name, or in theurl
passed tocheck_url_content
.The implication is,
I think maybe this python module is not completely/correctly implemented. Correct me if I'm wrong.
P.S. From the commit message aef3399, it reads that
So probably, it was never intended to deal with a directory of URLs.
The text was updated successfully, but these errors were encountered: