A lot of these
are possible, or a reasonable facsimile; it's more that there's not nearly enough market to serve it.
For starters, consider accessibility tools like text-to-speech and voice recognition. These are available in the operating system, or with affordable add-ons.
Voice recognition has historically been very janky (interestingly, it's been possible since the 90s at least), but in the last decade or so it's reached accuracy good enough for mainstream use. Though the best ones (Siri, Alexa, Google) still "phone home" to a remote server ("the cloud") -- where extra processing power is available to generate the best results ... and also log your conversation for posterity.
I'm not sure what capability is available on PC, without all those privacy issues, and give or take at what price point. Probably something still usable and not very frustrating?
Image recognition is definitely there. Recognizing humanoid figures in video feeds for example has, what, >95% classifier accuracy? With a bit more training, it can probably identify you personally, too. It's... not trivial to set up, but can technically be done in the comfort of ones' own home, given the skill to do it; the tools are free and out there(!). Perhaps there's a precooked tool that can do a more streamlined/automatic build given a training set? The catch is, I'm not sure you can train such a system on say, just a few portraits -- usually it takes thousands of examples. (Or minutes of video -- amounting to as many still frames!)
Or something of a shortcut can be used, like biometrics (fingerprints, facial dimensions, eye scan, etc.), if you don't mind. There are tools already on the market, for example fingerprint scanners for automatic login. Perhaps this would be an adequate substitute -- your computer might not recognize you on sight because it's blind, but it can recognize your touch (distinguish fingerprints), much as a similarly blind human might, eh?
The rest is much as others have said above. Note that, getting meaningful interactions, is very, very far from trivial; what happens if the camera misidentifies you? Should it identify at high frequency (i.e. categorize each individual frame of video -- that may seem overly aggressive, but this is the lazier way to program it, y'see), or build a list of possible candidates ranked by confidence? Should that list be updated when so notified ("good morning Bob!" "I'm not Bob I'm Jerry." "Oh, I'm sorry Jerry. How are you this morning?")? What if someone else walks in and starts talking, should it mind its own business, should it welcome them too, --- you see, it very, very quickly spirals into unimaginable complexity, so it's not at all easy to do.
On the matter of convincing conversation, computers are getting there -- but it's still very much based on machine learning as far as I know, and as soon as you stray from the beaten path, they make very jarring errors, such as obvious mischaracterizations of the objects of sentences, or coherence errors seemingly trailing off into irrelevant subjects. And if you expect them to initiate small-talk, too, good luck... Grammar at least is pretty well solved I think, no small feat given the sheer complexity of natural language.
As for spam filters, probably a lot of this is already going on transparently and you don't even realize it! Mail systems have quite a lot of filtering already. This is a tricky one, because, do you
really trust an algorithm -- or worse still, an utterly inscrutable machine-learning system -- to identify important emails, and discard fluff, at nearing 100% accuracy? What if you get an important medical document, or job notice, or legal service or something like that, and it drops it?
Actually, let's not fool ourselves... humans are far from 100% accurate at most things they do. I wonder what miscategorization rate would be considered embarrassing for a secretary? Does it matter that the computer can't accept blame, nor give an apology, when caught?
To summarize, you basically want a computer secretary, right? There are definitely some things you can do right now, which will make things easier in that regard. We're certainly not there yet, overall; so if you truly demand the full suite of capabilities that only a human can provide, I'm afraid you have little choice but to hire a real one. Which, you definitely can, they're available for rent in your nearest classifieds...
Or, you can hire a modest sized team of very smart humans, to make your very dumb computer, somewhat less dumb -- but I'm guessing that's even less financially viable. Again, the ultimate problem is: not so much whether it's possible, but whether it's worth creating. Alas, it seems this particular combination of problems just hasn't been judged profitable yet.
Tim